- 11 Aug, 2025 1 commit
-
-
Eugene Cheah authored
-
- 06 Aug, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
- 05 Aug, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
- 28 Jul, 2025 1 commit
-
-
Wentao Ye authored
[Bug] Enforce contiguous input for `dynamic_scaled_fp8_quant` and `static_scaled_fp8_quant` (#21773)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 18 Jul, 2025 1 commit
-
-
Shu Wang authored
Signed-off-by: shuw <shuw@nvidia.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
- 17 Jul, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 15 Jul, 2025 1 commit
-
-
Alexander Matveev authored
Signed-off-by: Alexander Matveev <amatveev@redhat.com>
-
- 09 Jul, 2025 1 commit
-
-
Tuan, Hoang-Trong authored
Signed-off-by: Tuan M. Hoang-Trong <tmhoangt@us.ibm.com>
Co-authored-by: Tuan M. Hoang-Trong <tmhoangt@us.ibm.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
- 07 Jul, 2025 1 commit
-
-
Yan Ma authored
Signed-off-by: yan <yan.ma@intel.com>
-
- 04 Jul, 2025 1 commit
-
-
Duncan Moss authored
Signed-off-by: Duncan Moss <djm.moss@gmail.com>
Co-authored-by: Duncan Moss <dmoss@nvidia.com>
-
- 02 Jul, 2025 1 commit
-
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
Signed-off-by: ElizaWszola <ewszola@redhat.com>
Co-authored-by: ElizaWszola <ewszola@redhat.com>
-
- 01 Jul, 2025 1 commit
-
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
- 28 Jun, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
- 27 Jun, 2025 1 commit
-
-
li haoyang authored
Signed-off-by: ilmarkov <imarkov@redhat.com>
Signed-off-by: Haoyang Li <Haoyang.Li@amd.com>
Co-authored-by: ilmarkov <imarkov@redhat.com>
-
- 26 Jun, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 25 Jun, 2025 1 commit
-
-
Eldar Kurtić authored
-
- 17 Jun, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
- 16 Jun, 2025 1 commit
-
-
Szymon Ożóg authored
Signed-off-by: SzymonOzog <szymon.ozog@gmail.com>
-
- 12 Jun, 2025 4 commits
-
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Luka Govedič authored
Signed-off-by: Luka Govedič <lgovedic@redhat.com>
Co-authored-by: Sage Moore <sage@neuralmagic.com>
-
rasmith authored
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
Ning Xie authored
Signed-off-by: Andy Xie <andy.xning@gmail.com>
-
- 07 Jun, 2025 1 commit
-
-
ElizaWszola authored
Signed-off-by: ElizaWszola <ewszola@redhat.com>
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
- 05 Jun, 2025 1 commit
-
-
Chiyue Wei authored
Signed-off-by: Chiyue Wei <chiyuew@nvidia.com>
Co-authored-by: Chiyue Wei <chiyuew@nvidia.com>
-
- 04 Jun, 2025 1 commit
-
-
Vadim Gimpelson authored
-
- 03 Jun, 2025 1 commit
-
-
Simon Mo authored
Signed-off-by: simon-mo <simon.mo@hey.com>
-
- 23 May, 2025 1 commit
-
-
Pavani Majety authored
[ModelOpt] Introduce VLLM_MAX_TOKENS_PER_EXPERT_FP4_MOE env var to control blockscale tensor allocation (#18160)
Signed-off-by: Pavani Majety <pmajety@nvidia.com>
-
- 14 May, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 13 May, 2025 1 commit
-
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-
- 11 May, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
-
- 09 May, 2025 1 commit
-
-
Pavani Majety authored
-
- 07 May, 2025 4 commits
-
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Yong Hoon Shin authored
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
-
Wanrui Dai authored
Signed-off-by: evian <eviantai@u.nus.edu>
Co-authored-by: evian <eviantai@u.nus.edu>
-
Szymon Ożóg authored
Signed-off-by: SzymonOzog <szymon.ozog@aleph-alpha.com>
Signed-off-by: SzymonOzog <szymon.ozog@gmail.com>
Signed-off-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
-
- 05 May, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
-
- 03 May, 2025 1 commit
-
-
rasmith authored
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
- 29 Apr, 2025 1 commit
-
-
Zhengyuan Su (苏政渊) authored
Signed-off-by: 苏政渊 <suzhengyuan@moonshot.cn>
Co-authored-by: 苏政渊 <suzhengyuan@moonshot.cn>
-
- 27 Apr, 2025 1 commit
-
-
Kaixi Hou authored
Signed-off-by: kaixih <kaixih@nvidia.com>
-
- 22 Apr, 2025 1 commit
-
-
Charlie Fu authored
Signed-off-by: charlifu <charlifu@amd.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
-