- 03 Oct, 2025 8 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Pavani Majety authored
[Quantization/NVFP4] Speed up TRTLLM NVFP4 MOE weight loading and fix K/V scale loading for MLA Attn (#25968)
Signed-off-by: Pavani Majety <pmajety@nvidia.com>
-
Chendi.Xue authored
Signed-off-by: Chendi Xue <Chendi.Xue@intel.com>
Signed-off-by: Chendi.Xue <chendi.xue@intel.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
-
Paul Pak authored
Signed-off-by: Paul Pak <paulpak58@gmail.com>
-
Egor authored
Signed-off-by: Egor <e.a.krivov@gmail.com>
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Zhewen Li authored
Signed-off-by: zhewenli <zhewenli@meta.com>
-
- 02 Oct, 2025 3 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
ElizaWszola authored
Signed-off-by: ElizaWszola <ewszola@redhat.com>
Signed-off-by: ElizaWszola <elizaw.9289@gmail.com>
Signed-off-by: Luka Govedič <lgovedic@redhat.com>
Signed-off-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Luka Govedič <lgovedic@redhat.com>
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
-
- 01 Oct, 2025 2 commits
-
-
Jerry Zhang authored
Signed-off-by: Jerry Zhang <jerryzh168@gmail.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 30 Sep, 2025 7 commits
-
-
cjackal authored
[Llama4] [multimodal] Fix misplaced dtype cast of `cos_sin_cache` in `Llama4VisionRotaryEmbedding` (#25889)
Signed-off-by: cjackal <44624812+cjackal@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Pavani Majety authored
Signed-off-by: Pavani Majety <pmajety@nvidia.com>
-
Asaf Joseph Gardin authored
Signed-off-by: asafg <39553475+Josephasafg@users.noreply.github.com>
-
Yongye Zhu authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: NickLucche <nlucches@redhat.com>
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
Signed-off-by: Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>
Signed-off-by: Lucia Fang <fanglu@meta.com>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Co-authored-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Co-authored-by: yewentao256 <zhyanwentao@126.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
Co-authored-by: Lucia Fang <fanglu@meta.com>
Co-authored-by: NickLucche <nlucches@redhat.com>
Co-authored-by: Siyuan Fu <siyuanf@nvidia.com>
Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Xiaozhu Meng <mxz297@gmail.com>
Co-authored-by: Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 29 Sep, 2025 2 commits
-
-
Thomas Parnell authored
-
Lee Nau authored
Signed-off-by: Lee Nau <lnau@nvidia.com>
-
- 27 Sep, 2025 2 commits
-
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
- 26 Sep, 2025 9 commits
-
-
Bram Wasti authored
Signed-off-by: Bram Wasti <bwasti@meta.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Chih-Chieh Yang authored
Signed-off-by: Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com>
Co-authored-by: RishiAstra <40644327+RishiAstra@users.noreply.github.com>
-
Sage Moore authored
Signed-off-by: Sage Moore <sage@neuralmagic.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Tao He authored
[Qwen3-Next][GDN] fixes cuda graph capturing bug in GDN metadata and a stride bug in causal_conv_1d. (#25743)
Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>
-
xaguilar-amd authored
Signed-off-by: xaguilar <Xavier.AguilarFruto@amd.com>
-
Aleksandr Malyshev authored
Signed-off-by: Aleksandr Malyshev <maleksan@amd.com>
Co-authored-by: Aleksandr Malyshev <maleksan@amd.com>
Co-authored-by: Doug Lehr <douglehr@amd.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 25 Sep, 2025 5 commits
-
-
Shu Wang authored
Signed-off-by: Shu Wang. <shuw@nvidia.com>
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
XuruiYang authored
Signed-off-by: yangxurui <yangxurui@meituan.com>
Co-authored-by: yangxurui <yangxurui@meituan.com>
-
Saman A. Pour authored
Signed-off-by: Saman Keon <samanamp@outlook.com>
-
- 24 Sep, 2025 2 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Duncan Moss authored
Signed-off-by: Duncan Moss <djm.moss@gmail.com>
-