- 24 Nov, 2025 3 commits
-
-
Roger Wang authored
Signed-off-by:Roger Wang <hey@rogerw.io>
-
Zero authored
Signed-off-by:
lim4349 <rockmanzero@naver.com> Signed-off-by:
Zero <rockmanzero@naver.com> Co-authored-by:
Cloud User <ubuntu@a100-80g-4.novalocal> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 23 Nov, 2025 1 commit
-
-
jiahanc authored
Signed-off-by:
jiahanc <173873397+jiahanc@users.noreply.github.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 22 Nov, 2025 8 commits
-
-
Federico authored
-
ZiTian Zhao authored
Signed-off-by:
zitian.zhao <zitian.zhao@tencentmusic.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Bram Wasti authored
Signed-off-by:
Bram Wasti <bwasti@meta.com> Signed-off-by:
Bram Wasti <bwasti@fb.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Nandan Vallamdasu authored
Signed-off-by:
nandan2003 <nandan.vallamdasu@outlook.com> Signed-off-by:
Nandan Vallamdasu <nandan.vallamdasu@outlook.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
jinghanhu authored
-
FlintyLemming authored
Signed-off-by:FlintyLemming <admin@flinty.moe>
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
- 21 Nov, 2025 17 commits
-
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Julien Denize authored
Signed-off-by:
Julien Denize <julien.denize@mistral.ai> Signed-off-by:
Julien Denize <40604584+juliendenize@users.noreply.github.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
Lucas Wilkinson authored
[BugFix] Make sure to allocate worst case MoE workspace during profile run in the DP + EP case (#27426) Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Mingyuan Ma authored
Signed-off-by:
mingyuanm <mingyuanm@nvidia.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
rasmith authored
[CI/Build][Kernel][AMD] Move extra dim to after load in _fwd_kv_parallel in lighting_attn.py (#29132) Signed-off-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com>
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
Aleksandr Malyshev authored
Signed-off-by:
Aleksandr Malyshev <maleksan@amd.com> Co-authored-by:
Aleksandr Malyshev <maleksan@amd.com>
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Huamin Li authored
Signed-off-by:
Huamin Li <3ericli@gmail.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Hongxia Yang authored
Signed-off-by:Hongxia Yang <hongxia.yang@amd.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Wentao Ye authored
[Feature] Shared Experts Overlap with FI deepgemm swap kernel, 2.2% throughput improvement and 3.6% TTFT improvement (#28879) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 20 Nov, 2025 11 commits
-
-
Fanli Lin authored
Signed-off-by:Fanli Lin <fanli.lin@intel.com>
-
Zhewen Li authored
Signed-off-by:zhewenli <zhewenli@meta.com>
-
Shinichi Hemmi authored
Signed-off-by:Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Anna Shors authored
Signed-off-by:
ashors1 <ashors@nvidia.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
Dezhan authored
Co-authored-by:Dezhan Tu <dztu@meta.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Pleaplusone authored
[ROCm][BugFix] Fix shared expert loading error when disable `VLLM_ROCM_USE_AITER_FUSION_SHARED_EXPERTS` (#28633) Signed-off-by:ganyi <ygan@amd.com>
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
Isotr0py authored
Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Shengliang Xu authored
Signed-off-by:Shengliang Xu <shengliangx@nvidia.com>
-