- 21 Nov, 2025 25 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
Julien Denize authored
Signed-off-by:
Julien Denize <julien.denize@mistral.ai> Signed-off-by:
Julien Denize <40604584+juliendenize@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
sfbemerk authored
Signed-off-by:
Benjamin Merkel <benjamin.merkel@tngtech.com> Co-authored-by:
Benjamin Merkel <benjamin.merkel@tngtech.com> Co-authored-by:
Chauncey <chaunceyjiang@gmail.com>
-
who who who authored
Signed-off-by:fsx950223 <fsx950223@outlook.com>
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Huamin Li authored
Signed-off-by:
Huamin Li <3ericli@gmail.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Canlin Guo authored
Signed-off-by:gcanlin <canlinguosdu@gmail.com>
-
Chenheli Hua authored
Signed-off-by:Chenheli Hua <huachenheli@outlook.com>
-
Alex Brooks authored
Signed-off-by:Alex-Brooks <Alex.Brooks@ibm.com>
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Boyuan Feng authored
Signed-off-by:Boyuan Feng <boyuan@meta.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
jeremyteboul authored
Signed-off-by:
Jeremy Teboul <jeremyteboul@fb.com> Co-authored-by:
Jeremy Teboul <jeremyteboul@fb.com>
-
zhrrr authored
Signed-off-by:
zhuhaoran <zhuhaoran.zhr@alibaba-inc.com> Signed-off-by:
izhuhaoran <izhuhaoran@qq.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Hongxia Yang authored
Signed-off-by:Hongxia Yang <hongxia.yang@amd.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Xiao Li authored
[AITER] [ROCm] Fix crash when loading llama4 model with old aiter version installed, fallback to forward_native implementation (#29124) Signed-off-by:Xiao Li <ilx@meta.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Wentao Ye authored
[Feature] Shared Experts Overlap with FI deepgemm swap kernel, 2.2% throughput improvement and 3.6% TTFT improvement (#28879) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 20 Nov, 2025 15 commits
-
-
Driss Guessous authored
Signed-off-by:drisspg <drisspguessous@gmail.com>
-
Software Developer authored
Signed-off-by:dsuhinin <suhinin.dmitriy@gmail.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
rookie authored
Signed-off-by:
zhangguozhu <zhangguozhu@360.cn> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
zhangguozhu <zhangguozhu@360.cn> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Fanli Lin authored
Signed-off-by:Fanli Lin <fanli.lin@intel.com>
-
Zhewen Li authored
Signed-off-by:zhewenli <zhewenli@meta.com>
-
Samit authored
Signed-off-by:
SamitHuang <285365963@qq.com> Signed-off-by:
Samit <285365963@qq.com> Signed-off-by:
samithuang <285365963@qq.com> Co-authored-by:
22quinn <33176974+22quinn@users.noreply.github.com>
-
Shinichi Hemmi authored
Signed-off-by:Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>
-
Or Ozeri authored
Signed-off-by:
Or Ozeri <oro@il.ibm.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Anna Shors authored
Signed-off-by:
ashors1 <ashors@nvidia.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
Dezhan authored
Co-authored-by:Dezhan Tu <dztu@meta.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Pleaplusone authored
[ROCm][BugFix] Fix shared expert loading error when disable `VLLM_ROCM_USE_AITER_FUSION_SHARED_EXPERTS` (#28633) Signed-off-by:ganyi <ygan@amd.com>
-
Cyrus Leung authored
Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-