- 09 Jan, 2026 37 commits
-
-
Jeremy Teboul authored
Signed-off-by:Jeremy Teboul <jeremyte@meta.com>
-
Wentao Ye authored
[Perf] Optimize cutlass moe problem size calculation, 5.3% E2E Throughput improvement, 2.2% TTFT improvement (#31830) Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Andrew Xia authored
Signed-off-by:
lacora <hyelacora@gmail.com> Signed-off-by:
Andrew Xia <axia@fb.com> Co-authored-by:
lacora <hyelacora@gmail.com> Co-authored-by:
Andrew Xia <axia@fb.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Yifan Qiao authored
Signed-off-by:Yifan Qiao <yifanqiao@berkeley.edu>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Shanshan Shen authored
Signed-off-by:shen-shanshan <467638484@qq.com>
-
Kevin Šuc authored
Signed-off-by:Catacomba <kevinsuc16@gmail.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com>
-
R3hankhan authored
Signed-off-by:Rehan Khan <Rehan.Khan7@ibm.com>
-
maang authored
Signed-off-by:maang <maang_h@163.com>
-
Adolfo Victoria authored
Signed-off-by:
Adolfo Victoria <adolfokarim@gmail.com> Co-authored-by:
Adolfo Victoria <adovi@meta.com>
-
inkcherry authored
Signed-off-by:inkcherry <mingzhi.liu@amd.com>
-
Roger Wang authored
Signed-off-by:Roger Wang <hey@rogerw.io>
-
Andreas Karatzas authored
[ROCm][CI][V1] Fix `nixl_connector` test failure and achieve CUDA parity in `test_async_scheduling` (#32000) Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Sophie du Couédic authored
Signed-off-by:Sophie du Couédic <sop@zurich.ibm.com>
-
Bofeng Xue authored
Signed-off-by:
Bofeng BF1 Xue <xuebf1@Lenovo.com> Co-authored-by:
Bofeng BF1 Xue <xuebf1@Lenovo.com>
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
vllmellm authored
[Bugfix][ROCm]Fix Qwen3-Next-80B-A3B-Thinking inference and optimize non-standard block size (544) support under rocm_atten (#31380) Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Alex Brooks authored
Signed-off-by:Alex-Brooks <Alex.Brooks@ibm.com>
-
gnovack authored
Signed-off-by:
gnovack <gnovack@amazon.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
Divakar Verma authored
Signed-off-by:Divakar Verma <divakar.verma@amd.com>
-
RioS authored
Signed-off-by:
RioS <aa248424@gmail.com> Signed-off-by:
Ri0S <aa248424@gmail.com> Co-authored-by:
Chauncey <chaunceyjiang@gmail.com>
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
zhrrr authored
Signed-off-by:
zhuhaoran <zhuhaoran.zhr@alibaba-inc.com> Signed-off-by:
izhuhaoran <izhuhaoran@qq.com> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
daniel-salib authored
Signed-off-by:Daniel Salib <danielsalib@meta.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Max Hu authored
Signed-off-by:
Max Hu <maxhu@nvidia.com> Signed-off-by:
Max Hu <hyoung2991@gmail.com> Co-authored-by:
Max Hu <maxhu@nvidia.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Signed-off-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Yongye Zhu authored
[MoE Refactoring][Bugfix]Wrap WNA16 Triton kernel into mk and change compressed tensor kernel selection (#31752) Signed-off-by:
Robert Shaw <robshaw@redhat.com> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
- 08 Jan, 2026 3 commits
-
-
Lucas Wilkinson authored
[Misc] Fix `Current vLLM config is not set.` warnings, assert to avoid issues in the future (#31747) Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Dipika Sikka authored
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-