- 12 Jan, 2026 2 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 11 Jan, 2026 3 commits
-
-
maang authored
Signed-off-by: maang <maang_h@163.com>
Signed-off-by: maang-h <55082429+maang-h@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Matt authored
Signed-off-by: Matthew Wong <Matthew.Wong2@amd.com>
-
Andy Liu authored
[MTP][GLM][Bugfix] Fixed .weight_scale loading logic that dropped MTP prediction accuracy with fp8+mtp (#32101)
Signed-off-by: Andy Liu <andyliu@roblox.com>
-
- 10 Jan, 2026 10 commits
-
-
RickyChen / 陳昭儒 authored
Signed-off-by: rickychen-infinirc <ricky.chen@infinirc.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Vensen authored
Signed-off-by: vensen <vensenmu@gmail.com>
-
gnovack authored
Signed-off-by: gnovack <gnovack@amazon.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
-
Jeremy Teboul authored
Signed-off-by: Jeremy Teboul <jeremyte@meta.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
maang authored
Signed-off-by: maang <maang_h@163.com>
-
Akshat Shrivastava authored
Signed-off-by: Roger Wang <hey@rogerw.io>
Co-authored-by: Roger Wang <hey@rogerw.io>
-
Lucas Kabela authored
Signed-off-by: Lucas Kabela <lucaskabela@meta.com>
-
Russell Bryant authored
Signed-off-by: Russell Bryant <rbryant@redhat.com>
-
- 09 Jan, 2026 16 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
jiahanc authored
Signed-off-by: jiahanc <173873397+jiahanc@users.noreply.github.com>
-
Runkai Tao authored
Signed-off-by: Runkai Tao <rt572@physics.rutgers.edu>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
-
Jeremy Teboul authored
Signed-off-by: Jeremy Teboul <jeremyte@meta.com>
-
Wentao Ye authored
[Perf] Optimize cutlass moe problem size calculation, 5.3% E2E Throughput improvement, 2.2% TTFT improvement (#31830)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Shanshan Shen authored
Signed-off-by: shen-shanshan <467638484@qq.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
-
maang authored
Signed-off-by: maang <maang_h@163.com>
-
Xin Yang authored
Signed-off-by: Xin Yang <xyangx@amazon.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Alex Brooks authored
Signed-off-by: Alex-Brooks <Alex.Brooks@ibm.com>
-
Robert Shaw authored
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
-
Robert Shaw authored
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Signed-off-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Yongye Zhu authored
[MoE Refactoring][Bugfix]Wrap WNA16 Triton kernel into mk and change compressed tensor kernel selection (#31752)
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
-
- 08 Jan, 2026 9 commits
-
-
Lucas Wilkinson authored
[Misc] Fix `Current vLLM config is not set.` warnings, assert to avoid issues in the future (#31747)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Dipika Sikka authored
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
Michael Goin authored
-
danisereb authored
Signed-off-by: Daniel Serebrenik <daserebrenik@nvidia.com>
-
yxing-bj authored
Signed-off-by: yxing <yxing@iquestlab.com>
-
Ce Zhao authored
Signed-off-by: <>
Signed-off-by: 赵策 <alcor@zhaocedeMacBook-Air.local>
Signed-off-by: 赵策 <alcor@mac.mynetworksettings.com>
Co-authored-by: 赵策 <alcor@mac.mynetworksettings.com>
-
tianshu-Michael-yu authored
Signed-off-by: Tianshu Yu <tianshuyu.formal@gmail.com>
Signed-off-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Bijaya Dangol authored
Signed-off-by: dangoldbj <dangoldbj23@gmail.com>
-