- 11 Dec, 2025 4 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
gh-wf authored
Signed-off-by:Wayne Ferguson <wayneferguson@gmail.com>
-
Divakar Verma authored
Signed-off-by:
Divakar Verma <divakar.verma@amd.com> Co-authored-by:
Gregory Shtrasberg <156009573+gshtras@users.noreply.github.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>
-
- 10 Dec, 2025 6 commits
-
-
Anker authored
Signed-off-by:
Lennart Brog <lennart.borg@list-ag.de> Signed-off-by:
Anker <20343812+anker-c2@users.noreply.github.com>
-
Lucas Wilkinson authored
[BugFix] Fix `AttributeError: 'MergedColumnParallelLinear' object has no attribute 'weight_scale'` (#30399) Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Roger Young authored
Signed-off-by:
xuebi <xuebi@minimaxi.com> Co-authored-by:
xuebi <xuebi@minimaxi.com>
-
Wilson Wu authored
Signed-off-by:
Wilson Wu <iwilsonwu@gmail.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
haoyangli-amd authored
Signed-off-by:Haoyang Li <lihaoyang0109@gmail.com>
-
ElizaWszola authored
-
- 09 Dec, 2025 15 commits
-
-
Charlie Fu authored
[Rocm][torch.compile] Adding layernorm + fp8 block quant and silu + fp8 block quant for Aiter (#25693) Signed-off-by:
charlifu <charlifu@amd.com> Signed-off-by:
Micah Williamson <micah.williamson@amd.com> Signed-off-by:
Charlie Fu <Charlie.Fu@amd.com> Co-authored-by:
Micah Williamson <micah.williamson@amd.com> Co-authored-by:
wuhuikx <hattie.wu@amd.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Gregory Shtrasberg <156009573+gshtras@users.noreply.github.com>
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
bnellnm authored
Signed-off-by:
Bill Nell <bnell@redhat.com> Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
Tsukasa OI authored
Signed-off-by:Tsukasa OI <floss_llm@irq.a4lg.com>
-
liuquan authored
Signed-off-by:
quanliu <18646313696@163.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Dongjie Zou authored
Signed-off-by:baonudesifeizhai <baonudesifeizhai@gmail.com>
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by:
wang.yuqi <noooop@126.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Tsukasa OI authored
[Model][Quantization] Restore MoE + GGUF models support (incl. Qwen3 MoE) by allowing Sideload Parameters (#30116) Signed-off-by:
Tsukasa OI <floss_llm@irq.a4lg.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
liangel-02 authored
Signed-off-by:Angel Li <liangel@meta.com>
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
czhu-cohere authored
Signed-off-by:czhu-cohere <conway.zhu@cohere.com>
-
Zhewen Li authored
Signed-off-by:
zhewenli <zhewenli@meta.com> Signed-off-by:
Zhewen Li <zhewenli@meta.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Ming Yang authored
Signed-off-by:Ming Yang <minos.future@gmail.com>
-
- 08 Dec, 2025 7 commits
-
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
Vasiliy Kuznetsov authored
Signed-off-by:vasiliy <vasiliy@fb.com>
-
shaharmor98 authored
Signed-off-by:
Shahar Mor <smor@nvidia.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
Daniel Cámpora authored
Signed-off-by:
Daniel Campora <961215+dcampora@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
wang.yuqi authored
[Model][7/N] Improve all pooling task | Deprecation as_reward_model. Extract hidden states prefer using new multi-vector retrieval API (#26686) Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
Dazhi Jiang authored
Signed-off-by:Dazhi Jiang <dazhi_jiang@163.com>
-
Zhiwei authored
Signed-off-by:ZhiweiYan-96 <zhiwei.yan@amd.com>
-
- 07 Dec, 2025 5 commits
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Signed-off-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
[Perf] Deepgemm fused layout kernel for activations, 4.3% throughput improvement, 10.7% TTFT improvement. (#29546) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Jinzhen Lin authored
Signed-off-by:
Jinzhen Lin <jinzhen.ljz@antgroup.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
Cyrus Leung authored
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 06 Dec, 2025 3 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Peter Salas authored
Signed-off-by:Peter Salas <peter@fixie.ai>
-