- 18 Dec, 2025 3 commits
-
-
Vasiliy Kuznetsov authored
Signed-off-by:vasiliy <vasiliy@fb.com>
-
Xin Yang authored
Signed-off-by:
Xin Yang <xyangx@amazon.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Bowen Bao authored
Signed-off-by:Bowen Bao <bowenbao@amd.com>
-
- 17 Dec, 2025 2 commits
-
-
Wentao Ye authored
[Bug] Fix AttributeError: 'ColumnParallelLinear' object has no attribute `weight_scale_inv` (#30823) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Yan Ma authored
Signed-off-by:Yan Ma <yan.ma@intel.com>
-
- 16 Dec, 2025 4 commits
-
-
Jinzhen Lin authored
Signed-off-by:
Jinzhen Lin <jinzhen.ljz@antgroup.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
jiahanc authored
Signed-off-by:jiahanc <173873397+jiahanc@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Ming Yang authored
Signed-off-by:Ming Yang <minos.future@gmail.com>
-
- 15 Dec, 2025 2 commits
-
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
- 14 Dec, 2025 3 commits
-
-
tjp_zju authored
Signed-off-by:tjp_zju <tanjianpingzju1990@gmail.com>
-
Shengliang Xu authored
Signed-off-by:
Shengliang Xu <shengliangx@nvidia.com> Co-authored-by:
Pavani Majety <pmajety@nvidia.com>
-
Didier Durand authored
Signed-off-by:
Didier Durand <durand.didier@gmail.com> Signed-off-by:
Didier Durand <2927957+didier-durand@users.noreply.github.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
- 13 Dec, 2025 1 commit
-
-
Roberto L. Castro authored
Signed-off-by:
LopezCastroRoberto <robertol.c510@gmail.com> Signed-off-by:
Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
- 12 Dec, 2025 7 commits
-
-
rasmith authored
[CI/Build][Kernel][BugFix][AMD] Fix per_token_group_quant_fp8 to use correct fp8 min/max values and update atol/rtol in test_quantfp8_group_functionality (#30292) Signed-off-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com>
-
danielafrimi authored
Signed-off-by:
Daniel Afrimi <dafrimi@pool0-00589.cm.cluster> Signed-off-by:
dafrimi <dafrimi@nvidia.com> Co-authored-by:
Daniel Afrimi <dafrimi@pool0-00589.cm.cluster> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
shivampr authored
Signed-off-by:
Shivam <shivampr.dev@gmail.com> Signed-off-by:
Shivam <shivamprasad91@gmail.com>
-
Christina Norman authored
Signed-off-by:
Christina <truffle@gmail.com> Signed-off-by:
Isotr0py <2037008807@qq.com> Signed-off-by:
Christina Norman <christina@example.com> Co-authored-by:
Isotr0py <isotr0py@users.noreply.github.com> Co-authored-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Bhanu Prakash Voutharoja authored
Signed-off-by:Bhanu068 <voutharoja.bhanu06@gmail.com>
-
jiahanc authored
Signed-off-by:jiahanc <173873397+jiahanc@users.noreply.github.com>
-
- 11 Dec, 2025 2 commits
-
-
Andrew Briand authored
Signed-off-by:
Andrew Briand <abriand@nvidia.com> Co-authored-by:
Andrew Briand <abriand@nvidia.com>
-
汪志鹏 authored
Signed-off-by:princepride <wangzhipeng628@gmail.com>
-
- 10 Dec, 2025 2 commits
-
-
Wilson Wu authored
Signed-off-by:
Wilson Wu <iwilsonwu@gmail.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
ElizaWszola authored
-
- 09 Dec, 2025 9 commits
-
-
Charlie Fu authored
[Rocm][torch.compile] Adding layernorm + fp8 block quant and silu + fp8 block quant for Aiter (#25693) Signed-off-by:
charlifu <charlifu@amd.com> Signed-off-by:
Micah Williamson <micah.williamson@amd.com> Signed-off-by:
Charlie Fu <Charlie.Fu@amd.com> Co-authored-by:
Micah Williamson <micah.williamson@amd.com> Co-authored-by:
wuhuikx <hattie.wu@amd.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Gregory Shtrasberg <156009573+gshtras@users.noreply.github.com>
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
bnellnm authored
Signed-off-by:
Bill Nell <bnell@redhat.com> Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Dongjie Zou authored
Signed-off-by:baonudesifeizhai <baonudesifeizhai@gmail.com>
-
Tsukasa OI authored
[Model][Quantization] Restore MoE + GGUF models support (incl. Qwen3 MoE) by allowing Sideload Parameters (#30116) Signed-off-by:
Tsukasa OI <floss_llm@irq.a4lg.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
czhu-cohere authored
Signed-off-by:czhu-cohere <conway.zhu@cohere.com>
-
Zhewen Li authored
Signed-off-by:
zhewenli <zhewenli@meta.com> Signed-off-by:
Zhewen Li <zhewenli@meta.com>
-
- 08 Dec, 2025 2 commits
-
-
Vasiliy Kuznetsov authored
Signed-off-by:vasiliy <vasiliy@fb.com>
-
Zhiwei authored
Signed-off-by:ZhiweiYan-96 <zhiwei.yan@amd.com>
-
- 07 Dec, 2025 2 commits
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Signed-off-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
[Perf] Deepgemm fused layout kernel for activations, 4.3% throughput improvement, 10.7% TTFT improvement. (#29546) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 06 Dec, 2025 1 commit
-
-
yuttian1 authored
Signed-off-by:yuttian1 <yuttian@amd.com>
-