- 04 Feb, 2026 3 commits
-
-
Frank Wang authored
Signed-off-by: frankwang28 <frank.wbb@hotmail.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Shanshan Shen authored
Signed-off-by: shen-shanshan <467638484@qq.com>
-
- 03 Feb, 2026 11 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Michael Goin authored
[Bugfix] Disable TRTLLM FP8 MoE if router_logits_dtype==float32 and routing_method!=DeepSeekV3 (#33613)
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Vadim Gimpelson authored
Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
Patrick von Platen authored
Signed-off-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Shanshan Shen authored
Signed-off-by: shen-shanshan <467638484@qq.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
zxy authored
Signed-off-by: zxy <zhou0493@e.ntu.edu.sg>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Song Zhixin authored
Signed-off-by: jesse <szxfml@gmail.com>
Signed-off-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Michael Goin authored
[Bugfix] Disable RoutingMethodType.[Renormalize,RenormalizeNaive] TRTLLM per-tensor FP8 MoE (#33620)
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Kunshang Ji authored
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
-
Shengliang Xu authored
Signed-off-by: Shengliang Xu <shengliangx@nvidia.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
- 02 Feb, 2026 10 commits
-
-
Patrick von Platen authored
Signed-off-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Vasiliy Kuznetsov authored
Signed-off-by: vasiliy <vasiliy@fb.com>
-
Yang Liu authored
Signed-off-by: Yang <lymailforjob@gmail.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
danielafrimi authored
Signed-off-by: dafrimi <dafrimi@nvidia.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Borushiki authored
Signed-off-by: Borushiki <38628261+Otsutsukii@users.noreply.github.com>
-
Grzegorz K. Karch authored
Signed-off-by: Grzegorz Karch <gkarch@nvidia.com>
-
RED authored
Signed-off-by: liuli <ll407707@alibaba-inc.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: liuli <ll407707@alibaba-inc.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
csy0225 authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
Co-authored-by: i-zhangmingming <i-zhangmingming@stepfun.com>
Co-authored-by: xiewuxun <xiewuxun@stepfun.com>
Co-authored-by: zetaohong <i-hongzetao@stepfun.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 01 Feb, 2026 5 commits
-
-
will b. authored
Signed-off-by: Eduardo Salinas <edus@microsoft.com>
Signed-off-by: catswe <212922539+catswe@users.noreply.github.com>
Co-authored-by: Eduardo Salinas <edus@microsoft.com>
-
shaharmor98 authored
-
JartX authored
[BUGFIX] Fix hipErrorIllegalState in Qwen3-Omni during startup profiling allow inference Omni on ROCM (#33077)
Signed-off-by: JartX <sagformas@epdcenter.es>
-
Maral authored
Signed-off-by: maral <maralbahari.98@gmail.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Eduardo Salinas authored
Signed-off-by: Eduardo Salinas <edus@microsoft.com>
-
- 31 Jan, 2026 11 commits
-
-
René Honig authored
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
Roy Wang authored
Signed-off-by: esmeetu <jasonailu87@gmail.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by: Hollow Man <hollowman@opensuse.org>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Jinwu authored
Co-authored-by: jinwuguo <jinwuguo@tencent.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Fadi Arafeh authored
[CPU][Feat] Enable KleidiAI accelerated int4 dynamic quant with BF16 activations on Arm CPUs (#33122)
Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>
-
AutumnAurelium authored
Signed-off-by: AutumnAurelium <88015631+AutumnAurelium@users.noreply.github.com>
-
Dimitrios Bariamis authored
Signed-off-by: Dimitrios Bariamis <12195802+dbari@users.noreply.github.com>
Co-authored-by: Dimitrios Bariamis <12195802+dbari@users.noreply.github.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Matthias Gehre authored
Signed-off-by: Matthias Gehre <matthias.gehre@amd.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
-