- 04 Feb, 2026 12 commits
-
-
Sage Moore authored
Change the type signature of MixtureOfExperts.expert_weights to MutableSequence[Sequence[Tensor]] (#33573) Signed-off-by:
Sage Moore <sagmoore@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Muhammad Hashmi authored
Signed-off-by:
Muhammad Hashmi <mhashmi@berkeley.edu> Signed-off-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
NickLucche <nlucches@redhat.com>
-
Simon Danielsson authored
Signed-off-by:simondanielsson <simon.danielsson99@hotmail.com>
-
Taeksang Kim authored
Signed-off-by:Taeksang Kim <ts.kim@hyperaccel.ai>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Yueqian Lin authored
Signed-off-by:
linyueqian <linyueqian@outlook.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Vadim Gimpelson authored
Signed-off-by:Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
Frank Wang authored
Signed-off-by:frankwang28 <frank.wbb@hotmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Shanshan Shen authored
Signed-off-by:shen-shanshan <467638484@qq.com>
-
- 03 Feb, 2026 11 commits
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Michael Goin authored
[Bugfix] Disable TRTLLM FP8 MoE if router_logits_dtype==float32 and routing_method!=DeepSeekV3 (#33613) Signed-off-by:mgoin <mgoin64@gmail.com>
-
Vadim Gimpelson authored
Signed-off-by:Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
Patrick von Platen authored
Signed-off-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Shanshan Shen authored
Signed-off-by:shen-shanshan <467638484@qq.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
zxy authored
Signed-off-by:
zxy <zhou0493@e.ntu.edu.sg> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Song Zhixin authored
Signed-off-by:
jesse <szxfml@gmail.com> Signed-off-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Michael Goin authored
[Bugfix] Disable RoutingMethodType.[Renormalize,RenormalizeNaive] TRTLLM per-tensor FP8 MoE (#33620) Signed-off-by:mgoin <mgoin64@gmail.com>
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
Shengliang Xu authored
Signed-off-by:
Shengliang Xu <shengliangx@nvidia.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 02 Feb, 2026 10 commits
-
-
Patrick von Platen authored
Signed-off-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Vasiliy Kuznetsov authored
Signed-off-by:vasiliy <vasiliy@fb.com>
-
Yang Liu authored
Signed-off-by:Yang <lymailforjob@gmail.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
danielafrimi authored
Signed-off-by:dafrimi <dafrimi@nvidia.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Borushiki authored
Signed-off-by:Borushiki <38628261+Otsutsukii@users.noreply.github.com>
-
Grzegorz K. Karch authored
Signed-off-by:Grzegorz Karch <gkarch@nvidia.com>
-
RED authored
Signed-off-by:
liuli <ll407707@alibaba-inc.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
liuli <ll407707@alibaba-inc.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
csy0225 authored
Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
i-zhangmingming <i-zhangmingming@stepfun.com> Co-authored-by:
xiewuxun <xiewuxun@stepfun.com> Co-authored-by:
zetaohong <i-hongzetao@stepfun.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
- 01 Feb, 2026 5 commits
-
-
will b. authored
Signed-off-by:
Eduardo Salinas <edus@microsoft.com> Signed-off-by:
catswe <212922539+catswe@users.noreply.github.com> Co-authored-by:
Eduardo Salinas <edus@microsoft.com>
-
shaharmor98 authored
-
JartX authored
[BUGFIX] Fix hipErrorIllegalState in Qwen3-Omni during startup profiling allow inference Omni on ROCM (#33077) Signed-off-by:JartX <sagformas@epdcenter.es>
-
Maral authored
Signed-off-by:
maral <maralbahari.98@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Eduardo Salinas authored
Signed-off-by:Eduardo Salinas <edus@microsoft.com>
-
- 31 Jan, 2026 2 commits
-
-
René Honig authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Roy Wang authored
Signed-off-by:esmeetu <jasonailu87@gmail.com>
-