- 04 Feb, 2026 8 commits
-
-
Sage Moore authored
Change the type signature of MixtureOfExperts.expert_weights to MutableSequence[Sequence[Tensor]] (#33573) Signed-off-by:
Sage Moore <sagmoore@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Muhammad Hashmi authored
Signed-off-by:
Muhammad Hashmi <mhashmi@berkeley.edu> Signed-off-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
NickLucche <nlucches@redhat.com>
-
Taeksang Kim authored
Signed-off-by:Taeksang Kim <ts.kim@hyperaccel.ai>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Yueqian Lin authored
Signed-off-by:
linyueqian <linyueqian@outlook.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Shanshan Shen authored
Signed-off-by:shen-shanshan <467638484@qq.com>
-
- 03 Feb, 2026 7 commits
-
-
Michael Goin authored
[Bugfix] Disable TRTLLM FP8 MoE if router_logits_dtype==float32 and routing_method!=DeepSeekV3 (#33613) Signed-off-by:mgoin <mgoin64@gmail.com>
-
Patrick von Platen authored
Signed-off-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Shanshan Shen authored
Signed-off-by:shen-shanshan <467638484@qq.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
zxy authored
Signed-off-by:
zxy <zhou0493@e.ntu.edu.sg> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Song Zhixin authored
Signed-off-by:
jesse <szxfml@gmail.com> Signed-off-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Shengliang Xu authored
Signed-off-by:
Shengliang Xu <shengliangx@nvidia.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 02 Feb, 2026 9 commits
-
-
Patrick von Platen authored
Signed-off-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Yang Liu authored
Signed-off-by:Yang <lymailforjob@gmail.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
danielafrimi authored
Signed-off-by:dafrimi <dafrimi@nvidia.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Borushiki authored
Signed-off-by:Borushiki <38628261+Otsutsukii@users.noreply.github.com>
-
Grzegorz K. Karch authored
Signed-off-by:Grzegorz Karch <gkarch@nvidia.com>
-
RED authored
Signed-off-by:
liuli <ll407707@alibaba-inc.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
liuli <ll407707@alibaba-inc.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
csy0225 authored
Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
i-zhangmingming <i-zhangmingming@stepfun.com> Co-authored-by:
xiewuxun <xiewuxun@stepfun.com> Co-authored-by:
zetaohong <i-hongzetao@stepfun.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
- 01 Feb, 2026 2 commits
-
-
JartX authored
[BUGFIX] Fix hipErrorIllegalState in Qwen3-Omni during startup profiling allow inference Omni on ROCM (#33077) Signed-off-by:JartX <sagformas@epdcenter.es>
-
Eduardo Salinas authored
Signed-off-by:Eduardo Salinas <edus@microsoft.com>
-
- 31 Jan, 2026 8 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
AutumnAurelium authored
Signed-off-by:AutumnAurelium <88015631+AutumnAurelium@users.noreply.github.com>
-
Dimitrios Bariamis authored
Signed-off-by:
Dimitrios Bariamis <12195802+dbari@users.noreply.github.com> Co-authored-by:
Dimitrios Bariamis <12195802+dbari@users.noreply.github.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Patrick von Platen authored
Signed-off-by:Patrick von Platen <patrick.v.platen@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 30 Jan, 2026 6 commits
-
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
Julien Denize authored
Signed-off-by:juliendenize <julien.denize@mistral.ai>
-
Patrick von Platen authored
Signed-off-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
tianshu-Michael-yu authored
Signed-off-by:Tianshu Yu <tianshuyu.formal@gmail.com>
-
hujiaxin0 authored
Signed-off-by:
hujiaxin <524446785@qq.com> Signed-off-by:
Emilie1001 <79921183+Emilie1001@users.noreply.github.com> Co-authored-by:
Emilie1001 <79921183+Emilie1001@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-