- 24 Feb, 2026 1 commit
-
-
laibao authored
Add a router_capture tool that filters MoE router logits by num_tokens/rank and writes them to disk. Hook the capture call into Qwen3MoeSparseMoeBlock and skip it automatically under torch.compile. Add the VLLM_MOE_ROUTER_CAPTURE* environment variables.
-
- 16 Feb, 2026 2 commits
- 13 Feb, 2026 1 commit
-
-
王敏 (Wang Min) authored
-
- 11 Feb, 2026 1 commit
-
-
laibao authored
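Reference and port the key commit logic from 011/vllm. Add the VLLM_USE_MOE_W16A16_TRITON switch, wired to lightop-based runtime capability probing with caching of the enablement result. After weight loading, pre-pack w13 and w2 for W16A16 Marlin. When W16A16 Marlin is enabled, keep the monolithic execution path and add a packed-weight fast path in fused_experts_impl. Fallback behavior when Marlin or lightop is unavailable is unchanged.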
-
- 10 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 06 Feb, 2026 4 commits
- 04 Feb, 2026 3 commits
-
-
zhuwenwen authored
-
zhuwenwen authored
-
Michael Goin authored
Signed-off-by: Robert Shaw <rshaw@neuralmagic.com>
-
- 03 Feb, 2026 3 commits
-
-
Kiersten Stokes authored
Signed-off-by: kiersten-stokes <kierstenstokes@gmail.com>
(cherry picked from commit 9e138cb0)
-
zhuwenwen authored
-
Zhewen Li authored
Signed-off-by: zhewenli <zhewen@inferact.ai>
Co-authored-by: zhewenli <zhewen@inferact.ai>
-
- 02 Feb, 2026 3 commits
-
-
Zhewen Li authored
Signed-off-by: zhewenli <zhewen@inferact.ai>
Co-authored-by: zhewenli <zhewen@inferact.ai>
-
csy0225 authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
Co-authored-by: i-zhangmingming <i-zhangmingming@stepfun.com>
Co-authored-by: xiewuxun <xiewuxun@stepfun.com>
Co-authored-by: zetaohong <i-hongzetao@stepfun.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
(cherry picked from commit c3b40dc3)
-
wang.yuqi authored
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
(cherry picked from commit abb34ac4)
-
- 28 Jan, 2026 1 commit
-
-
Roger Wang authored
Signed-off-by: wanglinian <wanglinian@stu.pku.edu.cn>
Signed-off-by: wangln19 <96399074+wangln19@users.noreply.github.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Signed-off-by: Roger Wang <hey@rogerw.io>
Co-authored-by: wanglinian <wanglinian@stu.pku.edu.cn>
Co-authored-by: wangln19 <96399074+wangln19@users.noreply.github.com>
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
(cherry picked from commit b539f988)
-
- 27 Jan, 2026 1 commit
-
-
Roger Wang authored
Signed-off-by: wanglinian <wanglinian@stu.pku.edu.cn>
Signed-off-by: wangln19 <96399074+wangln19@users.noreply.github.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Signed-off-by: Roger Wang <hey@rogerw.io>
Co-authored-by: wanglinian <wanglinian@stu.pku.edu.cn>
Co-authored-by: wangln19 <96399074+wangln19@users.noreply.github.com>
Co-authored-by: Zaida Zhou <58739961+zhouzaida@users.noreply.github.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 26 Jan, 2026 12 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
-
Andy Lo authored
Signed-off-by: Andy Lo <andy@mistral.ai>
-
Pleaplusone authored
Signed-off-by: ganyi <ygan@amd.com>
-
Yuxuan Zhang authored
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
VihaanThat authored
-
Itay Etelis authored
Signed-off-by: Itay Etelis <itay.etelis@ibm.com>
Co-authored-by: Itay Etelis <itay.etelis@ibm.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
ltd0924 authored
Signed-off-by: luotingdan <luotingdan@stepfun.com>
Signed-off-by: ltd0924 <32387785+ltd0924@users.noreply.github.com>
Co-authored-by: luotingdan <luotingdan@stepfun.com>
-
Lucas Wilkinson authored
[CI] Fix MHA attention test failure (AttributeError when model_config is None in ViT attention backend) (#33033)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Andreas Karatzas authored
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
-
- 25 Jan, 2026 2 commits
-
-
Itay Etelis authored
Signed-off-by: Itay Etelis <itay.etelis@ibm.com>
Co-authored-by: Itay Etelis <itay.etelis@ibm.com>
-
JJJYmmm authored
Signed-off-by: JJJYmmm <92386084+JJJYmmm@users.noreply.github.com>
Signed-off-by: Roger Wang <hey@rogerw.io>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Roger Wang <hey@rogerw.io>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 24 Jan, 2026 3 commits
-
-
david guan authored
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 23 Jan, 2026 2 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Harry Huang authored
Signed-off-by: huanghaoyan.hhy <huanghaoyan.hhy@alibaba-inc.com>
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
-