- 14 Jan, 2026 1 commit
-
-
wanglong3 authored
-
- 07 Jan, 2026 2 commits
- 06 Jan, 2026 1 commit
-
-
zhuwenwen authored
-
- 05 Jan, 2026 4 commits
- 04 Jan, 2026 1 commit
-
-
laibao authored
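Implement the rms_mrope_fuse and rms_mrope_fuse_fake methods to optimize tensor computation. Update forward to take the new fused M-RoPE path when the conditions are met. Extend Qwen3MoeModel's support for dynamic parameter dimensions to accommodate this feature.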
-
- 22 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 20 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 16 Dec, 2025 1 commit
-
-
laibao authored
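Add the environment variable VLLM_USE_MARLIN_W16A16_MOE to explicitly enable Marlin W16A16 MoE experts. In fused_moe, call fused_experts_impl_w16a16_marlin when the switch is on and the implementation is available. Add the Marlin W16A16 MoE implementation and its reduce path.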
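The gating described in this commit message (use the Marlin path only when the switch is on and the implementation is available) can be sketched as follows. This is a minimal illustration, not the code from the commit: the helper names `env_flag` and `select_experts_impl` are hypothetical, and the set of accepted truthy values is an assumption, since the log does not say how the variable is parsed.

```python
import os
from typing import Callable, Optional


def env_flag(name: str, default: bool = False) -> bool:
    """Read a boolean switch from the environment.

    Assumption: "1", "true", "yes" and "on" (case-insensitive) count as
    enabled. The actual parsing used for the flag may differ.
    """
    raw = os.environ.get(name)
    if raw is None:
        return default
    return raw.strip().lower() in ("1", "true", "yes", "on")


def select_experts_impl(
    default_impl: Callable,
    marlin_impl: Optional[Callable],
) -> Callable:
    """Pick the experts implementation (hypothetical helper).

    The Marlin path is chosen only when the switch is on AND an
    implementation was actually provided; otherwise fall back.
    """
    if env_flag("VLLM_USE_MARLIN_W16A16_MOE") and marlin_impl is not None:
        return marlin_impl
    return default_impl
```

Requiring both conditions keeps the default path in place when the switch is set on a build where the Marlin kernel is missing.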
-
- 12 Dec, 2025 2 commits
- 30 Nov, 2025 1 commit
-
-
王敏 authored
-
- 20 Nov, 2025 1 commit
-
-
zhuwenwen authored
Add VLLM_USE_PP_SYNC to enable PP sync; update RMSNorm for Qwen3
-
- 17 Nov, 2025 1 commit
-
-
chenych authored
-
- 13 Nov, 2025 3 commits
- 31 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 29 Oct, 2025 1 commit
-
- 28 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 25 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 21 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 11 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 02 Oct, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Signed-off-by: simon-mo <simon.mo@hey.com>
-
- 01 Oct, 2025 4 commits
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: simon-mo <simon.mo@hey.com>
-
Roger Wang authored
Signed-off-by: simon-mo <simon.mo@hey.com>
-
Yongye Zhu authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Signed-off-by: NickLucche <nlucches@redhat.com>
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
Signed-off-by: Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>
Signed-off-by: Lucia Fang <fanglu@meta.com>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Co-authored-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Co-authored-by: yewentao256 <zhyanwentao@126.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
Co-authored-by: Lucia Fang <fanglu@meta.com>
Co-authored-by: NickLucche <nlucches@redhat.com>
Co-authored-by: Siyuan Fu <siyuanf@nvidia.com>
Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Xiaozhu Meng <mxz297@gmail.com>
Co-authored-by: Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>
Signed-off-by: simon-mo <simon.mo@hey.com>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.io>
Signed-off-by: simon-mo <simon.mo@hey.com>
-
- 29 Sep, 2025 1 commit
-
-
JJJYmmm authored
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com>
Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>
Signed-off-by: simon-mo <simon.mo@hey.com>
-
- 28 Sep, 2025 4 commits
-
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: Roger Wang <hey@rogerw.io>
Co-authored-by: Roger Wang <hey@rogerw.io>
Signed-off-by: simon-mo <simon.mo@hey.com>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.io>
Signed-off-by: simon-mo <simon.mo@hey.com>
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tlrmchlsmth@gmail.com>
Signed-off-by: simon-mo <simon.mo@hey.com>
-
Wentao Ye authored
Signed-off-by: simon-mo <simon.mo@hey.com>
-
- 26 Sep, 2025 4 commits
-
-
阿丹(adan) authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: liudan <adan@minicpm.com>
Co-authored-by: liudan <liudan@qq.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Lucia Fang <116399278+luccafong@users.noreply.github.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Chih-Chieh Yang authored
Signed-off-by: Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com>
Co-authored-by: RishiAstra <40644327+RishiAstra@users.noreply.github.com>
-
Eugene Khvedchenya authored
Signed-off-by: Eugene Khvedchenia <ekhvedchenia@nvidia.com>
Signed-off-by: Eugene Khvedchenya <ekhvedchenya@gmail.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
-