- 05 Dec, 2025 1 commit
-
-
王敏 authored
-
- 04 Dec, 2025 3 commits
- 03 Dec, 2025 1 commit
-
-
zhuwenwen authored
add VLLM_USE_OPT_RESHAPE_AND_CACHE、VLLM_USE_FUSE_SILU_AND_MUL and VLLM_USE_TOPK_RENORM for qwen3-30b
-
- 02 Dec, 2025 1 commit
-
-
王敏 authored
-
- 24 Nov, 2025 1 commit
-
-
liuchy5 authored
-
- 21 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 18 Nov, 2025 1 commit
-
-
王敏 authored
-
- 17 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 14 Nov, 2025 1 commit
-
-
zhuwenwen authored
-
- 13 Nov, 2025 2 commits
- 29 Oct, 2025 1 commit
-
- 28 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 20 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 12 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 11 Oct, 2025 2 commits
- 03 Oct, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
- 02 Oct, 2025 2 commits
-
-
Chen Zhang authored
Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
- 01 Oct, 2025 4 commits
-
-
Lucia Fang authored
Signed-off-by:
Lu Fang <fanglu@fb.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
qizixi authored
Signed-off-by:zixi-qi <qizixi@meta.com>
-
Yongye Zhu authored
Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Signed-off-by:
youkaichao <youkaichao@gmail.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Signed-off-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com> Signed-off-by:
Lucia Fang <fanglu@meta.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Lucia Fang <116399278+luccafong@users.noreply.github.com> Co-authored-by:
Lucia Fang <fanglu@meta.com> Co-authored-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
Siyuan Fu <siyuanf@nvidia.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Xiaozhu Meng <mxz297@gmail.com> Co-authored-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
- 30 Sep, 2025 1 commit
-
-
zhuwenwen authored
-
- 29 Sep, 2025 1 commit
-
-
Robert Shaw authored
Signed-off-by:
Sage Moore <sage@neuralmagic.com> Signed-off-by:
simon-mo <simon.mo@hey.com> Signed-off-by:
rentianyue-jk <rentianyue-jk@360shuke.com> Signed-off-by:
Russell Bryant <rbryant@redhat.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
Chenheli Hua <huachenheli@outlook.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Sage Moore <sage@neuralmagic.com> Co-authored-by:
Russell Bryant <rbryant@redhat.com> Co-authored-by:
rentianyue-jk <rentianyue-jk@360shuke.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Chenheli Hua <huachenheli@outlook.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com> Co-authored-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Robert Shaw <robshaw@redhat.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
- 28 Sep, 2025 2 commits
-
-
Roger Wang authored
Signed-off-by:
Roger Wang <hey@rogerw.io> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
Sage Moore authored
Signed-off-by:
Sage Moore <sage@neuralmagic.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
- 26 Sep, 2025 10 commits
-
-
Seiji Eicher authored
Signed-off-by:
Seiji Eicher <seiji@anyscale.com> Signed-off-by:
Seiji Eicher <58963096+eicherseiji@users.noreply.github.com> Co-authored-by:
Rui Qiao <161574667+ruisearch42@users.noreply.github.com>
-
Lucas Wilkinson authored
[BugFix] Fix using `dbo_decode_token_threshold` always (and ignoring `dbo_prefill_token_threshold`) (#25622) Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <noooop@126.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Chih-Chieh Yang authored
Signed-off-by:
Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com> Co-authored-by:
RishiAstra <40644327+RishiAstra@users.noreply.github.com>
-
Icey authored
Signed-off-by:Icey <1790571317@qq.com>
-
Tao He authored
[Qwen3-Next][GDN] fixes cuda graph capturing bug in GDN metadata and a stride bug in causal_conv_1d. (#25743) Signed-off-by:Tao He <linzhu.ht@alibaba-inc.com>
-
Andrew Sansom authored
Signed-off-by:Andrew Sansom <andrew@protopia.ai>
-
Andrew Sansom authored
fix: revert cast to cpu in `MsgpackEncoder._encode_tensor` to avoid hidden performance regressions (#25738) Signed-off-by:Andrew Sansom <andrew@protopia.ai>
-
yitingdc authored
Signed-off-by:yiting.jiang <yiting.jiang@daocloud.io>
-
zhuwenwen authored
-