- 04 Mar, 2026 1 commit
-
-
zhangshao authored
-
- 02 Mar, 2026 1 commit
-
-
zhangshao authored
-
- 24 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 27 Jan, 2026 1 commit
-
-
xiabo authored
-
- 21 Jan, 2026 1 commit
-
-
xiabo authored
-
- 15 Jan, 2026 1 commit
-
-
zhuwenwen authored
todo: add VLLM_USE_QUERY_QUANT to not use q quant
-
- 13 Nov, 2025 1 commit
-
-
zhuwenwen authored
set default_max_num_batched_tokens = 10240 update qwen3_moe of layernorm off lightop of moe_fused_gate
-
- 31 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 20 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 11 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 01 Oct, 2025 1 commit
-
-
Yongye Zhu authored
Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Signed-off-by:
youkaichao <youkaichao@gmail.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Signed-off-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com> Signed-off-by:
Lucia Fang <fanglu@meta.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Lucia Fang <116399278+luccafong@users.noreply.github.com> Co-authored-by:
Lucia Fang <fanglu@meta.com> Co-authored-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
Siyuan Fu <siyuanf@nvidia.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Xiaozhu Meng <mxz297@gmail.com> Co-authored-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
- 25 Sep, 2025 2 commits
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com>
-
Jonas M. Kübler authored
Signed-off-by:Jonas Kuebler <kuebj@amazon.com>
-
- 23 Sep, 2025 2 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 20 Sep, 2025 2 commits
-
-
Chendi.Xue authored
Signed-off-by:Chendi Xue <Chendi.Xue@intel.com>
-
Boyuan Feng authored
Signed-off-by:
Boyuan Feng <boyuan@meta.com> Signed-off-by:
Boyuan Feng <fby.1994@gmail.com> Signed-off-by:
boyuanfeng <boyuan@meta.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 19 Sep, 2025 1 commit
-
-
Yan Ma authored
Signed-off-by:
Yan Ma <yan.ma@intel.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 18 Sep, 2025 2 commits
-
-
zhuwenwen authored
-
Chaojun Zhang authored
Signed-off-by:chzhang <chaojun.zhang@intel.com>
-
- 15 Sep, 2025 1 commit
-
-
Rafael Marcelino Koike authored
Signed-off-by:
Rafael Marcelino Koike <rafael.koike@oracle.com> Signed-off-by:
Rafael Koike <koike.rafael@gmail.com>
-
- 12 Sep, 2025 1 commit
-
-
Wenlong Wang authored
Signed-off-by:wwl2755 <wangwenlong2755@gmail.com>
-
- 11 Sep, 2025 1 commit
-
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
- 10 Sep, 2025 1 commit
-
-
baonudesifeizhai authored
Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 27 Aug, 2025 1 commit
-
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
- 25 Aug, 2025 1 commit
-
-
Ayush Satyam authored
Signed-off-by:Ayush Satyam <ayushsatyam146@gmail.com>
-
- 22 Aug, 2025 1 commit
-
-
elvischenv authored
Signed-off-by:
elvischenv <219235043+elvischenv@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 19 Aug, 2025 1 commit
-
-
elvischenv authored
Signed-off-by:
elvischenv <219235043+elvischenv@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 16 Aug, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 12 Aug, 2025 1 commit
-
-
Yongye Zhu authored
Signed-off-by:Yongye Zhu <zyy1102000@gmail.com>
-
- 07 Aug, 2025 1 commit
-
-
Lucas Wilkinson authored
[Attention] Support multiple attention metadata builders per kv_cache_spec + proper local attention no hybrid kv cache fix (#21588) Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 06 Aug, 2025 1 commit
-
-
zhuwenwen authored
-
- 05 Aug, 2025 1 commit
-
-
zhuwenwen authored
-
- 23 Jul, 2025 1 commit
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
- 21 Jul, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 19 Jul, 2025 2 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Lucia Fang authored
Signed-off-by:
Lucia Fang <fanglu@fb.com> Signed-off-by:
Lu Fang <fanglu@meta.com> Signed-off-by:
Lu Fang <fanglu@fb.com> Co-authored-by:
Lu Fang <fanglu@meta.com>
-
- 18 Jul, 2025 1 commit
-
-
hax0r31337 authored
Signed-off-by:hax0r31337 <liulihaocaiqwq@gmail.com>
-
- 17 Jul, 2025 2 commits