- 16 Jan, 2026 1 commit
-
-
xiabo authored
vllm:export VLLM_CUSTOM_CACHE=1 dtk:export HIP_KERNEL_EVENT_SYSTENFENCE=1 2、kvcache支持fp8
-
- 12 Jan, 2026 1 commit
-
-
yangql authored
-
- 10 Jan, 2026 1 commit
-
-
zhuwenwen authored
-
- 05 Jan, 2026 3 commits
- 22 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 20 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 12 Dec, 2025 1 commit
-
-
zhuwenwen authored
replace the fp8_mqa_logits and fp8_paged_mqa_logits interfaces in deepgemm with mqa_logits and paged_mqa_logits from lightop
-
- 04 Dec, 2025 3 commits
- 03 Dec, 2025 1 commit
-
-
zhuwenwen authored
add VLLM_USE_OPT_RESHAPE_AND_CACHE、VLLM_USE_FUSE_SILU_AND_MUL and VLLM_USE_TOPK_RENORM for qwen3-30b
-
- 02 Dec, 2025 1 commit
-
-
王敏 authored
-
- 13 Nov, 2025 2 commits
- 20 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 03 Oct, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
- 02 Oct, 2025 2 commits
-
-
Chen Zhang authored
Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
- 01 Oct, 2025 2 commits
-
-
Lucia Fang authored
Signed-off-by:
Lu Fang <fanglu@fb.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
Yongye Zhu authored
Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Signed-off-by:
youkaichao <youkaichao@gmail.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Signed-off-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com> Signed-off-by:
Lucia Fang <fanglu@meta.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Lucia Fang <116399278+luccafong@users.noreply.github.com> Co-authored-by:
Lucia Fang <fanglu@meta.com> Co-authored-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
Siyuan Fu <siyuanf@nvidia.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Xiaozhu Meng <mxz297@gmail.com> Co-authored-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
- 30 Sep, 2025 1 commit
-
-
zhuwenwen authored
-
- 26 Sep, 2025 3 commits
-
-
Chih-Chieh Yang authored
Signed-off-by:
Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com> Co-authored-by:
RishiAstra <40644327+RishiAstra@users.noreply.github.com>
-
Icey authored
Signed-off-by:Icey <1790571317@qq.com>
-
Tao He authored
[Qwen3-Next][GDN] fixes cuda graph capturing bug in GDN metadata and a stride bug in causal_conv_1d. (#25743) Signed-off-by:Tao He <linzhu.ht@alibaba-inc.com>
-
- 25 Sep, 2025 5 commits
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Jonas M. Kübler authored
Signed-off-by:Jonas Kuebler <kuebj@amazon.com>
-
zhuwenwen authored
-
Wei Wei authored
Signed-off-by:Wei Wei <wwei6@meta.com>
-
- 24 Sep, 2025 5 commits
-
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by:
Woosuk Kwon <woosuk@thinkingmachines.ai>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Benjamin Chislett authored
Signed-off-by:
Benjamin Chislett <benjamin.chislett@centml.ai> Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Co-authored-by:
lhsjohn <huashuoli@tencent.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Benjamin Chislett authored
-
- 23 Sep, 2025 4 commits
-
-
Burkhard Ringlein authored
Signed-off-by:
Burkhard Ringlein <ngl@zurich.ibm.com> Co-authored-by:
Chih-Chieh Yang <chih.chieh.yang@ibm.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Lucas Wilkinson authored
Signed-off-by:
Sage Moore <sage@neuralmagic.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Sage Moore <sage@neuralmagic.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-