- 13 Nov, 2025 1 commit
-
-
zhuwenwen authored
restore the default settings of disable_cascade_attn add VLLM_USE_OPT_ZEROS to replace triton_ (torch.zeros) set default_max_num_batched_tokens = 10240 update qwen3_moe of layernorm
-
- 27 Oct, 2025 1 commit
-
-
王敏 authored
-
- 13 Oct, 2025 1 commit
-
-
王敏 authored
-
- 30 Sep, 2025 1 commit
-
-
王敏 authored
-
- 04 Sep, 2025 1 commit
-
-
王敏 authored
2.解决mtp >1 大EP推理all gather卡住问题
-
- 01 Sep, 2025 1 commit
-
-
王敏 authored
-
- 15 Aug, 2025 1 commit
-
-
王敏 authored
-
- 07 Aug, 2025 1 commit
-
-
王敏 authored
-
- 06 Aug, 2025 1 commit
-
-
zhuwenwen authored
This reverts merge request !169
-
- 05 Aug, 2025 1 commit
-
-
王敏 authored
-
- 30 Jul, 2025 1 commit
-
-
yangql authored
-
- 26 Jul, 2025 1 commit
-
-
yangql authored
-
- 03 Jul, 2025 1 commit
-
-
zhuwenwen authored
-
- 25 Jun, 2025 1 commit
-
-
cjackal authored
Signed-off-by:cjackal <44624812+cjackal@users.noreply.github.com>
-
- 03 Jun, 2025 1 commit
-
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
- 23 May, 2025 1 commit
-
-
Jiayi Yao authored
Signed-off-by:
Rui Qiao <ruisearch42@gmail.com> Signed-off-by:
YaoJiayi <120040070@link.cuhk.edu.cn> Co-authored-by:
Rui Qiao <ruisearch42@gmail.com>
-
- 15 May, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 24 Apr, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 27 Feb, 2025 2 commits
-
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <benjamin.chislett@centml.ai>
-
zhuwenwen authored
-
- 25 Feb, 2025 1 commit
-
-
Harry Mellor authored
-
- 19 Feb, 2025 1 commit
-
-
Lucia Fang authored
Signed-off-by:
Lu Fang <fanglu@fb.com> Co-authored-by:
LiuXiaoxuanPKU <lilyliupku@gmail.com>
-