- 19 Jul, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 10 Jul, 2025 1 commit
-
-
王敏 authored
-
- 17 Jun, 2025 1 commit
-
-
王敏 authored
[fix]1.修复medusa、scorer等并行解码单测;2.修复moe kernel单测问题,优化代码;3.修复rejection_sampler中test_compare_nonflashinfer_backend单测问题
-
- 13 Jun, 2025 1 commit
-
-
王敏 authored
-
- 04 Jun, 2025 1 commit
-
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
- 03 Jun, 2025 1 commit
-
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
- 27 May, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 23 May, 2025 3 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
lizhigong authored
-
Shanshan Shen authored
Signed-off-by:shen-shanshan <467638484@qq.com>
-
- 19 May, 2025 1 commit
-
-
王敏 authored
-
- 09 May, 2025 5 commits
- 08 May, 2025 1 commit
-
-
zhuwenwen authored
-
- 07 May, 2025 1 commit
-
-
zhuwenwen authored
-
- 04 May, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 02 May, 2025 1 commit
-
-
Andrew Sansom authored
Signed-off-by:
Andrew Sansom <andrew@protopia.ai> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by:
临景 <linjing.yx@alibaba-inc.com> Co-authored-by:
Bryce1010 <bryceyx@gmail.com> Co-authored-by:
Nan2018 <nan@protopia.ai> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
DarkLight1337 <tlleungac@connect.ust.hk>
-
- 01 May, 2025 2 commits
-
-
Huy Do authored
Signed-off-by:Huy Do <huydhn@gmail.com>
-
Noah Yoshida authored
Signed-off-by:Noah Yoshida <noahcy117@gmail.com>
-
- 29 Apr, 2025 1 commit
-
-
xiabo authored
-
- 24 Apr, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 22 Apr, 2025 1 commit
-
-
zhuwenwen authored
-
- 21 Apr, 2025 1 commit
-
-
Shanshan Shen authored
Signed-off-by:shen-shanshan <467638484@qq.com>
-
- 18 Apr, 2025 1 commit
-
-
zhuwenwen authored
[fix]修复开启并行解码后,在极端测试情况下,由于设置了speculative-disable-by-batch-size导致不跑并行解码导致previous_hidden_states不断增加,最终导致显存用尽服务无响应问题
-
- 23 Mar, 2025 1 commit
-
-
shangmingc authored
Signed-off-by:Shangming Cai <caishangming@linux.alibaba.com>
-
- 15 Mar, 2025 2 commits
-
-
Bryan Lu authored
Signed-off-by:Bryan Lu <yuzhelu@amazon.com>
-
zhuwenwen authored
-
- 05 Mar, 2025 1 commit
-
-
pyc96 authored
[Bugfix] Fix DeepSeek MTP crash when using TP1ModelRunner with CUDA graph due to shape mismatch (#14237) Signed-off-by:pyc96 <pychen96@gmail.com>
-
- 27 Feb, 2025 1 commit
-
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <benjamin.chislett@centml.ai>
-
- 26 Feb, 2025 1 commit
-
-
Jee Jee Li authored
-
- 25 Feb, 2025 2 commits
-
-
cjackal authored
Signed-off-by:cjackal <44624812+cjackal@users.noreply.github.com>
-
Harry Mellor authored
-
- 20 Feb, 2025 1 commit
-
-
Simon Mo authored
-
- 19 Feb, 2025 2 commits
-
-
shangmingc authored
Signed-off-by:Shangming Cai <caishangming@linux.alibaba.com>
-
Lucia Fang authored
Signed-off-by:
Lu Fang <fanglu@fb.com> Co-authored-by:
LiuXiaoxuanPKU <lilyliupku@gmail.com>
-
- 17 Feb, 2025 2 commits
-
-
王敏 authored
-
shangmingc authored
Signed-off-by:Shangming Cai <caishangming@linux.alibaba.com>
-