- 17 Apr, 2025 1 commit
-
-
王敏 authored
[fix]修复开启并行解码后,在极端测试情况下,由于设置了speculative-disable-by-batch-size导致不跑并行解码导致previous_hidden_states不断增加,最终导致显存用尽服务无响应问题
-
- 15 Apr, 2025 5 commits
- 14 Apr, 2025 3 commits
- 11 Apr, 2025 1 commit
-
-
zhuwenwen authored
-
- 10 Apr, 2025 1 commit
-
-
zhuwenwen authored
-
- 09 Apr, 2025 1 commit
-
-
zhuwenwen authored
-
- 08 Apr, 2025 1 commit
-
-
zhuwenwen authored
-
- 07 Apr, 2025 1 commit
-
-
zhuwenwen authored
-
- 03 Apr, 2025 2 commits
- 02 Apr, 2025 1 commit
-
-
zhuwenwen authored
-
- 28 Mar, 2025 3 commits
- 27 Mar, 2025 1 commit
-
-
yangql authored
-
- 26 Mar, 2025 8 commits
- 25 Mar, 2025 2 commits
- 24 Mar, 2025 5 commits
- 22 Mar, 2025 1 commit
-
-
xiabo authored
-
- 18 Mar, 2025 1 commit
-
-
zhuwenwen authored
-
- 17 Mar, 2025 2 commits