- 13 May, 2025 6 commits
-
-
zhuwenwen authored
Merge branch 'v0.8.5.post1-dev' of http://112.11.119.99:10068/dcutoolkit/deeplearing/vllm into v0.8.5.post1-dev
-
zhuwenwen authored
-
zhuwenwen authored
提升bf16 pa精度 See merge request dcutoolkit/deeplearing/vllm!112
-
zhangshao authored
-
zhuwenwen authored
support telechat2 and glm4 nn layout remove log of request_id
-
zhuwenwen authored
-
- 09 May, 2025 14 commits
-
-
zhuwenwen authored
debug on v0.8.5 See merge request dcutoolkit/deeplearing/vllm!111
-
lizhigong authored
-
zhuwenwen authored
V0.8.5 zero overhead See merge request dcutoolkit/deeplearing/vllm!110
-
lizhigong authored
-
lizhigong authored
-
lizhigong authored
-
lizhigong authored
-
lizhigong authored
-
lizhigong authored
-
lizhigong authored
-
lizhigong authored
-
lizhigong authored
-
lizhigong authored
-
lizhigong authored
-
- 08 May, 2025 6 commits
- 07 May, 2025 4 commits
- 06 May, 2025 4 commits
- 02 May, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Robert Shaw authored
Signed-off-by:rshaw@neuralmagic.com <robertgshaw2@gmail.com>
-
- 30 Apr, 2025 3 commits
- 29 Apr, 2025 1 commit
-
-
zhuwenwen authored
-