- 07 Dec, 2024 2 commits
- 06 Dec, 2024 6 commits
- 05 Dec, 2024 10 commits
- 04 Dec, 2024 5 commits
- 03 Dec, 2024 8 commits
- 02 Dec, 2024 2 commits
- 29 Nov, 2024 1 commit
-
-
zhuwenwen authored
-
- 28 Nov, 2024 4 commits
- 27 Nov, 2024 2 commits
Update offline_streaming_inference_chat_demo.py See merge request dcutoolkit/deeplearing/vllm!49
[feat]并行解码支持多卡推理 See merge request dcutoolkit/deeplearing/vllm!48
fix See merge request dcutoolkit/deeplearing/vllm!47
[fix]修复单测test_mlp_correctness失败问题 See merge request dcutoolkit/deeplearing/vllm!45
0.6.2 w8a8 See merge request dcutoolkit/deeplearing/vllm!43
[fix]修复llm_engine.py 越界报错 See merge request dcutoolkit/deeplearing/vllm!42
优化medusa 推理 See merge request dcutoolkit/deeplearing/vllm!41