- 06 Dec, 2024 5 commits
- 05 Dec, 2024 10 commits
- 04 Dec, 2024 5 commits
- 03 Dec, 2024 8 commits
- 02 Dec, 2024 2 commits
- 29 Nov, 2024 1 commit
-
-
zhuwenwen authored
-
- 28 Nov, 2024 4 commits
- 27 Nov, 2024 3 commits
- 25 Nov, 2024 2 commits
[feat]并行解码支持多卡推理 See merge request dcutoolkit/deeplearing/vllm!48
fix See merge request dcutoolkit/deeplearing/vllm!47
[fix]修复单测test_mlp_correctness失败问题 See merge request dcutoolkit/deeplearing/vllm!45
0.6.2 w8a8 See merge request dcutoolkit/deeplearing/vllm!43
[fix]修复llm_engine.py 越界报错 See merge request dcutoolkit/deeplearing/vllm!42
优化medusa 推理 See merge request dcutoolkit/deeplearing/vllm!41
add VLLM_OPTEST_MODELS_PATH/OPTEST_MODELS_PATH to load models from local path instead of Hugging Face Hub