Merge branch 'v0.6.2-dev_wm' into 'v0.6.2-dev'
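[feat] Optimize the Medusa code: the VLLM_TREE_DECODING environment variable controls whether tree-style decoding is used, and the computation logic is isolated from the main code path.
See merge request dcutoolkit/deeplearing/vllm!51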