Merge branch 'v0.9.2-dev-ds-wm-1217' into 'v0.9.2-dev-ds'
w8a8 高吞吐模式先量化再dispatch See merge request dcutoolkit/deeplearing/vllm!303
Showing
Please register or sign in to comment
w8a8 高吞吐模式先量化再dispatch See merge request dcutoolkit/deeplearing/vllm!303