Merge branch 'fp8e5m2_bw' into 'v0.11.0-dev'
Fix the fp8 KV cache scale error. See merge request dcutoolkit/deeplearing/vllm!455