Merge branch 'fp8e5m2_bw' into 'v0.11.0-dev'
Fix the scale computation bug for the e5m2 KV cache in the Qwen VL series.

See merge request dcutoolkit/deeplearing/vllm!452