Merge branch 'v0.15.1-dev-kvfp8-fuse' into 'v0.15.1-dev'
支持kvacache fp8_e4m3的RMS_ROPE_CONCAT See merge request dcutoolkit/deeplearing/vllm!531
Showing
Please register or sign in to comment
支持kvacache fp8_e4m3的RMS_ROPE_CONCAT See merge request dcutoolkit/deeplearing/vllm!531