Merge branch 'v0.9.2-dev-main+mtp-zero' into 'v0.9.2-dev'
fix: 只有当kv block中不含有MTP的假数据时才会被cached,以修复cache_full_blocks同一个kv block保存两次的bug See merge request dcutoolkit/deeplearing/vllm!331
Showing
Please register or sign in to comment
fix: 只有当kv block中不含有MTP的假数据时才会被cached,以修复cache_full_blocks同一个kv block保存两次的bug See merge request dcutoolkit/deeplearing/vllm!331