Merge branch 'v0.9.2-dev-wm-1108' into 'v0.9.2-dev'
[fix] Resolve an issue where, with MTP enabled, tensors allocated in MLA could end up with corrupted data when GPU memory ran out in extreme cases. See merge request dcutoolkit/deeplearing/vllm!247