Merge branch 'vllm-zhagnshao' into 'v0.8.5.post1-dev'
增加4倍的临时空间,解决大batch 大seq时,临时空间不足访存越界的情况 See merge request dcutoolkit/deeplearing/vllm!133
Showing
Please register or sign in to comment
增加4倍的临时空间,解决大batch 大seq时,临时空间不足访存越界的情况 See merge request dcutoolkit/deeplearing/vllm!133