"vllm/sampling_params.py" did not exist on "42f1042e1c71f89b7875a292d1adf3a8d01c6d49"
[PyTorch] Fix attention backend and tests for `sm120` (#2320)
* Fix attention backend and tests for sm120 Signed-off-by:Kirthi Shankar Sivamani <ksivamani@nvidia.com> * Disable MLA only for backward Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> --------- Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Showing
Please register or sign in to comment