[Bugfix] Cuda Clean up scales Kvcache fp8/int8_per_token_head (#39224)
Signed-off-by:JartX <sagformas@epdcenter.es> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
Showing
Please register or sign in to comment
Signed-off-by:JartX <sagformas@epdcenter.es> Co-authored-by:
Michael Goin <mgoin64@gmail.com>