Fix FP8 current scaling attention logic (#2234)
* Fix in FP8 attention selection logic Signed-off-by:Kirthi Shankar Sivamani <ksivamani@nvidia.com> * Improve logic Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> --------- Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Showing
Please register or sign in to comment