- 05 Nov, 2025 2 commits
-
-
Lei Wang authored
[Refactor] Dynamic registration of FP8 data type for compatibility with older PyTorch versions (#1197)
-
Lei Wang authored
* fix * lint fix * fix * lint fix * fix * upd * support n>256 * Remove unnecessary pass configurations for fast math in MHA forward BHSD latency script. * lint fix * lint fix
-
- 02 Nov, 2025 1 commit
-
-
Lei Wang authored
* fix * lint fix * fix * lint fix * fix * upd
-