- 20 Feb, 2025 1 commit
-
-
sangwz authored
-
- 13 Sep, 2024 1 commit
-
-
sangwzh authored
-
- 14 Aug, 2023 1 commit
-
-
Xin Yao authored
Signed-off-by:Xin Yao <xiny@nvidia.com>
-
- 07 Nov, 2022 1 commit
-
-
Hongzhi (Steve), Chen authored
* [Misc] clang-format auto fix. * blabla * nolint * blabla Co-authored-by:Steve <ubuntu@ip-172-31-34-29.ap-northeast-1.compute.internal>
-
- 06 Nov, 2022 1 commit
-
-
Xin Yao authored
* add bf16 specializations * remove SWITCH_BITS * enable amp for bf16 * remove SWITCH_BITS for cpu kernels * enbale bf16 based on CUDART * fix compiling for sm<80 * fix cpu build * enable unit tests * update doc * disable test for CUDA < 11.0 * address comments * address comments
-