- 01 Aug, 2023 1 commit
-
-
Tri Dao authored
-
- 28 Jul, 2023 1 commit
-
-
Tri Dao authored
-
- 27 Jul, 2023 1 commit
-
-
Kirthi Shankar Sivamani authored
* Add RNG state to kernel launch params Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> * Save seed and offset for backward Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> * Single thread write to global mem Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> * compute_dq_dk_dv_1colblock get seed and offset from launch params Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> * compute_dq_dk_dv_1rowblock get seed and offset from launch params Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> * Change forward c++ APIs to save RNG state for backward Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> * Change backward c++ APIs to set RNG state for bprop launcher Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> * Bug fixes Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> * Python side API changes Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> * Bug fix; only save seeds instead of full offset Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> * Account for 3D grid size Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com> --------- Signed-off-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com>
-
- 18 Jul, 2023 1 commit
-
-
Tri Dao authored
-
- 17 Jul, 2023 1 commit
-
-
Tri Dao authored
-
- 03 Jul, 2023 1 commit
-
-
Tri Dao authored
-
- 13 Apr, 2023 1 commit
-
-
Kirthi Shankar Sivamani authored
Signed-off-by:Kirthi Shankar Sivamani <ksivamani@nvidia.com>
-
- 12 Apr, 2023 1 commit
-
-
Kirthi Shankar Sivamani authored
Signed-off-by:Kirthi Shankar Sivamani <ksivamani@nvidia.com>
-
- 31 Mar, 2023 1 commit
-
-
Kirthi Shankar Sivamani authored
-
- 13 Dec, 2022 1 commit
-
-
Tri Dao authored
-
- 05 Nov, 2022 1 commit
-
-
Tri Dao authored
This is faster since we only need to do atomic adds on dq, instead of atomic adds on both dk and dv.
-
- 24 Oct, 2022 1 commit
-
-
Tri Dao authored
-
- 23 Oct, 2022 1 commit
-
-
Tri Dao authored
-
- 21 Oct, 2022 2 commits
- 14 Oct, 2022 2 commits
- 04 Jul, 2022 2 commits
- 02 Jun, 2022 1 commit
-
-
Tri Dao authored
-
- 29 May, 2022 1 commit
-
-
Tri Dao authored
-
- 26 May, 2022 1 commit
-
-
Tri Dao authored
-
- 20 May, 2022 1 commit
-
-
Tri Dao authored
-