- 30 Sep, 2022 1 commit
-
-
Lawrence McAfee authored
-
- 23 Sep, 2022 1 commit
-
-
Jared Casper authored
The LICENSE file says everything is 3-clause BSD, which is what we want, but at some point the Apache license was added to the top of some files and that proliferated. This commit removes the Apache license from any files that we own the copyright to. It also updates the copyright year and removes the unnecessary coding=utf-8 line.
-
- 17 May, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 28 Apr, 2022 2 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 24 Mar, 2022 1 commit
-
-
Lawrence McAfee authored
-
- 14 Mar, 2022 1 commit
-
-
Lawrence McAfee authored
-
- 09 Mar, 2022 3 commits
-
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
- 02 Mar, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 01 Mar, 2022 1 commit
-
-
Lawrence McAfee authored
-
- 18 Feb, 2022 1 commit
-
-
Lawrence McAfee authored
-
- 15 Feb, 2022 1 commit
-
-
Lawrence McAfee authored
-
- 10 Feb, 2022 2 commits
-
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
- 09 Feb, 2022 1 commit
-
-
Lawrence McAfee authored
-
- 26 Jan, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 22 Jan, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 12 Jan, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 19 Aug, 2021 1 commit
-
-
mshoeybi authored
-
- 16 Jul, 2021 1 commit
-
-
Lawrence McAfee authored
-
- 19 Mar, 2021 1 commit
-
-
Mohammad Shoeybi authored
-
- 08 Mar, 2021 1 commit
-
-
Mohammad Shoeybi authored
-
- 04 Mar, 2021 1 commit
-
-
Rewon Child authored
-
- 23 Feb, 2021 1 commit
-
-
Rewon Child authored
-
- 09 Feb, 2021 1 commit
-
-
Deepak Narayanan authored
- Split a model's computation into multiple virtual stages as needed, and schedule communication correctly between these virtual stages
- Move schedule code into `schedules.py` and communication code into `p2p_communication.py`
- Use hyphens instead of spaces in all time logging for consistency
- Factor out code in `megatron/training.py` into helper functions
- Refactor `evaluate()` function: make it use forward_backward_schedule functions
-
- 22 Jan, 2021 1 commit
-
-
Vijay Korthikanti authored
-
- 08 Jan, 2021 1 commit
-
-
Vijay Korthikanti authored
-
- 30 Dec, 2020 1 commit
-
-
mohammad authored
-
- 27 Dec, 2020 1 commit
-
-
mohammad authored
-
- 25 Dec, 2020 1 commit
-
-
mohammad authored
-