- 12 May, 2022 1 commit
-
-
John Kamalu authored
tools/merge_datasets.py - tool to merge multiple dataset files into a single dataset - testing conducted and included in the megatron-testing repo https://gitlab-master.nvidia.com/ADLR/megatron-testing tools/preprocess_data.py - magic numbers changed to required command line arguments megatron/data/indexed_dataset.py - when merging, fix to properly update document index - testing conducted and included in the megatron-testing repo (see above) - fix follows this history https://github.com/bigscience-workshop/Megatron-DeepSpeed/pull/66
-
- 15 Feb, 2022 3 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 01 Feb, 2022 3 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 14 Jan, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 13 Jan, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 12 Jan, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 17 Aug, 2021 1 commit
-
-
mshoeybi authored
-
- 19 May, 2021 1 commit
-
-
Mostofa Patwary authored
-
- 18 May, 2021 1 commit
-
-
Mostofa Patwary authored
-
- 14 May, 2021 1 commit
-
-
Jared Casper authored
-
- 16 Apr, 2021 1 commit
-
-
Jared Casper authored
-
- 09 Apr, 2021 1 commit
-
-
Mostofa Patwary authored
-
- 19 Mar, 2021 1 commit
-
-
Mostofa Patwary authored
-
- 10 Mar, 2021 1 commit
-
-
Mostofa Patwary authored
-
- 23 Feb, 2021 1 commit
-
-
Mostofa Patwary authored
-
- 18 Feb, 2021 1 commit
-
-
Mostofa Patwary authored
-
- 10 Feb, 2021 1 commit
-
-
Mostofa Patwary authored
-
- 04 Feb, 2021 1 commit
-
-
Mostofa Patwary authored
-
- 29 Jan, 2021 2 commits
-
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
- 28 Jan, 2021 1 commit
-
-
Vijay Korthikanti authored
-
- 26 Jan, 2021 1 commit
-
-
Vijay Korthikanti authored
-
- 25 Jan, 2021 1 commit
-
-
Mohammad Shoeybi authored
-
- 22 Jan, 2021 1 commit
-
-
Vijay Korthikanti authored
-
- 12 Jan, 2021 1 commit
-
-
Mohammad Shoeybi authored
-
- 08 Jan, 2021 1 commit
-
-
Vijay Korthikanti authored
-
- 30 Dec, 2020 1 commit
-
-
mshoeybi authored
-
- 19 Dec, 2020 7 commits
- 02 Dec, 2020 1 commit
-
-
mohammad authored
-
- 29 Nov, 2020 1 commit
-
-
mohammad authored
-