- 03 May, 2020 1 commit
-
-
Neel Kant authored
-
- 30 Apr, 2020 1 commit
-
-
Neel Kant authored
-
- 29 Apr, 2020 1 commit
-
-
Neel Kant authored
-
- 24 Apr, 2020 4 commits
- 23 Apr, 2020 1 commit
-
-
Neel Kant authored
-
- 21 Apr, 2020 1 commit
-
-
Neel Kant authored
-
- 20 Apr, 2020 1 commit
-
-
Neel Kant authored
-
- 17 Apr, 2020 1 commit
-
-
Neel Kant authored
-
- 16 Apr, 2020 8 commits
-
-
Mohammad authored
-
Mohammad authored
-
Neel Kant authored
-
Jared Casper authored
-
Jared Casper authored
-
Neel Kant authored
-
Neel Kant authored
-
Mohammad authored
-
- 15 Apr, 2020 3 commits
- 14 Apr, 2020 6 commits
- 13 Apr, 2020 3 commits
-
-
Jared Casper authored
Do not cast data returned from indexed_dataset to int64, rely on caller to cast to appropriate type.
-
Jared Casper authored
-
Mohammad authored
-
- 12 Apr, 2020 1 commit
-
-
Mohammad authored
-
- 10 Apr, 2020 1 commit
-
-
Mohammad authored
-
- 09 Apr, 2020 3 commits
- 08 Apr, 2020 4 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
Neel Kant authored
-
Jared Casper authored
preprocess_data: - Adds ability to not split sentences. This is used for gpt2 datasets. - Adds ability to create multiple datasets from different json keys, this is current untested. indexed_dataset: - Add new "get" function to get a portion of an entry.
-