- 08 Apr, 2020 3 commits
-
-
Jared Casper authored
-
Jared Casper authored
preprocess_data:
  - Adds ability to not split sentences. This is used for gpt2 datasets.
  - Adds ability to create multiple datasets from different json keys; this is currently untested.
indexed_dataset:
  - Adds new "get" function to get a portion of an entry.
-
Mohammad authored
-
- 07 Apr, 2020 1 commit
-
-
Mohammad authored
-
- 03 Apr, 2020 1 commit
-
-
Neel Kant authored
-
- 01 Apr, 2020 1 commit
-
-
Mohammad authored
-
- 30 Mar, 2020 4 commits
- 29 Mar, 2020 1 commit
-
-
Mohammad authored
-
- 28 Mar, 2020 2 commits
- 27 Mar, 2020 1 commit
-
-
Mohammad authored
-
- 24 Mar, 2020 1 commit
-
-
Mohammad Shoeybi authored
-
- 17 Mar, 2020 1 commit
-
-
Mohammad Shoeybi authored
-
- 26 Dec, 2019 1 commit
-
-
Mohammad Shoeybi authored
-
- 25 Dec, 2019 1 commit
-
-
Mohammad Shoeybi authored
-
- 04 Dec, 2019 1 commit
-
-
Mohammad Shoeybi authored
-
- 24 Nov, 2019 4 commits
-
-
Mohammad Shoeybi authored
-
Mohammad Shoeybi authored
-
Mohammad Shoeybi authored
-
Mohammad Shoeybi authored
-
- 21 Nov, 2019 1 commit
-
-
Mohammad Shoeybi authored
-
- 20 Nov, 2019 2 commits
-
-
Mohammad Shoeybi authored
-
Mohammad Shoeybi authored
-
- 19 Nov, 2019 1 commit
-
-
Jared Casper authored
-
- 13 Nov, 2019 1 commit
-
-
Jared Casper authored
Some cleanup of types, etc. in the helpers cpp code.
-
- 11 Nov, 2019 1 commit
-
-
Mohammad Shoeybi authored
-
- 09 Nov, 2019 2 commits
-
-
Jared Casper authored
-
Jared Casper authored
Also removed the fix_lua_indexing stuff from fairseq.
-
- 08 Nov, 2019 5 commits
-
-
Mohammad Shoeybi authored
-
Mohammad Shoeybi authored
-
Mohammad Shoeybi authored
-
Mohammad Shoeybi authored
-
Jared Casper authored
-
- 07 Nov, 2019 2 commits
-
-
Jared Casper authored
-
Mohammad Shoeybi authored
-