- 08 Dec, 2022 1 commit
-
-
Leo Gao authored
-
- 07 Dec, 2022 1 commit
-
-
Stella Biderman authored
Cleanup `README.md` and package deps
-
- 02 Dec, 2022 2 commits
- 25 Nov, 2022 8 commits
-
-
Stella Biderman authored
Add the original `LAMBADA` dataset
-
jon-tow authored
-
-
Stella Biderman authored
Remove unsupported `gitlab` link in pre-commit config
-
jon-tow authored
-
jon-tow authored
-
jon-tow authored
-
- 04 Nov, 2022 1 commit
-
-
Jonathan Tow authored
* Add `TextSynth` API
-
- 01 Nov, 2022 2 commits
-
-
Stella Biderman authored
Remove custom datasets that are in HF
-
Stella Biderman authored
[deps] Use minimum versioning for `numexpr`
-
- 25 Oct, 2022 1 commit
-
-
jon-tow authored
-
- 15 Sep, 2022 4 commits
-
-
Stella Biderman authored
Manually concat tokenizer revision with subfolder
-
jon-tow authored
-
jon-tow authored
-
jon-tow authored
-
- 23 Jul, 2022 1 commit
-
-
Stella Biderman authored
Fix make_disjoint_window for tail case
-
- 22 Jul, 2022 1 commit
-
-
Rich Hankins authored
When the continuation list contains a single entry, make_disjoint_window incorrectly truncates the context list to be empty. The fix checks for the non-overlapping case and, in that case, simply returns the existing lists.
-
- 24 Jun, 2022 2 commits
-
-
Stella Biderman authored
Fix key access in squad evaluation metrics
-
Konstantin Schulz authored
-
- 16 Jun, 2022 2 commits
-
-
Stella Biderman authored
Update decontamination.md
-
researcher2 authored
-
- 11 Jun, 2022 2 commits
-
-
-
jon-tow authored
-
- 17 May, 2022 1 commit
-
-
Stella Biderman authored
Update datasets/README.md
-
- 16 May, 2022 2 commits
- 12 May, 2022 2 commits
-
-
Jonathan Tow authored
Add bootstrap_iters kwarg
-
-
- 11 May, 2022 5 commits
-
-
Stella Biderman authored
Decontam import fix
-
Niklas Muennighoff authored
-
jon-tow authored
-
jon-tow authored
-
jon-tow authored
-
- 03 May, 2022 2 commits
-
-
Jonathan Tow authored
Add pre-commit
-
jon-tow authored
-