- 27 Feb, 2024 2 commits
-
-
Hailey Schoelkopf authored
Co-authored-by:Daniel Furman <dryanfurman@gmail.com>
-
Hailey Schoelkopf authored
Co-authored-by:lewtun <lewis.c.tunstall@gmail.com>
-
- 16 Jan, 2024 1 commit
-
-
haileyschoelkopf authored
-
- 15 Jan, 2024 11 commits
-
-
haileyschoelkopf authored
-
haileyschoelkopf authored
-
haileyschoelkopf authored
-
haileyschoelkopf authored
-
Hailey Schoelkopf authored
Bumping CITATION.bib to match re-adding the citation in readme. cc @StellaAthena
-
Stella Biderman authored
It looks like Google Scholar has [already noticed](https://scholar.google.com/scholar?hl=en&as_sdt=0%2C9&authuser=2&q=%22A+framework+for+few-shot+language+model+evaluation%2C+12+2023%22&btnG=) the updated citation block so let's add it back in.
-
Lintang Sutawika authored
* rewor documentation for explaining local dataset * fix typo * Update new_task_guide.md
-
Hailey Schoelkopf authored
* add WIP device_map overrides * update handling outside of accelerate launcher * change .to(device) log to debug level * run linter
-
Lintang Sutawika authored
* benchmark yamls allow minor edits of already registered tasks * add documentation * removed print
-
Hailey Schoelkopf authored
* Make parallelize=True distinction clearer in documentation. * run linter
-
Hailey Schoelkopf authored
-
- 13 Jan, 2024 3 commits
-
-
daniel-furman authored
-
daniel-furman authored
-
daniel-furman authored
-
- 12 Jan, 2024 3 commits
-
-
Hailey Schoelkopf authored
-
jp authored
* Add: kobest config file * Add: kobest utils * Add: README * Update utils.py
-
Hailey Schoelkopf authored
-
- 11 Jan, 2024 9 commits
-
-
Stella Biderman authored
-
Hailey Schoelkopf authored
* fix incorrect lookback protections * bump generate_until task versions
-
daniel-furman authored
-
daniel-furman authored
-
daniel-furman authored
-
daniel-furman authored
-
daniel-furman authored
-
daniel-furman authored
-
Tanishq Abraham authored
* multimedqa * Update medqa.yaml * move to benchmarks folder * add README.md --------- Co-authored-by:Lintang Sutawika <lintang@sutawika.com>
-
- 10 Jan, 2024 11 commits
-
-
Baber Abbasi authored
* Refine scoring logic for multiple_target "exact_match" metric * skip old tests from master * skip old tests from master * delete tests from master
-
James A. Michaelov authored
-
Baber Abbasi authored
-
daniel-furman authored
-
daniel-furman authored
-
daniel-furman authored
-
daniel-furman authored
-
daniel-furman authored
-
daniel-furman authored
-
Daniel Furman authored
-
daniel-furman authored
-