- 26 Jan, 2024 1 commit
-
-
thnkinbtfly authored
-
- 25 Jan, 2024 2 commits
-
-
Hailey Schoelkopf authored
* Update README.md * [!Tip]
-
Baber Abbasi authored
* get `doc` from instance * acceletate bugfix: get ground doc from instance * convert filter to `process_result` * get docs from instances in `FilterEnsemble` * rename * nit * better looping * fix typehint
-
- 24 Jan, 2024 2 commits
-
-
Hailey Schoelkopf authored
-
Baber Abbasi authored
-
- 23 Jan, 2024 4 commits
-
-
Baber Abbasi authored
* manage default (greedy) gen_kwargs in vllm better * mirror HF `do_sample` * just need to set temp=0 for greedy
-
Hailey Schoelkopf authored
* don't use get_task_dict() as a helper, it will download the dataset! * pre-commit * Update README.md --------- Co-authored-by:lintangsutawika <lintang@eleuther.ai>
-
Hailey Schoelkopf authored
* Update arc_easy.yaml * Update flan_cot.yaml * update HF dataset path * Update freeform.yaml * Update flan_cot.yaml --------- Co-authored-by:Lintang Sutawika <lintang@eleuther.ai>
-
Baber Abbasi authored
-
- 22 Jan, 2024 5 commits
-
-
Brian Vaughan authored
-
Brian Vaughan authored
-
Michael Goin authored
* Add `local-completions` support using OpenAI interface * Refactor oa_completion * Address tokenizer comments and change request chunks to batch size * Add warning message for tiktoken backend * fix formatting * fix whitespace * Update README.md --------- Co-authored-by:Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
Lintang Sutawika authored
* add fix fordeciding if stderr is N/A or not * process N/A
-
Hailey Schoelkopf authored
-
- 19 Jan, 2024 1 commit
-
-
Lintang Sutawika authored
-
- 18 Jan, 2024 7 commits
-
-
kwrobel.eth authored
-
Lintang Sutawika authored
* tuple should be considered as well * set option to keep callable as callable
-
Hailey Schoelkopf authored
-
Quentin Lhoest authored
-
Hailey Schoelkopf authored
-
Danielle Pintz authored
-
Hannibal046 authored
* Update nq_open.yaml change regex * Bump NQ version --------- Co-authored-by:Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 16 Jan, 2024 2 commits
-
-
Mark Saroufim authored
* Update README.md * punctuation --------- Co-authored-by:Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
Hailey Schoelkopf authored
-
- 15 Jan, 2024 7 commits
-
-
Hailey Schoelkopf authored
Bumping CITATION.bib to match re-adding the citation in readme. cc @StellaAthena
-
Stella Biderman authored
It looks like Google Scholar has [already noticed](https://scholar.google.com/scholar?hl=en&as_sdt=0%2C9&authuser=2&q=%22A+framework+for+few-shot+language+model+evaluation%2C+12+2023%22&btnG=) the updated citation block so let's add it back in.
-
Lintang Sutawika authored
* rewor documentation for explaining local dataset * fix typo * Update new_task_guide.md
-
Hailey Schoelkopf authored
* add WIP device_map overrides * update handling outside of accelerate launcher * change .to(device) log to debug level * run linter
-
Lintang Sutawika authored
* benchmark yamls allow minor edits of already registered tasks * add documentation * removed print
-
Hailey Schoelkopf authored
* Make parallelize=True distinction clearer in documentation. * run linter
-
Hailey Schoelkopf authored
-
- 12 Jan, 2024 3 commits
-
-
Hailey Schoelkopf authored
-
jp authored
* Add: kobest config file * Add: kobest utils * Add: README * Update utils.py
-
Hailey Schoelkopf authored
-
- 11 Jan, 2024 3 commits
-
-
Stella Biderman authored
-
Hailey Schoelkopf authored
* fix incorrect lookback protections * bump generate_until task versions
-
Tanishq Abraham authored
* multimedqa * Update medqa.yaml * move to benchmarks folder * add README.md --------- Co-authored-by:Lintang Sutawika <lintang@sutawika.com>
-
- 10 Jan, 2024 3 commits
-
-
Baber Abbasi authored
* Refine scoring logic for multiple_target "exact_match" metric * skip old tests from master * skip old tests from master * delete tests from master
-
James A. Michaelov authored
-
Baber Abbasi authored
-