- 20 Dec, 2023 1 commit
-
-
Alex Bäuerle authored
* feat: add option to upload results to Zeno
* config-based upload supporting different task types and metrics
* upload tasks as individual projects
* wording
* readme
* add example notebook
* Update documentation for Zeno integration
* Make zeno deps an extra
* Update README.md
* Document extra deps installation
* Update zeno_visualize.py
* fix: balance parens
* fix typo
* fix merge commit I botched
* Update zeno_visualize.py
* Update logger warning stmt
* fix whitespace
---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 15 Dec, 2023 1 commit
-
-
Baber Abbasi authored
-
- 13 Dec, 2023 1 commit
-
-
Baber Abbasi authored
* unpack group; add output_path to arg
* Add `vllm` to overview
-
- 06 Dec, 2023 6 commits
- 17 Nov, 2023 1 commit
-
-
lintangsutawika authored
-
- 20 Oct, 2023 2 commits
-
-
Michael Pieler authored
-
Michael Pieler authored
-
- 18 Oct, 2023 1 commit
-
-
haileyschoelkopf authored
-
- 19 Sep, 2023 2 commits
-
-
Lintang Sutawika authored
-
Lintang Sutawika authored
-
- 14 Sep, 2023 1 commit
-
-
Chris authored
-
- 10 Sep, 2023 1 commit
-
-
Hojjat Mokhtarabadi authored
-
- 15 Jul, 2023 1 commit
-
-
baberabb authored
-
- 14 Jul, 2023 2 commits
-
-
lintangsutawika authored
-
lintangsutawika authored
-
- 19 Jun, 2023 1 commit
-
-
haileyschoelkopf authored
-
- 16 Jun, 2023 1 commit
-
-
lintangsutawika authored
-
- 15 Jun, 2023 1 commit
-
-
lintangsutawika authored
-
- 12 Jun, 2023 1 commit
-
-
Hailey Schoelkopf authored
* add wip gsm8k yaml
* cleanup tasks dir
* push gsm8k yaml changes
* rename gpt2.py
* add updated gsm8k , triviaqa baseline
* add new cot yaml
* allow for multiple filter pipelines, new filter types
* updated gsm8k + sampling gen configs
* cleanup self-consistency yaml
* push outline for advanced docs
* push docs checklist
* switch to inheritance for many tasks
* acc_norm and acc_mutual_info fixed
* fix missing newline in error msg
* remove many .py tasks
* updated GSM8k
* added more doc
* Update advanced_task_guide.md Added list of parameters
* Update advanced_task_guide.md
* Added details on listing metrics
* Update advanced_task_guide.md
* Added more explanation
* modify current default filter name
* add new tags to tasks
* remove a lingering print()
* add rest of param docs, cleanup deprecated fields
* push docs update
* move ALL_TASKS definition location
* confirm write_out.py works if no description dict passed
---------
Co-authored-by: lintangsutawika <lintang@sutawika.com>
-
- 01 Jun, 2023 1 commit
-
-
gakada authored
* Fix tokenization issue in BaseLM.loglikelihood
* Add a regression script
* Use entire non-continuation length as context
---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 06 May, 2023 1 commit
-
-
Julen Etxaniz authored
-
- 28 Apr, 2023 1 commit
-
-
yurodiviy authored
* Support bigbench-hard json tasks using multiple_choice_grade
* Add support for greedy decoding in bigbench tasks
* move bigbench_resources to datasets
* rectify changes to rf.greedy_until w upstream
* make path to resource import reflect new location
---------
Co-authored-by: haileyschoelkopf <hailey.schoelkopf@yale.edu>
-
- 02 Dec, 2022 1 commit
-
-
jon-tow authored
-
- 11 May, 2022 1 commit
-
-
jon-tow authored
-
- 03 May, 2022 3 commits
-
-
jon-tow authored
-
Fabrizio Milo authored
-
Fabrizio Milo authored
-
- 29 Apr, 2022 1 commit
-
- 28 Apr, 2022 1 commit
-
-
jordiclive authored
-
- 27 Apr, 2022 2 commits
-
-
jon-tow authored
-
jordiclive authored
-
- 25 Apr, 2022 1 commit
-
-
jon-tow authored
-
- 13 Mar, 2022 2 commits
-
-
researcher2 authored
-
researcher2 authored
Add documentation for decontamination. Improvements to 13-gram generation.
-
- 03 Mar, 2022 1 commit
-
-
researcher2 authored
-