- 22 Feb, 2024 1 commit
-
-
Ayush Thakur authored
* add wandb as extra dependency
* wandb metrics logging
* refactor
* log samples as tables
* fix linter
* refactor: put in a class
* change dir
* add panels
* log eval as table
* improve tables logging
* improve reports logging
* precommit run
* ruff check
* handle importing reports api gracefully
* ruff
* compare results
* minor pre-commit fixes
* build comparison report
* ruff check
* log results as artifacts
* remove comparison script
* update dependency
* type annotate and docstring
* add example
* update readme
* fix typo
* teardown
* handle outside wandb run
* gracefully fail reports creation
* precommit checks
* add report url to summary
* use wandb printer for better url stdout
* fix ruff
* handle N/A and groups
* fix eval table
* remove unused var
* update wandb version req + disable reports stdout
* remove reports feature to TODO
* add label to multi-choice question data
* log model predictions
* lints
* loglikelihood_rolling
* log eval result for groups
* log tables by group for better handling
* precommit
* choices column for multi-choice
* graciously fail wandb
* remove reports feature
* track system metrics + total eval time + stdout

---------

Co-authored-by: Lintang Sutawika <lintang@eleuther.ai>
-
- 21 Jul, 2023 1 commit
-
-
baberabb authored
-
- 14 Jul, 2023 1 commit
-
-
lintangsutawika authored
-
- 01 Jul, 2023 2 commits
-
-
FarzanehNakhaee authored
-
FarzanehNakhaee authored
-
- 29 Jun, 2023 1 commit
-
-
FarzanehNakhaee authored
-
- 16 Jun, 2023 1 commit
-
-
Lintang Sutawika authored
-
- 07 Jun, 2023 1 commit
-
-
FarzanehNakhaee authored
-
- 11 May, 2023 1 commit
-
-
lintangsutawika authored
-
- 03 May, 2022 1 commit
-
-
Fabrizio Milo authored
-
- 29 Mar, 2021 1 commit
-
-
& authored
-
- 12 Feb, 2021 3 commits
- 07 Sep, 2020 1 commit
-
-
Anish Thite authored
-
- 28 Aug, 2020 1 commit
-
-
Leo Gao authored
-