- 02 Aug, 2023 3 commits
-
-
Lintang Sutawika authored
[Refactor] Fix Max Length arg
-
Lintang Sutawika authored
-
lintangsutawika authored
-
- 01 Aug, 2023 15 commits
-
-
Lintang Sutawika authored
[Refactor] Benchmark scripts
-
Lintang Sutawika authored
-
Hailey Schoelkopf authored
Update README.md
-
Hailey Schoelkopf authored
-
Lintang Sutawika authored
-
Lintang Sutawika authored
[Refactor] Cleanup for `big-refactor`
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
haileyschoelkopf authored
-
haileyschoelkopf authored
-
- 31 Jul, 2023 3 commits
-
-
Hailey Schoelkopf authored
[Refactor] Updated anthropic to new API
-
baberabb authored
-
Lintang Sutawika authored
[Refactor] Fixes for when using `num_fewshot`
-
- 28 Jul, 2023 4 commits
- 27 Jul, 2023 2 commits
-
-
lintangsutawika authored
-
lintangsutawika authored
Merge branch 'big-refactor' of https://github.com/EleutherAI/lm-evaluation-harness into benchmark-scripts
-
- 25 Jul, 2023 7 commits
-
-
Hailey Schoelkopf authored
[Refactor] Use console script
-
Hailey Schoelkopf authored
Remove condition to check for `winograd_schema`
-
Lintang Sutawika authored
Early stop bug of greedy_until (primary_until should be a list of str)
-
Lintang Sutawika authored
[Refactor] Migrate xwinograd tasks to yaml
-
Lintang Sutawika authored
-
Lintang Sutawika authored
-
Lintang Sutawika authored
-
- 24 Jul, 2023 6 commits
-
-
Hailey Schoelkopf authored
[Refactor] Fix tests
-
lintangsutawika authored
-
Lintang Sutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-