- 26 Feb, 2024 1 commit
-
Lintang Sutawika authored
* add brier_score
* process brier_score
* brier score is working for N-sized class
* fixed brier score
* add TED to BigBench and Brier score to MMLU
* format
* Update metrics.py
* Update task.py
* Update generate_until_template_yaml
* Delete lm_eval/tasks/bigbench/aux_metric.py
* Update generate_until_template_yaml
* Update _default_template_yaml
-
- 04 Dec, 2023 2 commits
-
lintangsutawika authored
-
lintangsutawika authored
-
- 29 Nov, 2023 2 commits
- 28 Nov, 2023 16 commits
-
Lintang Sutawika authored
-
Stella Biderman authored
-
Stella Biderman authored
-
Stella Biderman authored
Changing the name since this is not actually the big bench format.
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
- 27 Nov, 2023 15 commits
-
Stella Biderman authored
-
Stella Biderman authored
-
Stella Biderman authored
-
Stella Biderman authored
-
haileyschoelkopf authored
-
haileyschoelkopf authored
-
baberabb authored
-
Hailey Schoelkopf authored
-
Hailey Schoelkopf authored
-
Hailey Schoelkopf authored
-
baberabb authored
-
baberabb authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
- 26 Nov, 2023 1 commit
-
haileyschoelkopf authored
-
- 24 Nov, 2023 3 commits
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-