Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
"megatron/core/pipeline_parallel/schedules.py" did not exist on "bea16fa3319abf7901d6b434ec4becdac5a684f6"
3fe4b022e64b93318f302259baadf09f811b9808
Switch branch/tag
lm-evaluation-harness
lm_eval
config
task.py
08 Oct, 2025
1 commit
fixup fewshots
· 3fe4b022
Baber
authored
Oct 08, 2025
3fe4b022
25 Sep, 2025
14 commits
update default values; fixes
· b89af51e
Baber
authored
Jul 10, 2025
b89af51e
move test one doc to method
· 7cef4d38
Baber
authored
Jul 23, 2025
7cef4d38
overload Task methods if callable in yaml dict
· ec767666
Baber
authored
Jul 23, 2025
ec767666
remove deps; types
· 4ad6cd9f
Baber
authored
Jul 22, 2025
4ad6cd9f
make multiple_input explicit
· 689e0c91
Baber
authored
Jul 22, 2025
689e0c91
`check_gold_index_error` util; fix `process_results`; rm generate_until multiple-choice
· d9876b22
Baber
authored
Jul 22, 2025
d9876b22
improve metric aggregation default and higher-better checks; add `TaskConfig.from_template`
· d19bd889
Baber
authored
Jul 21, 2025
d19bd889
cleanup
· 69d14fb3
Baber
authored
Jul 21, 2025
69d14fb3
remove prompt-source for now
· 70f5e2f0
Baber
authored
Jul 18, 2025
70f5e2f0
refactor: improve dataset and metric handling in TaskConfig
· 227f1a74
Baber
authored
Jul 08, 2025
227f1a74
refactor: update type hints and improve filter ensemble construction
· 3b4d0af1
Baber
authored
Jul 08, 2025
3b4d0af1
cleanup
· c81c03ee
Baber
authored
Jul 08, 2025
c81c03ee
serialize better
· 674611e9
Baber
authored
Jul 05, 2025
674611e9
refactor configs to files
· 57adbd35
Baber
authored
Jul 04, 2025
57adbd35