Commits · deecaa1a8ae2c2afddc168e24ae02ff3dbebb95b · gaoqiong / lm-evaluation-harness

02 Aug, 2023 3 commits
- Merge pull request #723 from EleutherAI/fix_max_length · deecaa1a
  Lintang Sutawika authored Aug 03, 2023
```
[Refactor] Fix Max Length arg
```
  deecaa1a
- Update task.py · 53105bba
  Lintang Sutawika authored Aug 02, 2023
  
  53105bba
- fix to add a case for if a user add `max_length` to generation_kwargs · ba864e09
  lintangsutawika authored Aug 02, 2023
  
  ba864e09
01 Aug, 2023 15 commits
- Merge pull request #612 from EleutherAI/benchmark-scripts · d88a566c
  Lintang Sutawika authored Aug 01, 2023
```
[Refactor] Benchmark scripts
```
  d88a566c
- Merge branch 'big-refactor' into benchmark-scripts · 29f12dd9
  Lintang Sutawika authored Aug 01, 2023
  
  29f12dd9
- Merge pull request #720 from EleutherAI/lintangsutawika-patch-1 · 4168c05f
  Hailey Schoelkopf authored Aug 01, 2023
```
Update README.md
```
  4168c05f
- Update README.md · aa848227
  Hailey Schoelkopf authored Aug 01, 2023
  
  aa848227
- Update README.md · 8ebc85e8
  Lintang Sutawika authored Aug 01, 2023
  
  8ebc85e8
- Merge pull request #686 from EleutherAI/cleanup · 546fd5cd
  Lintang Sutawika authored Aug 01, 2023
```
[Refactor] Cleanup for `big-refactor`
```
  546fd5cd
- update on metrics and delet files · e37698df
  lintangsutawika authored Aug 01, 2023
  
  e37698df
- added rte · 1bc408ff
  lintangsutawika authored Aug 01, 2023
  
  1bc408ff
- fix configs · 223da391
  lintangsutawika authored Aug 01, 2023
  
  223da391
- skips metrics prep if process_result is not None · 1813bf04
  lintangsutawika authored Aug 01, 2023
  
  1813bf04
- process aggregate fix · 0eb94c8b
  lintangsutawika authored Aug 01, 2023
  
  0eb94c8b
- process int label to string for greedy_until · b2598de8
  lintangsutawika authored Aug 01, 2023
  
  b2598de8
- remove comments · 8da6033f
  lintangsutawika authored Aug 01, 2023
  
  8da6033f
- fix Pile dataset name · 540d468e
  haileyschoelkopf authored Aug 01, 2023
  
  540d468e
- merge conflicts · 16bc6bc0
  haileyschoelkopf authored Aug 01, 2023
  
  16bc6bc0
31 Jul, 2023 3 commits
- Merge pull request #710 from baberabb/big-refactor_claude · 465c695b
  Hailey Schoelkopf authored Jul 31, 2023
```
[Refactor] Updated anthropic to new API
```
  465c695b
- fixed generation_kwargs; added dependency groups to testing on CI · 471297ba
  baberabb authored Jul 31, 2023
  
  471297ba
- Merge pull request #702 from EleutherAI/num_fewshot-bug · 6efc8d5e
  Lintang Sutawika authored Jul 31, 2023
```
[Refactor] Fixes for when using `num_fewshot`
```
  6efc8d5e
28 Jul, 2023 4 commits
- fixed error handling · b8510001
  baberabb authored Jul 29, 2023
  
  b8510001
- passed kwargs to client · 3a3655d6
  baberabb authored Jul 28, 2023
  
  3a3655d6
- added kwargs to client · fe358061
  baberabb authored Jul 28, 2023
  
  fe358061
- updated anthropic to new API · 8ffa0e67
  baberabb authored Jul 28, 2023
  
  8ffa0e67
27 Jul, 2023 2 commits
- fix on output_path_file · 7d8f5469
  lintangsutawika authored Jul 27, 2023
  
  7d8f5469
- Merge branch 'big-refactor' of... · b7cd829b
  lintangsutawika authored Jul 27, 2023
```
Merge branch 'big-refactor' of https://github.com/EleutherAI/lm-evaluation-harness into benchmark-scripts
```
  b7cd829b
25 Jul, 2023 7 commits
- Merge pull request #703 from EleutherAI/use_console_script · 4e44f0aa
  Hailey Schoelkopf authored Jul 25, 2023
```
[Refactor] Use console script
```
  4e44f0aa
- Merge pull request #690 from EleutherAI/lintangsutawika-patch-1 · 6b1897d6
  Hailey Schoelkopf authored Jul 25, 2023
```
Remove condition to check for `winograd_schema`
```
  6b1897d6
- Merge pull request #700 from ZZR0/greedy_until--patch-1 · 05c879cc
  Lintang Sutawika authored Jul 25, 2023
```
Early stop bug of greedy_until (primary_until should be a list of str)
```
  05c879cc
- Merge pull request #695 from yeoedward/xwinograd · 8e8212fc
  Lintang Sutawika authored Jul 25, 2023
```
[Refactor] Migrate xwinograd tasks to yaml
```
  8e8212fc
- __setitem__ allows setting key-value like dict · e0b3cbf5
  Lintang Sutawika authored Jul 25, 2023
  
  e0b3cbf5
- reorder · f148c2e2
  Lintang Sutawika authored Jul 25, 2023
  
  f148c2e2
- Reorder · 0b8446a4
  Lintang Sutawika authored Jul 25, 2023
  
  0b8446a4
24 Jul, 2023 6 commits
- Merge pull request #693 from baberabb/big-refactor_fixfin · d553e060
  Hailey Schoelkopf authored Jul 24, 2023
```
[Refactor] Fix tests
```
  d553e060
- added `lm-eval` and `lm_eval` as command to use instead of `python main.py` · 20d70067
  lintangsutawika authored Jul 24, 2023
  
  20d70067
- Update evaluator.py · 0cd992b1
  Lintang Sutawika authored Jul 24, 2023
  
  0cd992b1
- Use num_fewshot set in yaml and show warning if it's being overwritten by argparse · 8add5ed6
  lintangsutawika authored Jul 24, 2023
  
  8add5ed6
- add condition if --task is not a benchmark · 2d96a8c8
  lintangsutawika authored Jul 24, 2023
  
  2d96a8c8
- removed promptsource yaml file · ed304c1d
  lintangsutawika authored Jul 24, 2023
  
  ed304c1d