Commits · 9ad2fc3d99ecc5a3149bfa86b0bbfb483eda446e · gaoqiong / lm-evaluation-harness

03 Jul, 2023 1 commit
- Account for padding in inplen calculation · 9ad2fc3d
  Hailey Schoelkopf authored Jul 03, 2023
  
  9ad2fc3d
31 May, 2023 1 commit
- fix p-tuning inaccuracy, because output logit contains virtual token length · d8bf52c6
  Wang, Yi authored May 30, 2023
```
Signed-off-by: Wang, Yi <yi.a.wang@intel.com>
```
  d8bf52c6
30 May, 2023 1 commit
- changed quantized type signature and default val (#532) · 441e6ac1
  Sam Passaglia authored May 30, 2023
  
  441e6ac1
27 May, 2023 1 commit
- Merge pull request #519 from gakada/gptq · 095d8406
  Stella Biderman authored May 26, 2023
```
Add support for loading GPTQ models via AutoGPTQ
```
  095d8406
26 May, 2023 1 commit
- GPTQ: add auto-gptq extra, add gptq_use_triton parameter · c11ad4f2
  gk authored May 26, 2023
  
  c11ad4f2
25 May, 2023 2 commits
- Extend `dtype` command line flag to `HFLM` (#523) · 8cff2bea
  Hailey Schoelkopf authored May 25, 2023
```
* allow for hf-causal to take dtype arg

* document this change
```
  8cff2bea
- GPTQ: fix README and support default names · b465cd01
  gk authored May 25, 2023
  
  b465cd01
23 May, 2023 4 commits
- Merge pull request #522 from EleutherAI/fix-mgpt-fewshot · 4e94af6f
  Lintang Sutawika authored May 24, 2023
  
  4e94af6f
- fixed fewshot prompt by filling [mask] · 25699d3e
  lintangsutawika authored May 23, 2023
  
  25699d3e
- re implemented fewshot_context method in the class to allow custom prompt for fewshot · 5a8ac198
  lintangsutawika authored May 23, 2023
  
  5a8ac198
- Add support for loading GPTQ models via AutoGPTQ · b296c4f6
  gk authored May 23, 2023
  
  b296c4f6
21 May, 2023 4 commits
- Merge pull request #481 from janEbert/json-task · 84ef60ee
  Stella Biderman authored May 21, 2023
```
Add perplexity task on arbitrary JSON data
```
  84ef60ee
- Merge branch 'master' into json-task · 4de8a74e
  Stella Biderman authored May 21, 2023
  
  4de8a74e
- Merge pull request #492 from juletx/eval-info · bda68845
  Stella Biderman authored May 21, 2023
```
Add option to dump prompts and completions to a JSON file
```
  bda68845
- Merge pull request #480 from kenhktsui/float-limit · 96a83d45
  Stella Biderman authored May 21, 2023
```
Evaluation Against Portion of Benchmark Data
```
  96a83d45
19 May, 2023 2 commits
- Merge pull request #477 from juletx/results · e53eb332
  Stella Biderman authored May 19, 2023
```
Add results of various models in json and md format
```
  e53eb332
- Merge pull request #483 from janEbert/out-dir · d1327193
  Stella Biderman authored May 19, 2023
```
Create output path directory if necessary
```
  d1327193
18 May, 2023 1 commit
- Update README.md · e7a212ff
  Stella Biderman authored May 18, 2023
  
  e7a212ff
12 May, 2023 1 commit
- Merge remote-tracking branch 'upstream/master' into results · 92a50856
  Julen Etxaniz authored May 12, 2023
  
  92a50856
11 May, 2023 3 commits

Add multilingual datasets (XCOPA, XStoryCloze, XWinograd, PAWS-X, XNLI, MGSM) (#426) · d1451679

Julen Etxaniz authored May 11, 2023

* add xcopa dataset

* add xstory_cloze dataset and run pre-commit

* fix xcopa validation and test sets

* add xwinograd dataset

* add pawsx task

* add xnli task

* update task table with recently added tasks

* remove unused metrics from paws-x

* add mgsm task and fix gsm8k

* fix gsm8k until

* update task table

d1451679

update write out variable name · af913422
Julen Etxaniz authored May 11, 2023

af913422
update parameter names and add docs · 99b0a42d
Julen Etxaniz authored May 11, 2023

99b0a42d

10 May, 2023 4 commits
- Merge pull request #490 from jquesnelle/auto-batch-size-fix · 05550ef3
  Stella Biderman authored May 10, 2023
```
fix adaptive batch crash when there are no new requests
```
  05550ef3
- Update README.md · f71bffb0
  Stella Biderman authored May 10, 2023
  
  f71bffb0
- add --write_detailed_eval_info to dump JSON with prompts and completions · 2e046ce3
  Julen Etxaniz authored May 10, 2023
  
  2e046ce3
- fix adaptive batch crash when there are no new requests (e.g. when pulling from cache) · d424f26b
  Jeffrey Quesnelle authored May 10, 2023
  
  d424f26b
09 May, 2023 2 commits
- Update README.md · 8fc04fe5
  Stella Biderman authored May 09, 2023
  
  8fc04fe5
- Update README.md · 44a4c374
  Stella Biderman authored May 09, 2023
  
  44a4c374
08 May, 2023 3 commits
- Create output path directory if necessary · c473d7e0
  janEbert authored May 08, 2023
  
  c473d7e0
- Add perplexity task on arbitrary JSON data · 3226ed64
  janEbert authored May 08, 2023
  
  3226ed64
- add more mpt and llama results · afbf8e66
  Julen Etxaniz authored May 08, 2023
  
  afbf8e66
07 May, 2023 2 commits

Merge pull request #448 from nikhilpinnaparaju/issue-439 · 082d6db3
Stella Biderman authored May 07, 2023
```
Set PAD token to EOS token
```
082d6db3

allow float limit to represent data portion · 3fda1195

Ken Tsui authored May 07, 2023

When `limit` is <1, limit represents the percentage of the total number of examples.
If it is >=1,  then it means the number of examples per task (only use this for testing).

3fda1195

06 May, 2023 5 commits
- update task table with missing tasks · 0e849758
  Julen Etxaniz authored May 06, 2023
  
  0e849758
- add opt and mpt model results · 8a707701
  Julen Etxaniz authored May 06, 2023
  
  8a707701
- add basic markdown tables with results · 2ac318a9
  Julen Etxaniz authored May 06, 2023
  
  2ac318a9
- Update README.md · 14043a0f
  Stella Biderman authored May 06, 2023
  
  14043a0f
- add bloom, xglm and llama results · 21e128d8
  Julen Etxaniz authored May 06, 2023
  
  21e128d8
05 May, 2023 2 commits
- Sort task names to keep the same order always (#474) · 0542d35d
  Julen Etxaniz authored May 05, 2023
```
This makes comparing the results of different models easier because tasks are ordered in the same way.
```
  0542d35d
- added comment · bdc1af90
  Nikhil Pinnaparaju authored May 04, 2023
  
  bdc1af90