Commits · 538be6da5d129d8970fa8585f92a4cd9df4eee02 · gaoqiong / lm-evaluation-harness

06 Mar, 2021 1 commit
- Flag down a bunch of todos · 491283c5
  Leo Gao authored Mar 05, 2021
  
  491283c5
23 Feb, 2021 1 commit
- Fix translation context · ba9c13b2
  Leo Gao authored Feb 22, 2021
  
  ba9c13b2
21 Feb, 2021 2 commits
- Fix gpt3 batching bug · 03f34463
  Leo Gao authored Feb 20, 2021
  
  03f34463
- Add gpt3 chunking · 1ff4e07f
  Leo Gao authored Feb 20, 2021
  
  1ff4e07f
19 Feb, 2021 1 commit
- Add gpt2/3 tokenizer sanity check · 77b44470
  Leo Gao authored Feb 18, 2021
  
  77b44470
14 Feb, 2021 1 commit
- Add GPT2 greedy_until truncation · 8966289a
  Leo Gao authored Feb 14, 2021
  
  8966289a
11 Feb, 2021 3 commits
- Fixes to make greedy_until work · 432bd44c
  Leo Gao authored Feb 10, 2021
```
# Conflicts:
#	lm_eval/models/gpt2.py
#	lm_eval/tasks/squad.py
```
  432bd44c
- Fixes to make greedy_until work · 7b649ded
  Leo Gao authored Feb 10, 2021
  
  7b649ded
- Implement GPT2 greedy_until · e8f9dc71
  Leo Gao authored Feb 10, 2021
  
  e8f9dc71
10 Feb, 2021 1 commit
- Update gpt2 for efficiency and allow specifying model size · aab91285
  Leo Gao authored Feb 09, 2021
  
  aab91285
08 Feb, 2021 1 commit
- LM: handle empty context · 359114fd
  Leo Gao authored Feb 07, 2021
  
  359114fd
05 Feb, 2021 2 commits
- Add retry with backoff for GPT3 · b1f7284e
  Leo Gao authored Feb 04, 2021
  
  b1f7284e
- Get rid of annoying logging · c55e8237
  Leo Gao authored Feb 04, 2021
  
  c55e8237
04 Feb, 2021 4 commits
- Add missing import · 1815286c
  Leo Gao authored Feb 03, 2021
  
  1815286c
- Implement gpt3 greedy_until · 5f4c7c50
  Leo Gao authored Feb 03, 2021
  
  5f4c7c50
- Massive refactor · 778e0f91
  Leo Gao authored Feb 03, 2021
```
- Extract evaluator (still needs work to clean up)
- Add tests for evaluator
- Fix all the things that break on the new tests
- Misc cleanup
```
  778e0f91
- Implement gpt3 logprobs · 52c1c56a
  Leo Gao authored Feb 03, 2021
  
  52c1c56a
01 Feb, 2021 1 commit
- Fix linting problems · fe4a1efd
  Leo Gao authored Feb 01, 2021
  
  fe4a1efd
28 Jan, 2021 1 commit
- Implement unit testing and fix lots of problems with tasks · 60a6fd8c
  Leo Gao authored Jan 27, 2021
  
  60a6fd8c
22 Jan, 2021 2 commits
- Clean up code, remove some footguns · e31b4b31
  Leo Gao authored Jan 21, 2021
  
  e31b4b31
- Implement isgreedy · e723d3d5
  Leo Gao authored Jan 21, 2021
  
  e723d3d5
09 Jan, 2021 1 commit
- Refactor and implement SAT evaluation · 0f9c1624
  Leo Gao authored Jan 08, 2021
  
  0f9c1624
05 Jan, 2021 1 commit
- Add reminder to rewrite · 08dc67ea
  Leo Gao authored Jan 05, 2021
  
  08dc67ea
03 Jan, 2021 1 commit
- Fix memory problem · f298ca76
  Leo Gao authored Jan 03, 2021
  
  f298ca76
28 Dec, 2020 2 commits
- Update · e41a082c
  Leo Gao authored Dec 27, 2020
  
  e41a082c
- Update interfaces · 76e65788
  Leo Gao authored Dec 27, 2020
  
  76e65788
30 Nov, 2020 2 commits

Remove num_tokens · e3031e84
Leo Gao authored Nov 30, 2020

e3031e84

Refactor to remove generate and fix some bad tokenization · 90e50b4c

Leo Gao authored Nov 30, 2020

In particular, the following assumptions are FALSE in general:
tokenize(context + continuation) = tokenize(context) + tokenize(continuation)
len(tokenize(context + continuation)) = len(tokenize(context)) + len(tokenize(continuation))
tokenize(context + continuation)[:len(tokenize(context))] = tokenize(context)

So we need to tip-toe around the problem by being careful with how we do it.

In particular, using Fast is not just for performance; while behavour of GPT2Tokenizer differs across Transformers 2 and 3, GPT2TokenizerFast doesn't.

90e50b4c

31 Oct, 2020 1 commit
- Put gpt2 in eval mode · 8d7d2132
  Leo Gao authored Oct 31, 2020
  
  8d7d2132
04 Oct, 2020 2 commits
- Remove residual MODEL_REGISTRY · d9e50f87
  Leo Gao authored Oct 03, 2020
  
  d9e50f87
- Fix GPT2 impl partially · ffea4dc5
  Leo Gao authored Oct 03, 2020
```
TODO: still need to add `until` everywhere
```
  ffea4dc5
17 Sep, 2020 2 commits
- refactor to explicit registries · 515d78b3
  Jason Phang authored Sep 17, 2020
  
  515d78b3
- refactor to explicit registries · 89de8d7d
  Jason Phang authored Sep 17, 2020
  
  89de8d7d
14 Sep, 2020 2 commits
- Update docs · f1ec7b06
  Jason Phang authored Sep 14, 2020
  
  f1ec7b06
- SuperGLUE, and truncation · 8161c22e
  Jason Phang authored Sep 14, 2020
  
  8161c22e
07 Sep, 2020 2 commits
- GPT-2 fixes · e7a87e71
  Jason Phang authored Sep 07, 2020
  
  e7a87e71
- lib · f88bb827
  Jason Phang authored Sep 07, 2020
  
  f88bb827