Commits · c77b60c1764a82593ac96a320c8e332d5eccdffc · gaoqiong / lm-evaluation-harness

11 May, 2021 1 commit

Leo Gao authored May 10, 2021

model_args should only contain things that affect the output of the model.
Therefore, stuff like batch size, device, etc. shouldn't be in there.

5f42f976
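
The separation described in this commit message can be sketched as follows. This is a minimal illustration, not the harness's actual code: `RunConfig`, `parse_model_args`, `cache_key`, the `key=value` string format, and the idea of deriving a cache key from the model settings are all assumptions made for the example. It shows one plausible benefit of the rule: anything keyed on `model_args` stays stable when only runtime settings change.

```python
import hashlib
import json
from dataclasses import dataclass, field


def parse_model_args(arg_string):
    """Parse 'key=value,key=value' into a dict (hypothetical format)."""
    if not arg_string:
        return {}
    return dict(pair.split("=", 1) for pair in arg_string.split(","))


@dataclass
class RunConfig:
    """Hypothetical container that keeps model settings apart from runtime settings."""
    model: str
    model_args: dict = field(default_factory=dict)  # affects the model's output
    batch_size: int = 1                              # runtime only
    device: str = "cpu"                              # runtime only

    def cache_key(self):
        """Fingerprint built only from settings that can change the output."""
        blob = json.dumps(
            {"model": self.model, "model_args": self.model_args},
            sort_keys=True,
        )
        return hashlib.sha256(blob.encode("utf-8")).hexdigest()[:12]


args = parse_model_args("pretrained=gpt2")
small_run = RunConfig("gpt2", args, batch_size=1, device="cpu")
large_run = RunConfig("gpt2", args, batch_size=16, device="cuda")
other_model = RunConfig("gpt2", parse_model_args("pretrained=gpt2-medium"))
```

Here `small_run` and `large_run` share a cache key because they differ only in batch size and device, while `other_model` gets a different one.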

05 May, 2021 3 commits
- Minor changes · f16c301e
  Leo Gao authored May 05, 2021
  
  f16c301e
- Refactor PerplexityTask · 1e7f884d
  Leo Gao authored May 05, 2021
  
  1e7f884d
- Begin refactoring perplexity code · b0cf0163
  Leo Gao authored May 04, 2021
  
  b0cf0163
03 May, 2021 1 commit
- gpt2 perplexity · 9454c839
  Jason Phang authored May 02, 2021
  
  9454c839
15 Apr, 2021 3 commits
- Actually fix problems and make tests pass · 31c29e3b
  Leo Gao authored Apr 14, 2021
  
  31c29e3b
- Fix problems · fe0311b6
  Leo Gao authored Apr 14, 2021
  
  fe0311b6
- Initial implementation of gpt2 batching · be3a6a2d
  Leo Gao authored Apr 14, 2021
  
  be3a6a2d
11 Apr, 2021 2 commits
- More refactoring of model code · a586a5c4
  Leo Gao authored Apr 11, 2021
  
  a586a5c4
- Refactor gpt2 loglikelihood · 8352e671
  Leo Gao authored Apr 11, 2021
  
  8352e671
08 Apr, 2021 1 commit
- gpt2: Mask out all tokens above 50256 · 2b8956b8
  Leo Gao authored Apr 07, 2021
  
  2b8956b8
05 Apr, 2021 1 commit
- Implement partial caching · efbe6e7f
  Leo Gao authored Apr 04, 2021
```
Now, if a run gets interrupted halfway, you can easily resume
```
  efbe6e7f
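
The resume behavior mentioned above can be sketched as follows. This is a minimal illustration under stated assumptions, not the commit's implementation: `request_key`, `run_with_cache`, and `make_compute` are hypothetical names, and a plain dict stands in for whatever persistent store a real run would use. The point is that each result is saved as soon as it is computed, so a second run only pays for the requests that were still missing.

```python
import hashlib
import json


def request_key(request):
    """Stable key for a request (hypothetical scheme: hash of its JSON form)."""
    blob = json.dumps(request, sort_keys=True).encode("utf-8")
    return hashlib.sha256(blob).hexdigest()


def run_with_cache(requests, compute, cache):
    """Answer each request, storing every result as soon as it is computed."""
    results = []
    for request in requests:
        key = request_key(request)
        if key not in cache:
            cache[key] = compute(request)  # saved before moving on
        results.append(cache[key])
    return results


def make_compute(log, fail_on=None):
    """Stand-in for a model call; optionally fails on one input to mimic an interruption."""
    def compute(request):
        if request["text"] == fail_on:
            raise RuntimeError("simulated interruption")
        log.append(request["text"])
        return len(request["text"])
    return compute


requests = [{"text": "a"}, {"text": "bb"}, {"text": "ccc"}]
cache = {}  # a real run would use a store that survives the process
first_log, second_log = [], []

try:
    run_with_cache(requests, make_compute(first_log, fail_on="ccc"), cache)
except RuntimeError:
    pass  # the first run died partway through

results = run_with_cache(requests, make_compute(second_log), cache)
```

After the simulated interruption, the second call computes only the one request that never finished and reads the other two from the cache.
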
02 Apr, 2021 1 commit
- Roll back last token optimization · f3fee648
  Leo Gao authored Apr 02, 2021
  
  f3fee648
31 Mar, 2021 1 commit
- Handle GPTNeoConfig not having n_ctx · 49ee4db2
  Leo Gao authored Mar 30, 2021
  
  49ee4db2
27 Mar, 2021 2 commits
- Minor updates · bc5478a1
  Leo Gao authored Mar 26, 2021
  
  bc5478a1
- Fix stuff and make tests pass · c971fa82
  Leo Gao authored Mar 26, 2021
  
  c971fa82
26 Mar, 2021 3 commits
- Add Reorderer and implement in gpt2 and gpt3 · 8e8d7c6d
  Leo Gao authored Mar 26, 2021
  
  8e8d7c6d
- More changes to make neo work · 1b4242c1
  Leo Gao authored Mar 26, 2021
  
  1b4242c1
- Update GPT2LM to handle neo based models as well · 747b851d
  Leo Gao authored Mar 26, 2021
  
  747b851d
06 Mar, 2021 1 commit
- Flag down a bunch of todos · 491283c5
  Leo Gao authored Mar 05, 2021
  
  491283c5
19 Feb, 2021 1 commit
- Add gpt2/3 tokenizer sanity check · 77b44470
  Leo Gao authored Feb 18, 2021
  
  77b44470
14 Feb, 2021 1 commit
- Add GPT2 greedy_until truncation · 8966289a
  Leo Gao authored Feb 14, 2021
  
  8966289a
11 Feb, 2021 3 commits
- Fixes to make greedy_until work · 432bd44c
  Leo Gao authored Feb 10, 2021
```
# Conflicts:
#	lm_eval/models/gpt2.py
#	lm_eval/tasks/squad.py
```
  432bd44c
- Fixes to make greedy_until work · 7b649ded
  Leo Gao authored Feb 10, 2021
  
  7b649ded
- Implement GPT2 greedy_until · e8f9dc71
  Leo Gao authored Feb 10, 2021
  
  e8f9dc71
10 Feb, 2021 1 commit
- Update gpt2 for efficiency and allow specifying model size · aab91285
  Leo Gao authored Feb 09, 2021
  
  aab91285
08 Feb, 2021 1 commit
- LM: handle empty context · 359114fd
  Leo Gao authored Feb 07, 2021
  
  359114fd
05 Feb, 2021 1 commit
- Get rid of annoying logging · c55e8237
  Leo Gao authored Feb 04, 2021
  
  c55e8237
01 Feb, 2021 1 commit
- Fix linting problems · fe4a1efd
  Leo Gao authored Feb 01, 2021
  
  fe4a1efd
28 Jan, 2021 1 commit
- Implement unit testing and fix lots of problems with tasks · 60a6fd8c
  Leo Gao authored Jan 27, 2021
  
  60a6fd8c
22 Jan, 2021 2 commits
- Clean up code, remove some footguns · e31b4b31
  Leo Gao authored Jan 21, 2021
  
  e31b4b31
- Implement isgreedy · e723d3d5
  Leo Gao authored Jan 21, 2021
  
  e723d3d5
09 Jan, 2021 1 commit
- Refactor and implement SAT evaluation · 0f9c1624
  Leo Gao authored Jan 08, 2021
  
  0f9c1624
03 Jan, 2021 1 commit
- Fix memory problem · f298ca76
  Leo Gao authored Jan 03, 2021
  
  f298ca76
28 Dec, 2020 2 commits
- Update · e41a082c
  Leo Gao authored Dec 27, 2020
  
  e41a082c
- Update interfaces · 76e65788
  Leo Gao authored Dec 27, 2020
  
  76e65788
30 Nov, 2020 2 commits

- Remove num_tokens · e3031e84
  Leo Gao authored Nov 30, 2020
  
  e3031e84
- Refactor to remove generate and fix some bad tokenization · 90e50b4c
  Leo Gao authored Nov 30, 2020

In particular, the following assumptions are FALSE in general:
tokenize(context + continuation) = tokenize(context) + tokenize(continuation)
len(tokenize(context + continuation)) = len(tokenize(context)) + len(tokenize(continuation))
tokenize(context + continuation)[:len(tokenize(context))] = tokenize(context)

So we need to tip-toe around the problem by being careful with how we do it.

In particular, using GPT2TokenizerFast is not just for performance: while the behavior of GPT2Tokenizer differs across Transformers 2 and 3, that of GPT2TokenizerFast doesn't.

90e50b4c
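
The three false assumptions listed in this commit message can be reproduced with a toy tokenizer. This is a minimal sketch: the greedy longest-match tokenizer and its four-entry `VOCAB` are invented for illustration and are not GPT-2's BPE, though they mimic its habit of folding a leading space into the following token. `encode_pair` is a hypothetical helper showing one cautious approach (encode the two parts separately so the boundary is known by construction); it is not necessarily what the commit does.

```python
# Toy vocabulary: " world" carries its leading space, as GPT-2 style tokens often do.
VOCAB = ["hello", " world", "world", " "]


def tokenize(text):
    """Split text by always taking the longest vocabulary entry that matches."""
    tokens = []
    pos = 0
    while pos < len(text):
        match = max(
            (tok for tok in VOCAB if text.startswith(tok, pos)),
            key=len,
            default=None,
        )
        if match is None:
            raise ValueError(f"no token matches at position {pos}")
        tokens.append(match)
        pos += len(match)
    return tokens


def encode_pair(context, continuation):
    """Hypothetical helper: encode each part on its own so the split point is explicit."""
    return tokenize(context), tokenize(continuation)


context, continuation = "hello ", "world"
joint = tokenize(context + continuation)
ctx_tokens, cont_tokens = encode_pair(context, continuation)

# All three assumptions fail for this input:
same_tokens = joint == ctx_tokens + cont_tokens
same_length = len(joint) == len(ctx_tokens) + len(cont_tokens)
same_prefix = joint[:len(ctx_tokens)] == ctx_tokens
```

With this vocabulary the joint string tokenizes to `["hello", " world"]`, while the separately encoded parts give `["hello", " "]` and `["world"]`, so `same_tokens`, `same_length`, and `same_prefix` are all `False`.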

31 Oct, 2020 1 commit
- Put gpt2 in eval mode · 8d7d2132
  Leo Gao authored Oct 31, 2020
  
  8d7d2132
04 Oct, 2020 1 commit
- Fix GPT2 impl partially · ffea4dc5
  Leo Gao authored Oct 03, 2020
```
TODO: still need to add `until` everywhere
```
  ffea4dc5