- 18 May, 2023 1 commit
-
-
bzantium authored
-
- 13 Feb, 2022 1 commit
-
-
Leo Gao authored
-
- 05 Dec, 2021 1 commit
-
-
Leo Gao authored
-
- 24 Nov, 2021 2 commits
-
-
Jason Phang authored
-
Jason Phang authored
-
- 05 Nov, 2021 1 commit
-
-
Leo Gao authored
-
- 11 Oct, 2021 1 commit
-
-
Leo Gao authored
-
- 10 Jun, 2021 1 commit
-
-
Leo Gao authored
-
- 22 May, 2021 1 commit
-
-
Leo Gao authored
for some reason putting it in LM and having it be inherited breaks everything. should try to figure this out at some point.
-
- 11 May, 2021 1 commit
-
-
Leo Gao authored
model_args should only be things that affect output of the model therefore, stuff like batch size, device, etc shouldn't be in there
-
- 06 May, 2021 1 commit
-
-
Leo Gao authored
-
- 05 May, 2021 1 commit
-
-
Leo Gao authored
-
- 03 May, 2021 2 commits
-
-
Jason Phang authored
-
Jason Phang authored
-
- 15 Apr, 2021 1 commit
-
-
Leo Gao authored
-
- 11 Apr, 2021 2 commits
- 05 Apr, 2021 1 commit
-
-
Leo Gao authored
Now, if a run gets interrupted halfway, you can easily resume
-
- 27 Mar, 2021 1 commit
-
-
Leo Gao authored
-
- 26 Mar, 2021 1 commit
-
-
Leo Gao authored
-
- 21 Feb, 2021 2 commits
- 19 Feb, 2021 1 commit
-
-
Leo Gao authored
-
- 11 Feb, 2021 1 commit
-
-
Leo Gao authored
-
- 08 Feb, 2021 1 commit
-
-
Leo Gao authored
-
- 05 Feb, 2021 2 commits
- 04 Feb, 2021 4 commits
- 28 Jan, 2021 1 commit
-
-
Leo Gao authored
-
- 05 Jan, 2021 1 commit
-
-
Leo Gao authored
-
- 30 Nov, 2020 2 commits
-
-
Leo Gao authored
-
Leo Gao authored
In particular, the following assumptions are FALSE in general: tokenize(context + continuation) = tokenize(context) + tokenize(continuation) len(tokenize(context + continuation)) = len(tokenize(context)) + len(tokenize(continuation)) tokenize(context + continuation)[:len(tokenize(context))] = tokenize(context) So we need to tip-toe around the problem by being careful with how we do it. In particular, using Fast is not just for performance; while behavour of GPT2Tokenizer differs across Transformers 2 and 3, GPT2TokenizerFast doesn't.
-
- 04 Oct, 2020 1 commit
-
-
Leo Gao authored
-
- 14 Sep, 2020 1 commit
-
-
Jason Phang authored
-
- 07 Sep, 2020 3 commits
-
-
Jason Phang authored
-
Jason Phang authored
-
Jason Phang authored
-