"...git@developer.sourcefind.cn:chenpangpang/open-webui.git" did not exist on "5437d90a251cb8782456cc163b9b6d4222d145bb"
- 30 Apr, 2019 2 commits
- 25 Apr, 2019 7 commits
-
-
thomwolf authored
-
Thomas Wolf authored
fixed BertForMultipleChoice model init and forward pass
-
Thomas Wolf authored
Docs for new learning rate code
-
Thomas Wolf authored
Fix training schedules in examples to match new API
-
lukovnikov authored
-
lukovnikov authored
-
lukovnikov authored
- added some images for illustration - updated comments in optimization
-
- 24 Apr, 2019 2 commits
-
-
Thomas Wolf authored
Hubconf
-
thomwolf authored
-
- 23 Apr, 2019 4 commits
-
-
thomwolf authored
-
Thomas Wolf authored
Fix --reduce_memory in finetune_on_pregenerated
-
Thomas Wolf authored
Fix indentation weirdness in GPT-2 example.
-
Thomas Wolf authored
Learning rate schedules improvement + extension
-
- 22 Apr, 2019 2 commits
-
-
Matthew Carrigan authored
-
Matt authored
Pulling commits from main repo
-
- 21 Apr, 2019 4 commits
-
-
Sangwhan Moon authored
-
lukovnikov authored
-
lukovnikov authored
- removed unused plotting code - using ABC for LRSchedule - added some schedule object init tests
-
lukovnikov authored
-
- 17 Apr, 2019 16 commits
-
-
Ailing Zhang authored
-
Ailing Zhang authored
-
Thomas Wolf authored
Updating network handling
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
Thomas Wolf authored
Fix indentation for unconditional generation
-
Thomas Wolf authored
Fix gradient overflow issue during attention mask
-
Thomas Wolf authored
[run_gpt2.py] temperature should be a float, not int
-
Thomas Wolf authored
Gpt2 tokenization
-
thomwolf authored
-
thomwolf authored
-
- 16 Apr, 2019 3 commits
-
-
Ben Mann authored
-
Abhi Sharma authored
This fix is in reference to issue #382. GPT2 can now be trained in mixed precision, which I've confirmed with testing. I also tested unconditional generation on multiple seeds before and after changing 1e10 to 1e4 and there was no difference. Please let me know if there is anything else I can do to make this pull request better. Thanks for all your work!
-
Abhi Sharma authored
-