- 08 Mar, 2021 12 commits
-
-
Stas Bekman authored
* fix sharded ddp enum * test fixes * stronger validation + apex breaks other tests
-
Stas Bekman authored
* more readable test * add all the missing places * one more nltk * better exception check * revert
-
Sylvain Gugger authored
* Fix version control with anchors * Simplify
-
Stas Bekman authored
-
Mehrad Moradshahi authored
* Fix Marian decoding Tokenizer's decode and batch_decode now accepts a new argument (use_source_tokenizer) which indicates whether the source spm should be used to decode ids. This is useful for Marian models specificallly when decoding source input ids. * Adapt docstrings Co-authored-by:Sylvain Gugger <sylvain.gugger@gmail.com>
-
Lysandre authored
-
Lysandre Debut authored
* Enable torch 1.8.0 in GPU CI * Disable torch-scatter
-
Suraj Patil authored
* fix tests * emb should be a parameter * fix positional embeddings * fix make_weights * don't save pos embeds * add comment to describe the clamping
-
Oren Amsalem authored
-
Eunhyuk Shin authored
-
Stas Bekman authored
-
Yu authored
-
- 06 Mar, 2021 3 commits
-
-
Suraj Patil authored
* m2m_100 * no layernorm_embedding * sinusoidal positional embeddings * update pos embeddings * add default config values * tokenizer * add conversion script * fix config * fix pos embed * remove _float_tensor * update tokenizer * update lang codes * handle lang codes * fix pos embeds * fix spm key * put embedding weights on device * remove qa and seq classification heads * fix convert script * lang codes pn one line * fix embeds * fix tokenizer * fix tokenizer * add fast tokenizer * style * M2M100MT => M2M100 * fix copyright, style * tokenizer converter * vocab file * remove fast tokenizer * fix embeds * fix tokenizer * fix tests * add tokenizer tests * add integration test * quality * fix model name * fix test * doc * doc * fix doc * add copied from statements * fix tokenizer tests * apply review suggestions * fix urls * fix shift_tokens_right * apply review suggestions * fix * fix doc * add lang code to id * remove unused function * update checkpoint names * fix copy * fix tokenizer * fix checkpoint names * fix merge issue * style
-
Lysandre authored
-
Stas Bekman authored
* offline mode start * add specific values * fix fallback * add test * better values check and range * test that actually works * document the offline mode * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * more strict check * cleaner test * pt-only test * style Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 05 Mar, 2021 11 commits
-
-
Daniel Hug authored
* Refactor checkpoint name in ALBERT and ALBERT_tf * Refactor checkpoint name in BART and BART_tf * Refactor checkpoint name in BERT generation * Refactor checkpoint name in Blenderbot_tf * Refactor checkpoint name in Blenderbot_small_tf * Refactor checkpoint name in ConvBERT AND CONVBERT_TF * Refactor checkpoint name in CTRL AND CTRL_TF * Refactor checkpoint name in DistilBERT AND DistilBERT_TF * Refactor checkpoint name in DistilBERT redo * Refactor checkpoint name in Electra and Electra_tf * Refactor checkpoint name in FlauBERT and FlauBERT_tf * Refactor checkpoint name in FSMT * Refactor checkpoint name in GPT2 and GPT2_tf * Refactor checkpoint name in IBERT * Refactor checkpoint name in LED and LED_tf * Refactor checkpoint name in Longformer and Longformer_tf * Refactor checkpoint name in Lxmert and Lxmert_tf * Refactor checkpoint name in Marian_tf * Refactor checkpoint name in MBART and MBART_tf * Refactor checkpoint name in MobileBERT and MobileBERT_tf * Refactor checkpoint name in mpnet and mpnet_tf * Refactor checkpoint name in openai and openai_tf * Refactor checkpoint name in pegasus_tf * Refactor checkpoint name in reformer * Refactor checkpoint name in Roberta and Roberta_tf * Refactor checkpoint name in SqueezeBert * Refactor checkpoint name in Transformer_xl and Transformer_xl_tf * Refactor checkpoint name in XLM and XLM_tf * Refactor checkpoint name in XLNET and XLNET_tf * Refactor checkpoint name in BERT_tf * run make tests, style, quality, fixup
-
Lysandre Debut authored
* Add stale bot to Github Actions * Update message * Message for assignee * Update scripts/stale.py * Uncomment & stop testing
-
Sylvain Gugger authored
* Fix embeddings for PyTorch 1.8 * Try with PyTorch 1.8.0 * Fix embeddings init * Fix copies * Typo * More typos
-
Chen Liang authored
DEBERTA_PRETRAINED_MODEL_ARCHIVE_LIST => DEBERTA_V2_PRETRAINED_MODEL_ARCHIVE_LIST in line 31.
-
Joakim Warholm authored
-
Lysandre Debut authored
* Only run one test * Patch segfault * Fix summarization pipeline * Ready for merge
-
Patrick von Platen authored
-
Nicolas Patry authored
-
Lysandre authored
-
lewtun authored
-
Lysandre authored
-
- 04 Mar, 2021 6 commits
-
-
Patrick von Platen authored
* first step to refactor * make all fast tests pass * make all slow tests pass * save intermediate * correct cache * finish PR * make fp16 work
-
Sylvain Gugger authored
* Rework TPU checkpointing in Trainer * Wraps the barrier in a dist test * Address review comments * Remove line
-
Philipp Schmid authored
* removed overwrites * remove default value for output_dir * adjusted typing
-
Sylvain Gugger authored
-
Sylvain Gugger authored
This reverts commit f3660613.
-
Sylvain Gugger authored
-
- 03 Mar, 2021 8 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Fix gradient accumulation for SM Model Parallelism * Style and divide loss by grad accum steps
-
felixgwu authored
* fix all_hidden_states * use output_states instead of next_kv
-
Stas Bekman authored
* remap classes to strings * missing new util * style * doc * move the autogenerated file * Trigger CI
-
Sylvain Gugger authored
* Refactor checkpoint name in BERT and MobileBERT * Add option to check copies * Add QuestionAnswering * Add last models * Make black happy
-
Jeff Yang authored
* feat(docs): navigate with left/right arrow keys * fix: add missing comma
-
Patrick von Platen authored
* fix speed degradation bug t5 * fix for all models * fix code quality
-
WybeKoper authored
Co-authored-by:WybeKoper <WybeKoper@users.noreply.github.com>
-