- 28 Apr, 2022 5 commits
-
-
Zachary Mueller authored
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
amyeroberts authored
-
conan1024hao authored
* dd parameter --config_overrides for run_mlm_wwm.py * linter
-
Yih-Dar authored
* fix * Apply suggestions from code review Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 27 Apr, 2022 14 commits
-
-
Dat Quoc Nguyen authored
The emoji version must be either 0.5.4 or 0.6.0. Newer emoji versions have been updated to newer versions of the Emoji Charts, thus not consistent with the one used for pre-processing the pre-training Tweet corpus (i.e. not consistent with the vocab).
-
Yih-Dar authored
* Add -e flag * add check * create new keys * run python setup.py build install * add comments * change to develop Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Fix doc notebooks links * Remove missing section
-
Zachary Mueller authored
* Fixup all examples
-
Sylvain Gugger authored
* Fix multiple deletions of the same files in save_pretrained * Add is_main_process argument
-
Sylvain Gugger authored
-
Mishig Davaadorj authored
BertModelForSequenceClassification -> BertForSequenceClassification
-
Antoni Baum authored
* Fix `distributed_concat` with scalar tensor * Update trainer_pt_utils.py
-
NielsRogge authored
* Add fix * Apply suggestion from code review Co-authored-by:Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Leonid Boytsov authored
1. Fixes evaluation errors popping up when you train/eval on squad v2 (one was newly encountered and one that was previously reported Running SQuAD 1.0 sample command raises IndexError #15401 but not completely fixed). 2. Removes boolean arguments that don't use store_true. Please, don't use these: *ANY non-empty string is being converted to True in this case and this clearly is not the desired behavior (and it creates a LOT of confusion). 3. All no-trainer test scripts are now saving metric values in the same way (with the right prefix eval_), which is consistent with the trainer-based versions. 4. Adds forgotten model.eval() in the no-trainer versions. This improved some results, but not everything (see the discussion in the end). Please, see the F1 scores and the discussion below.
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
NielsRogge authored
* Add first draft * Improve script and README * Improve README * Apply suggestions from code review * Improve script, add link to resulting model * Add corresponding test * Adjust learning rate
-
Anton Lozhkov authored
* Avoid repeated per-lang filtering * Language groups and logits preprocessing * Style
-
- 26 Apr, 2022 7 commits
-
-
Yongliang Shen authored
-
nikkie authored
-
Yang Ming authored
-
Krishna Sirumalla authored
* add roformer onnx config
-
Ahmed Elnaggar authored
FIx Iterations for decoder
-
Manuel authored
* apply torch int div * black linting fixup * update path to torch_int_div * clarify imports
-
Sylvain Gugger authored
* Limit the use of PreTrainedModel.device * Fix
-
- 25 Apr, 2022 11 commits
-
-
-
Sanchit Gandhi authored
-
Joao Gante authored
Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Rushi Chaudhari authored
* added deit onnx config
-
Joao Gante authored
Co-authored-by:Matt <Rocketknight1@users.noreply.github.com>
-
Joao Gante authored
* XLA min len, forced eos, and forced bos Co-authored-by:Matt <Rocketknight1@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* add torch.cuda.empty_cache in some PT RAG tests * torch.cuda.empty_cache in tearDownModule() * tearDown() * add gc.collect() Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* add missing ckpt in config docs * add more missing ckpt in config docs * fix wrong ckpts * fix realm ckpt * fix s2t2 * fix xlm_roberta ckpt * Fix for deberta v2 * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * use only one checkpoint for DPR * Apply suggestions from code review Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com>
-
Patrick von Platen authored
* fix doc test * fix doc test Co-authored-by:Patrick <patrick@pop-os.localdomain>
-
Thomas Chaigneau authored
Co-authored-by:ChainYo <t.chaigneau.tc@gmail.com>
-
- 23 Apr, 2022 1 commit
-
-
Patrick von Platen authored
* [DocTests] Fix some doc tests * hacky fix * correct
-
- 22 Apr, 2022 2 commits
-
-
cavdard authored
* changes in create optimizer to support tensor parallelism with SMP * Update src/transformers/trainer.py Convert if check to one line. Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Cavdar <dcavdar@a07817b12d7e.ant.amazon.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Joao Gante authored
-