- 27 Apr, 2022 9 commits
-
-
Sylvain Gugger authored
* Fix multiple deletions of the same files in save_pretrained * Add is_main_process argument
-
Sylvain Gugger authored
-
Mishig Davaadorj authored
BertModelForSequenceClassification -> BertForSequenceClassification
-
Antoni Baum authored
* Fix `distributed_concat` with scalar tensor * Update trainer_pt_utils.py
-
NielsRogge authored
* Add fix * Apply suggestion from code review Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Leonid Boytsov authored
1. Fixes evaluation errors that appear when you train/eval on SQuAD v2: one newly encountered, and one previously reported in "Running SQuAD 1.0 sample command raises IndexError" (#15401) but not completely fixed.
2. Removes boolean arguments that don't use store_true. Please don't use these: ANY non-empty string is converted to True in this case, which is clearly not the desired behavior (and it creates a LOT of confusion).
3. All no-trainer test scripts now save metric values in the same way (with the right prefix, eval_), which is consistent with the trainer-based versions.
4. Adds the forgotten model.eval() call in the no-trainer versions. This improved some results, but not all of them (see the discussion at the end). Please see the F1 scores and the discussion below.
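The problem described in item 2 can be reproduced with the standard library alone. The sketch below is illustrative and is not taken from the example scripts; the option names `--risky_flag` and `--safe_flag` are hypothetical. It shows that `type=bool` applies `bool()` to the raw string, so even the text "False" parses as True, while `store_true` behaves as a real on/off switch.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with one problematic and one safe boolean option."""
    parser = argparse.ArgumentParser()
    # Pitfall: type=bool calls bool() on the raw string, and bool("False") is True.
    parser.add_argument("--risky_flag", type=bool, default=False)
    # Safer: store_true takes no value; the option is False unless it is present.
    parser.add_argument("--safe_flag", action="store_true")
    return parser


parser = build_parser()
risky = parser.parse_args(["--risky_flag", "False"])
safe_off = parser.parse_args([])
safe_on = parser.parse_args(["--safe_flag"])

print(risky.risky_flag)   # True, even though the user typed "False"
print(safe_off.safe_flag)  # False
print(safe_on.safe_flag)   # True
```

With `store_true`, a flag is enabled by passing it and disabled by leaving it out, so no string value can be misread.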
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
NielsRogge authored
* Add first draft * Improve script and README * Improve README * Apply suggestions from code review * Improve script, add link to resulting model * Add corresponding test * Adjust learning rate
-
Anton Lozhkov authored
* Avoid repeated per-lang filtering * Language groups and logits preprocessing * Style
-
- 26 Apr, 2022 7 commits
-
-
Yongliang Shen authored
-
nikkie authored
-
Yang Ming authored
-
Krishna Sirumalla authored
* add roformer onnx config
-
Ahmed Elnaggar authored
Fix Iterations for decoder
-
Manuel authored
* apply torch int div * black linting fixup * update path to torch_int_div * clarify imports
-
Sylvain Gugger authored
* Limit the use of PreTrainedModel.device * Fix
-
- 25 Apr, 2022 11 commits
-
-
-
Sanchit Gandhi authored
-
Joao Gante authored
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Rushi Chaudhari authored
* added deit onnx config
-
Joao Gante authored
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
-
Joao Gante authored
* XLA min len, forced eos, and forced bos Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* add torch.cuda.empty_cache in some PT RAG tests * torch.cuda.empty_cache in tearDownModule() * tearDown() * add gc.collect() Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* add missing ckpt in config docs * add more missing ckpt in config docs * fix wrong ckpts * fix realm ckpt * fix s2t2 * fix xlm_roberta ckpt * Fix for deberta v2 * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * use only one checkpoint for DPR * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
-
Patrick von Platen authored
* fix doc test * fix doc test Co-authored-by: Patrick <patrick@pop-os.localdomain>
-
Thomas Chaigneau authored
Co-authored-by: ChainYo <t.chaigneau.tc@gmail.com>
-
- 23 Apr, 2022 1 commit
-
-
Patrick von Platen authored
* [DocTests] Fix some doc tests * hacky fix * correct
-
- 22 Apr, 2022 7 commits
-
-
cavdard authored
* changes in create optimizer to support tensor parallelism with SMP * Update src/transformers/trainer.py Convert if check to one line. Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Cavdar <dcavdar@a07817b12d7e.ant.amazon.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Joao Gante authored
-
Thomas Chaigneau authored
* add OnnxConfig for ConvBert Co-authored-by: ChainYo <t.chaigneau.tc@gmail.com>
-
Minh Chien Vu authored
* Add doctest BERT * make fixup * fix typo * change checkpoints * make fixup * define doctest output value, update doctest for mobilebert * solve fix-copies * update QA target start index and end index * change checkpoint for docs and reuse defined variable * Update src/transformers/models/bert/modeling_tf_bert.py Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * make fixup * Add Doctest for Albert and Bigbird * make fixup * overwrite examples for Albert and Bigbird * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * update longer examples for Bigbird * using examples from squad_v2 * print out example text * change name token-classification-big-bird checkpoint to random Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Mario Šaško authored
* Minor improvements to `convert_file_size_to_int` * Add <unit>bit version to kilos and megas * Minor fix
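As a rough illustration of what a size-string converter with bit units might look like, here is a minimal sketch. It is not the library's `convert_file_size_to_int`: the function name `parse_file_size`, the unit table, and the convention that a lowercase trailing "b" on a decimal unit means bits are all assumptions made for this example.

```python
from typing import Union

# Hypothetical unit table: binary (IEC) and decimal (SI) multipliers, in bytes.
# Three-letter suffixes come first so "MIB" is matched before "MB" is tried.
_UNITS = {
    "GIB": 2**30,
    "MIB": 2**20,
    "KIB": 2**10,
    "GB": 10**9,
    "MB": 10**6,
    "KB": 10**3,
}


def parse_file_size(size: Union[int, str]) -> int:
    """Convert a size such as "5MB", "1GiB" or "8Kb" to a number of bytes."""
    if isinstance(size, int):
        return size
    text = size.strip()
    upper = text.upper()
    for suffix, factor in _UNITS.items():
        if upper.endswith(suffix):
            value = int(text[: -len(suffix)]) * factor
            # Assumed convention: a lowercase "b" on a decimal unit means bits.
            if len(suffix) == 2 and text.endswith("b"):
                return value // 8
            return value
    raise ValueError(f"Unrecognized size: {size!r}")


print(parse_file_size("5MB"))
print(parse_file_size("1GiB"))
print(parse_file_size("8Kb"))
```

Under these assumptions, "8Kb" is read as 8000 bits and therefore yields 1000 bytes, while "8KB" yields 8000 bytes.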
-
Joao Gante authored
-
Yih-Dar authored
* add missing entries in some mappings Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 21 Apr, 2022 5 commits
-
-
Loubna Ben Allal authored
* add tflops logging and fix grad accumulation * add accelerate tracking and checkpointing * scale loss of last batch correctly * fix typo * compress loss computation Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com> * add resume from checkpoint argument * add load_state accelerate from checkpoint, register lr scheduler and add tflops function * reformat code * reformat code * add condition on path for resume checkpoint * combine if conditions Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com> * add source for tflops formula Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Thomas Chaigneau authored
* add gptj to TOKENIZER_MAPPING_NAMES * fix int32 to float to avoid problem in onnx * Update src/transformers/models/gptj/modeling_gptj.py Co-authored-by: ChainYo <t.chaigneau.tc@gmail.com> Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
-
Eldar Kurtic authored
- All activations should be fetched through ACT2FN. - ACT2FN returns ReLU as an `nn.Module`, which allows attaching hooks to the activation function and makes it appear in the output of `print(model)`.
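The benefit of a module-based activation can be sketched with plain PyTorch. The example below does not import transformers: the `ACTIVATIONS` dictionary is a hypothetical stand-in for a name-to-module registry, and `TinyBlock` and `save_output` are illustrative names. It shows that an `nn.ReLU` stored as a submodule accepts a forward hook and is listed when the model is printed, neither of which holds for a bare function such as `torch.nn.functional.relu`.

```python
import torch
from torch import nn

# Hypothetical stand-in for a registry that maps names to activation modules.
ACTIVATIONS = {"relu": nn.ReLU()}


class TinyBlock(nn.Module):
    """A linear layer followed by an activation taken from the registry."""

    def __init__(self) -> None:
        super().__init__()
        self.dense = nn.Linear(4, 4)
        # Assigned as an attribute, so it is registered as a submodule.
        self.act = ACTIVATIONS["relu"]

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.act(self.dense(x))


captured = []


def save_output(module: nn.Module, inputs: tuple, output: torch.Tensor) -> None:
    # Forward hooks receive the module, its positional inputs, and its output.
    captured.append(output.detach())


block = TinyBlock()
handle = block.act.register_forward_hook(save_output)
out = block(torch.randn(2, 4))
handle.remove()  # detach the hook once it is no longer needed

print(block)  # the printed structure includes a ReLU() entry for `act`
```

Hooks like this are a common way to inspect or record intermediate activations without editing the model's `forward` method.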
-
Sylvain Gugger authored
-