- 11 Feb, 2021 5 commits
-
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
* save intermediate * finish batch the same as fairseq * add normalization * fix batched input * add better comment * Update src/transformers/models/wav2vec2/modeling_wav2vec2.py * add nice docstring * add tokenizer tests * make all slow tests pass * finish PR * correct import
-
Tanmay Thakur authored
* Update:community.md, new nb add * feat: updated grammar on nb description * Update: Train summarizer for BlenderBotSmall
-
Qbiwan authored
* remove xnli_compute_metrics, add load_dataset, load_metric, set_seed,metric.compute,load_metric * fix * fix * fix * push * fix * everything works * fix init * fix * special treatment for sepconv1d * style *
🙏🏽 * add doc and cleanup * fix doc * fix doc again * fix doc again * Apply suggestions from code review * make style * Proposal that should work * Remove needless code * Fix test * Apply suggestions from code review * remove xnli_compute_metrics, add load_dataset, load_metric, set_seed,metric.compute,load_metric * amend README * removed data_args.task_name and replaced with task_name = "xnli"; use split function to load train and validation dataset separately; remove __post_init__; remove flag --task_name from README. * removed dict task_to_keys, use str "xnli" instead of variable task_name, change preprocess_function to use examples["premise"], examples["hypothesis"] directly, remove sentence1_key and sentence2_key, change compute_metrics function to cater only to accuracy metric, add condition for train_language is None when using dataset.load_dataset() * removed `torch.distributed.barrier()` and `import torch` as `from_pretrained` is able to do the work; amend README
-
- 10 Feb, 2021 9 commits
-
-
Stas Bekman authored
* free up memory at the end of train * rework tests * consistent formatting * correction
-
Suraj Patil authored
* add forced logits processors * delete adjust_logits method * add forced_eos_token_id argument in config * add tests for forced logits processors * update gen utils tests * add forced option to tf generate * remove adjust_logits method from tf models * update adjust_logits for marian * delete _force_token_id_to_be_generated method * style * import warnings * pass max_length to _get_logits_processor * set forced_eos_token_id to None * set forced attributes in conf utils * typo * fix rag generate * add forced_eos_token_id in rag config * remove force_bos_token_to_be_generated from BartConfig * remove _force_token_ids_generation from FSMT * nit * fix negative constant * apply suggestions from code review
-
Julien Plu authored
* Fix test * Remove commented test * Fix name * Apply style * Fix check copies * Remove prints * Restore boolean * Fix reshape
-
Lysandre Debut authored
-
Stas Bekman authored
-
Stas Bekman authored
I assume the CI machine should have at least 4 cores, so let's build docs faster
-
Stas Bekman authored
* how to specify a specific gpu * new paper * expand on buffer sizes * style * where to find config examples * specific example * small updates
-
Anthony MOI authored
-
Shiva Zamani authored
-
- 09 Feb, 2021 16 commits
-
-
Boris Dayma authored
* doc: update W&B related doc * doc(wandb): mention report_to * doc(wandb): commit suggestion Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * doc(wandb): fix typo * doc(wandb): remove WANDB_DISABLED Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
abhishek thakur authored
-
Sylvain Gugger authored
-
Suraj Patil authored
* fix rag generate and tests * put back adjust_logits_during_generation * tests are okay Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Julien Plu authored
* Replace tf.newaxis -> tf.expand_dims * Fix tests * Fix tests * Use reshape when a tensor needs a double expand * Fix GPT2 * Fix GPT2
-
Daniel Stancl authored
* Add head masking to TF LED * Add head_mask to Longformer + one doc piece to LED * Fix integration tests
-
Sylvain Gugger authored
-
Lysandre Debut authored
* Enable propagation by default * Document enable/disable default handler
-
Suraj Patil authored
* add do_predict, pass eval_beams during eval * update help * apply suggestions from code review
-
Julien Plu authored
-
abhishek thakur authored
* Fix example in Wav2Vec2 documentation * fix style
-
Lysandre authored
-
Patrick von Platen authored
* add wav2vec2CTC and deprecate for maskedlm * remove from docs
-
Lysandre authored
-
- 08 Feb, 2021 10 commits
-
-
sandip authored
-
Stas Bekman authored
-
demSd authored
-
Juan Cruz-Benito authored
Removing run_pl_glue.py from text classification docs, include run_xnli.py & run_tf_text_classification.py (#10066) * Removing run_pl_glue.py from seq classification docs * Adding run_tf_text_classification.py * Using :prefix_link: to refer local files * Applying "make style" to the branch * Update docs/source/task_summary.rst Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Removing last underscores Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Lysandre authored
-
Stas Bekman authored
* deepspeed bug fixes and tests * manual wrap?
-
Anthony MOI authored
-
noise-field authored
* Unify logging with f-strings * Get limits from MLflow rather than hardcode * Add a check for parameter length overflow. Also, constants are marked as internal * Don't stop run in on_train_end. This causes bad behaviour when there is a separate validation step: validation gets recorded as a separate run. * Fix style
-
Olivier authored
* replace -100 token ids with the tokenizer pad_id for compute_metrics * fixed typo for label_ids
-
Lysandre Debut authored
-