- 31 Jan, 2022 9 commits
-
-
Sylvain Gugger authored
-
Ogundepo Odunayo authored
-
NielsRogge authored
* Fix Swin model outputs * Rename pooler
-
Suraj Patil authored
-
Jonatas Grosman authored
-
Kamal Raj authored
fix typo
-
Julien Plu authored
* Add Luke training * Fix true label tags * Fix true label tags * Fix true label tags * Update the data collator for Luke * Some training refactor for Luke * Improve data collator for Luke * Fix import * Fix datasets concatenation * Add the --max_entity_length argument for Luke models * Remove unused code * Fix style issues * Fix style issues * Move the Luke training into a separate folder * Fix style * Fix naming * Fix filtering * Fix filtering * Fix filter * Update some preprocessing * Move luke to research_projects * Checkstyle * Address comments * Fix style
-
Fran莽ois REMY authored
(This is an editorial change only)
-
NielsRogge authored
-
- 30 Jan, 2022 1 commit
-
-
Suraj Patil authored
-
- 29 Jan, 2022 4 commits
-
-
Yih-Dar authored
* Fix the inconsistency of loss calculation between PT/TF XLNetLMHeadModel * overwrite test_loss_computation Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Soonhwan-Kwon authored
* add xlm roberta xl * add convert xlm xl fairseq checkpoint to pytorch * fix init and documents for xlm-roberta-xl * fix indention * add test for XLM-R xl,xxl * fix model hub name * fix some stuff * up * correct init * fix more * fix as suggestions * add torch_device * fix default values of doc strings * fix leftovers * merge to master * up * correct hub names * fix docs * fix model * up * finalize * last fix * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add copied from * make style Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Steven Liu authored
* clean commit of changes * apply review feedback, make edits * fix backticks, minor formatting *
馃枍 make fixup and minor edits *馃枍 fix # in header *馃摑 update code sample without from_pt *馃摑 final review -
Steven Liu authored
* add model sharing tutorial *
馃枍 apply feedback from review *馃摑 make edits *馃枍 fix formatting *馃摑 convert from pt checkpoint to flax *馃摑 final review
-
- 28 Jan, 2022 11 commits
-
-
Sylvain Gugger authored
-
Yih-Dar authored
* fix missing eps * Same fix for ConvBertGeneratorPredictions * Same fix for AlbertMLMHead Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Stas Bekman authored
* [deepspeed] saving checkpoint fallback when fp16 weights aren't saved * Bump required deepspeed version to match usage when saving checkpoints * update version Co-authored-by:Mihai Balint <balint.mihai@gmail.com>
-
Suraj Patil authored
* add xglm * update vocab size * fix model name * style and tokenizer * typo * no mask token * fix pos embed compute * fix args * fix tokenizer * fix positions * fix tokenization * style and dic fixes * fix imports * add fast tokenizer * update names * add pt tests * fix tokenizer * fix typo * fix tokenizer import * fix fast tokenizer * fix tokenizer * fix converter * add tokenizer test * update checkpoint names * fix tokenizer tests * fix slow tests * add copied from comments * rst -> mdx * flax model * update flax tests * quality * style * doc * update index and readme * fix copies * fix doc * update toctrr * fix indent * minor fixes * fix config doc * don't save embed_pos weights * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * address Sylvains commnets, few doc fixes * fix check_repo * align order of arguments * fix copies * fix labels * remove unnecessary mapping * fix saving tokenizer Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
Matt authored
* Make links explicit * Removing reference to compute_metrics() since it's kind of PyTorch-specific
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
lewtun authored
* Prepare deprecated ONNX exporter for PyTorch v1.11 * Add deprecation warning
-
Ngo Quang Huy authored
-
Ngo Quang Huy authored
* Fix `bad_word_ids` not working with sentencepiece-based tokenizers * make style Co-authored-by:Patrick von Platen <patrick.v.platen@gmail.com>
-
Nicolas Patry authored
* Fixing support `batch_size` and `num_return_Sequences` in `text-generation` pipeline And `text2text-generation` too. The bug was caused by the batch_size containing both the incoming batch **and** the generated `num_sequences`. The fix simply consists into splitting both of these again into different dimensions. * TF support. * Odd backward compatibility script in the way.
-
Yanming Wang authored
* Use syncfree AdamW for xla:gpu device by default * Make syncfree AdamW optional
-
- 27 Jan, 2022 15 commits
-
-
Lysandre Debut authored
* Add init to BORT * BORT should be in init
-
Steven Liu authored
*
馃枍 fix code syntax to external libraries and replace image *馃攧 revert code formatting, replace image with code block *馃枍 apply feedback -
Sylvain Gugger authored
* Allow dynamic modules to use relative imports * Add tests * Add one last test * Changes
-
dependabot[bot] authored
Bumps [numpy](https://github.com/numpy/numpy) from 1.19.2 to 1.21.0. - [Release notes](https://github.com/numpy/numpy/releases) - [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt) - [Commits](https://github.com/numpy/numpy/compare/v1.19.2...v1.21.0 ) --- updated-dependencies: - dependency-name: numpy dependency-type: direct:production ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [notebook](http://jupyter.org ) from 6.1.5 to 6.4.1. --- updated-dependencies: - dependency-name: notebook dependency-type: direct:production ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
dependabot[bot] authored
Bumps [numpy](https://github.com/numpy/numpy) from 1.19.2 to 1.21.0. - [Release notes](https://github.com/numpy/numpy/releases) - [Changelog](https://github.com/numpy/numpy/blob/main/doc/HOWTO_RELEASE.rst.txt) - [Commits](https://github.com/numpy/numpy/compare/v1.19.2...v1.21.0 ) --- updated-dependencies: - dependency-name: numpy dependency-type: direct:production ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
Sylvain Gugger authored
-
Lysandre authored
-
Lysandre authored
-
Matt authored
* Example script for PushToHubCallback * Expanding description slightly
-
Sylvain Gugger authored
* Add proper documentation for Keras callbacks * Add dummies
-
Matt authored
-
Sylvain Gugger authored
Co-authored-by:
osanseviero <osanseviero@gmail.com> Co-authored-by:
osanseviero <osanseviero@gmail.com>
-
SaulLu authored
* add new test * add a feature to same the sentencepiece tokenizer model when the init file was deleted * update marian * update m2m_100 * fix marian * update speech to text * override test for layoutxlm * fix saving bartpho * remove harcoded values bartpho * special token string version * finish bartpho * override layoutxml test * add mbart * move special tokens list * format * Revert "format" This reverts commit 37a40df37903a932c2f951cbd33acb684246bae7. * simplify list of string of special tokens * Re-write `self.fairseq_tokens_to_ids ` initialization logic with special tokens Co-authored-by:
Sylvain Gugger <sylvain.gugger@gmail.com> Co-authored-by:
Sylvain Gugger <sylvain.gugger@gmail.com>
-
Anton Lozhkov authored
* Device argument for the eval script * Default to none * isort
-