- 13 Jul, 2021 7 commits
-
-
qqaatw authored
-
Sylvain Gugger authored
* Add option to load a pretrained model with mismatched shapes * Fail at loading when mismatched shapes in Flax * Fix tests * Update src/transformers/modeling_flax_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Address review comments
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * fix docs
-
Jeroen Steggink authored
* Wrong model is used; it should be character instead of subword. In the original Google repo for CANINE there was a mix-up in the model names in the README.md, which was fixed 2 weeks ago. Since this transformer model was created before that fix, it probably resulted in the wrong model being used in this example. s = subword, c = character * canine.rst style fix * Update docs/source/model_doc/canine.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Styling canine.rst * Added links to model cards. * Fixed links to model cards.
Co-authored-by: Jeroen Steggink <978411+jsteggink@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Nick Doiron authored
* Allow ByT5 type in Flax T5 script * use T5TokenizerFast * change up tokenizer config * model_args * reorder imports * Update run_t5_mlm_flax.py
-
Lysandre Debut authored
* encode_plus() shouldn't run for W2V2CTC * Typo
-
Lysandre Debut authored
-
- 12 Jul, 2021 19 commits
-
-
Omar Sanseviero authored
-
Kevin Canwen Xu authored
-
Kevin Canwen Xu authored
* README Translation for Chinese (Simplified) * update link * h3->h4 * html refactor * update model list * fix * Add a translation guide * format * update * typo * Refine wording
-
Philip May authored
-
Ahmed Khaled authored
-
Philipp Schmid authored
-
Lysandre Debut authored
* Cleanup test * Skip TF TransfoXL test
-
Lysandre Debut authored
-
Sylvain Gugger authored
* PoC, it pickles! * Remove old method. * Apply to every auto object
-
Matt authored
* Adding a TF summarization example * Style pass * Style fixes * Updates for review comments * Adding README * Style pass * Remove unused import
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Lysandre Debut authored
-
Lysandre Debut authored
* Skip TestMarian_MT_EN * Skip EN_ZH and EN_ROMANCE * Skip EN_ROMANCE pipeline
-
Lewis Bails authored
Co-authored-by: Lewis Bails <Lewis.Bails@infomedia.dk>
-
Suraj Patil authored
-
Eduardo Gonzalez Ponferrada authored
-
Bhadresh Savani authored
-
- 10 Jul, 2021 3 commits
-
-
Stas Bekman authored
-
Stas Bekman authored
* wip * complete the doc * missing img * improve * correction * Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Stas Bekman authored
-
- 09 Jul, 2021 11 commits
-
-
Will Rice authored
* TFHubert * Update with TFWav2Vec Bug Fixes * Add OOV Error * Feedback changes * Fix kwargs call
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * up
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * finish better examples for marian flax
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * fix mt5 auto
-
Alex Hedges authored
* Pass model_kwargs when loading a model in pipeline * Add test for model_kwargs parameter of pipeline() * Rewrite test to not download model * Fix failing style checks
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Base test * More test * Fix mistake * Add a docstring change * Add doc ignore * Simplify logic for unk token in Unigram tokenizers * Remove changes from other branch
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * fix save problem
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * add marian * finish make style * add model * add docs * add test * add integration tests * up * solve bug * correct tests * correct some tests * Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* correct adapt marian * finish
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Nicolas Patry authored
* This will reduce the "Already borrowed" error. Original issue: https://github.com/huggingface/tokenizers/issues/537

The original issue is caused by transformers repeatedly calling mutable functions on the Rust tokenizers. Rust needs to guarantee that only one agent holds a mutable reference to memory at a given time (for many reasons which don't need explaining here). Usually, the Rust compiler can guarantee this property at compile time. Python cannot provide that guarantee, so PyO3, the bridge between Rust and Python used by `tokenizers`, replaces the compile-time guarantee with a dynamic one: if multiple agents try to hold mutable borrows at the same time, the runtime fails with "Already borrowed".

The proposed fix here in transformers is simply to reduce the number of calls that really need mutable borrows. Fewer such calls means less risk of running into the "Already borrowed" error. The caveat is that we now add a call to read the current configuration of the `_tokenizer`, so in the worst case we have 2 calls instead of 1, and in the best case we have 1 call plus a Python comparison of a dict (which should be negligible).

* Adding a test. * trivial error :(. * Update tests/test_tokenization_fast.py
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>
* Adding reference to original issues in the tests. * Update the tests with fast tokenizer.
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>
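The "read the current configuration, compare, and only then mutate" idea described in this commit message can be sketched roughly as follows. This is an illustration under stated assumptions, not the code from the commit: `FakeBackendTokenizer` and `set_truncation` are hypothetical names, and the fake backend only mimics the shape of a Rust-backed tokenizer (a readable `truncation` property plus mutating `enable_truncation` / `no_truncation` methods) so that the number of mutating calls can be counted.

```python
class FakeBackendTokenizer:
    """Hypothetical stand-in for a Rust-backed tokenizer (not the real class)."""

    def __init__(self):
        self._truncation = None
        self.mutable_calls = 0  # counts calls that would need a mutable borrow

    @property
    def truncation(self):
        # Read-only access: return a copy of the current configuration.
        return None if self._truncation is None else dict(self._truncation)

    def enable_truncation(self, max_length, stride=0, strategy="longest_first"):
        # Mutating call: in the real library this needs a mutable borrow.
        self.mutable_calls += 1
        self._truncation = {
            "max_length": max_length,
            "stride": stride,
            "strategy": strategy,
        }

    def no_truncation(self):
        # Mutating call as well.
        self.mutable_calls += 1
        self._truncation = None


def set_truncation(backend, max_length=None, stride=0, strategy="longest_first"):
    """Apply the requested truncation only if it differs from the current one.

    Returns True when a mutating call was made, False when it was skipped.
    """
    if max_length is None:
        target = None
    else:
        target = {"max_length": max_length, "stride": stride, "strategy": strategy}

    # One read plus a plain dict comparison replaces an unconditional mutation.
    if backend.truncation == target:
        return False

    if target is None:
        backend.no_truncation()
    else:
        backend.enable_truncation(**target)
    return True


backend = FakeBackendTokenizer()
# Repeated calls with the same settings mutate the backend only once.
results = [set_truncation(backend, max_length=128) for _ in range(3)]
print(results, backend.mutable_calls)
```

In this sketch the worst case is one read plus one mutating call, and the best case is one read plus a dict comparison, which matches the trade-off the commit message describes.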
-
Omar Sanseviero authored
-