- 16 Oct, 2021 1 commit
-
-
Suraj Patil authored
-
- 15 Oct, 2021 4 commits
-
-
Patrick von Platen authored
* up * finish * up * up * finish
-
Murilo Gon莽alves authored
-
Anton Lozhkov authored
* Working encoder * SEW-D and tests * Further conv fixes * Automodels and conv inits * Update integration tests, add docs * Docs cleanup, resolve todos * Conf fix * Fix docs * Fix tests, apply suggestions * Update src/transformers/models/sew/modeling_sew.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Model conversion and updated no-mask tests * Remove copy of feature_proj * Style * Update src/transformers/models/auto/feature_extraction_auto.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/auto/feature_extraction_auto.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Move orgs Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
jacksukk authored
* fixed horizon_length * fixed horizon_length * fix style
-
- 14 Oct, 2021 10 commits
-
-
Lysandre Debut authored
* Scatter dummies + skip pipeline tests * Add torch scatter to build docs
-
Lukas Weiner authored
Raise exceptions instead of asserts in src/transformers/models/bart/modeling_flax_[bart, marian, mbart, pegasus].py (#13939) * Raise exceptions instead of asserts * fix: fixed failing quality check with copies * fix: fixed max line length * rerun github ci, failed to install dependencies
-
Patrick von Platen authored
-
Lysandre Debut authored
-
Sylvain Gugger authored
* Add strong test for configuration attributes * Add fake modif to trigger all tests * Add a better fake modif * Ignore is_encoder_decoder * Fix faulty configs * Remove fake modif
-
Sylvain Gugger authored
This reverts commit 5b6bd4e7.
-
David del R铆o Medina authored
-
Patrick von Platen authored
-
Sylvain Gugger authored
-
Li-Huai (Allan) Lin authored
* Remove wrong model_args of config.from_pretrained * Fix tf & flax
-
- 13 Oct, 2021 1 commit
-
-
NielsRogge authored
* First draft * Update self-attention of RoBERTa as proposition * Improve conversion script * Add TrOCR decoder-only model * More improvements * Make forward pass with pretrained weights work * More improvements * Some more improvements * More improvements * Make conversion work * Clean up print statements * Add documentation, processor * Add test files * Small improvements * Some more improvements * Make fix-copies, improve docs * Make all vision encoder decoder model tests pass * Make conversion script support other models * Update URL for OCR image * Update conversion script * Fix style & quality * Add support for the large-printed model * Fix some issues * Add print statement for debugging * Add print statements for debugging * Make possible fix for sinusoidal embedding * Further debugging * Potential fix v2 * Add more print statements for debugging * Add more print sta...
-
- 12 Oct, 2021 7 commits
-
-
Stas Bekman authored
-
Yih-Dar authored
* Add cross attentions to TFGPT2Model * Add TFEncoderDecoderModel * Add TFBaseModelOutputWithPoolingAndCrossAttentions * Add cross attentions to TFBertModel * Fix past or past_key_values argument issue * Fix generation * Fix save and load * Add some checks and comments * Clean the code that deals with past keys/values * Add kwargs to processing_inputs * Add serving_output to TFEncoderDecoderModel * Some cleaning + fix use_cache value issue * Fix tests + add bert2bert/bert2gpt2 tests * Fix more tests * Ignore crossattention.bias when loading GPT2 weights into TFGPT2 * Fix return_dict_in_generate in tf generation * Fix is_token_logit_eos_token bug in tf generation * Finalize the tests after fixing some bugs * Fix another is_token_logit_eos_token bug in tf generation * Add/Update docs * Add TFBertEncoderDecoderModelTest * Clean test script * Add TFEncoderDecoderModel to...
-
Nicolas Patry authored
384 // 4 < 128 would break `doc_stride`.
-
Patrick von Platen authored
* correct long to bool * up * correct code
-
Mishig Davaadorj authored
-
Hardian Lawi authored
-
Lysandre Debut authored
-
- 11 Oct, 2021 10 commits
-
-
Patrick von Platen authored
* adapt wav2vec2 * add example * add files * adapt * remove bogus file * Apply suggestions from code review * adapt files more * upload changes * del old files * up * up * up * up * up * correct gradient checkpoitning * add readme * finish * finish * up * more fixes * up * up * add demo run to readme * up
-
Lahfa Samy authored
Replace assert by ValueError of src/transformers/models/electra/modeling_{electra,tf_electra}.py and all other models that had copies (#13955) * Replace all assert by ValueError in src/transformers/models/electra * Reformat with black to pass check_code_quality test * Change some assert to ValueError of modeling_bert & modeling_tf_albert * Change some assert in multiples models * Change multiples models assertion to ValueError in order to validate check_code_style test and models template test. * Black reformat * Change some more asserts in multiples models * Change assert to ValueError in modeling_layoutlm.py to fix copy error in code_style_check * Add proper message to ValueError in modeling_tf_albert.py Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Simplify logic in models/bert/modeling_bert.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Add ValueError message to models/convbert/modeling_tf_convbert.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Add error message for ValueError to modeling_tf_electra.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Simplify logic in models/tapas/modeling_tapas.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Simplify logic in models/electra/modeling_electra.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Add ValueError message in src/transformers/models/bert/modeling_tf_bert.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Simplify logic in src/transformers/models/rembert/modeling_rembert.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Simplify logic in src/transformers/models/albert/modeling_albert.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Lukas Weiner authored
-
Sylvain Gugger authored
-
Midhun R Nair authored
-
Luis F. Talavera R authored
-
Jungwoo Park authored
-
Patrick von Platen authored
[Gradient checkpoining] Correct disabling `find_unused_parameters` in Trainer when gradient checkpointing is enabled (#13961) * up * correct test
-
Sylvain Gugger authored
* Honor existing attention mask in tokenzier.pad * Fix initialization of attention mask * Roll the implem on all subclasses * Fix tests
-
Lahfa Samy authored
* Raise ValueError exception instead of assert * Remove f unnecessary f-strings * Remove unused f-strings
-
- 09 Oct, 2021 1 commit
-
-
oraby8 authored
-
- 08 Oct, 2021 6 commits
-
-
Lysandre Debut authored
* Update bug-report.md * Update .github/ISSUE_TEMPLATE/bug-report.md Co-authored-by:
Suraj Patil <surajp815@gmail.com> * Update .github/ISSUE_TEMPLATE/bug-report.md Co-authored-by:
Suraj Patil <surajp815@gmail.com> * Update .github/ISSUE_TEMPLATE/bug-report.md Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update .github/ISSUE_TEMPLATE/bug-report.md Co-authored-by:
Anton Lozhkov <aglozhkov@gmail.com> Co-authored-by:
Suraj Patil <surajp815@gmail.com> Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
Anton Lozhkov <aglozhkov@gmail.com>
-
Chungman Lee authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Patrick von Platen authored
* up * Update src/transformers/generation_stopping_criteria.py * finish
-