- 17 Mar, 2022 6 commits
-
-
Sylvain Gugger authored
-
Dayyan Smith authored
-
Yih-Dar authored
* fix activation Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
NielsRogge authored
* Fix device * Clean up Co-authored-by:Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Lysandre Debut authored
* fix(transfo_xl): Fixes TransfoXL support when using Trainer. * fix(tests): Uses losses_1 and losses_2 pattern with TransfoXL test. * fix(transfo_xl): Adds requested changes to allow for backward compatibility. fix(transfo_xl): Adds requested changes to allow for backward compatibility. fix(transfo_xl): Fixes code styling. * Backward compatibility * Update src/transformers/models/transfo_xl/modeling_transfo_xl.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Gustavo de Rosa <gth.rosa@uol.com.br> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Francesco Saverio Zuppichini authored
* done * done
-
- 16 Mar, 2022 15 commits
-
-
Jo茫o Gustavo A. Amorim authored
* add types annotations for Beit (PyTorch) * add types annotations for ViT (PyTorch) * add types annotations for Deit (PyTorch) * change Optional[bool] to bool into some places at Beit * change Optional[bool] to bool into some places at ViT
-
Patrick von Platen authored
* up * fix min lengths
-
Francesco Saverio Zuppichini authored
* padding done * correctly return one attention per layer * almost correct, attentions are not flatten one tuple per stage * tests green * doc * conversations * reshaping hidden_states * view in the test * reshape_hidden_states in Encoder and Model * new outputs with reshaped_hidden_states * conversations * doc * Update docs/source/model_doc/swin.mdx Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Apply suggestions from code review Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * conversations * fix tests * minor changes * resolved conversations * attentions one per stage * typo * typos * typos * function signature * CI * clean up tests Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com>
-
Joao Gante authored
-
Suraj Patil authored
* override from_pretrained * add tests * remove docstrings * fix typo * Trigger CI
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Francesco Saverio Zuppichini authored
* updated names * fit in one line * typo
-
John Ryan authored
* Distillbert type - squash * Update src/transformers/models/distilbert/modeling_distilbert.py Undo cleanup Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> * Update src/transformers/models/distilbert/modeling_distilbert.py Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> * Update src/transformers/models/distilbert/modeling_distilbert.py Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> * Update src/transformers/models/distilbert/modeling_distilbert.py Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> * Remove type Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com>
-
Utku Saglam authored
Co-authored-by:utku saglam <utkusaglam@utku-MacBook-Pro.local>
-
Johannes Kolbe authored
* unpack_input decorator for tf_convnext * set unpack_input as top decorator Co-authored-by:Johannes Kolbe <johannes.kolbe@tech.better.team>
-
Anton Lozhkov authored
* Minor fixes * Fix vocab union * Update examples/research_projects/xtreme-s/README.md Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Update README * unused import Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
Utku Saglam authored
Co-authored-by:utku saglam <utkusaglam@utku-MacBook-Pro.local>
-
Utku Saglam authored
Co-authored-by:utku saglam <utkusaglam@utku-MacBook-Pro.local>
-
Sanchit Gandhi authored
* Replace all deprecated `jax.ops` operations with jnp's `at` * np to jnp scores * suggested changes
-
Patrick von Platen authored
-
- 15 Mar, 2022 19 commits
-
-
Anton Lozhkov authored
* CTC+classification draft * CTC+classification draft * style * multilingual runs * Fix race condition during processor.from_reatrained * Merge covost experiments * Add README * Quality * Switch to .all configs * Fix typos
-
Sylvain Gugger authored
-
Yih-Dar authored
* fix the last element in `hidden_states` * fix missing elements in outputs for FlaxWav2Vec2EncoderLayerStableLayerNormCollection Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Dan Tegzes authored
-
Jack McDonald authored
-
Kamal Raj authored
-
Thomas Chaigneau authored
* Add type hints for FlauBERT PyTorch Base model. Others downstream tasks are inherited from XLM RoBERTa. * Add type hints for FlaubERT Tensorflow models. * fix output for TFFlaubertWithLMHeadModel
-
Kamal Raj authored
-
Kamal Raj authored
-
Suraj Patil authored
-
Sylvain Gugger authored
* First files * More files * Last files * Style
-
mowafess authored
-
Joydeep Bhattacharjee authored
* update transformer XL with tf decorator * code fixup * remove unused variables
-
Minh Chien Vu authored
Co-authored-by:ChienVM <chien_vm@detomo.co.jp>
-
Kamal Raj authored
-
Matt authored
* First attempt at TF XLA generation * Fix comments * Update XLA greedy generate with direct XLA calls * Support attention mask, prepare_inputs_for_generation no longer hardcoded for greedy * Handle position_ids correctly * make xla generate work for non xla case * force using xla generate * refactor * more fixes * finish cleaning * finish * finish * clean gpt2 tests * add gpt2 tests * correct more cases * up * finish * finish * more fixes * flake 8 stuff * final rag fix * Update src/transformers/models/rag/modeling_tf_rag.py * finish t5 as well * finish * Update src/transformers/generation_utils.py Co-authored-by:Patrick von Platen <patrick.v.platen@gmail.com>
-
Yih-Dar authored
* fix 2 pytorch vilt docstring examples * add vilt to doctest list file * remove device Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Markus Sagen authored
* Fix inconsistent example variable naming - Example code for a sequence classification in Tensorflow had spelling mistakes and incorrect and inconsistent naming - Changed variable naming to be consistent with the two other TF examples * Fix incorrect incorrect training examples
-
Sylvain Gugger authored
* Use tempaltes for all doc building jobs * Add this branch to the doc build * Switch to main branch
-