- 17 Mar, 2022 1 commit
-
-
Francesco Saverio Zuppichini authored
* done * done
-
- 16 Mar, 2022 15 commits
-
-
João Gustavo A. Amorim authored
* add types annotations for Beit (PyTorch) * add types annotations for ViT (PyTorch) * add types annotations for Deit (PyTorch) * change Optional[bool] to bool into some places at Beit * change Optional[bool] to bool into some places at ViT
-
Patrick von Platen authored
* up * fix min lengths
-
Francesco Saverio Zuppichini authored
* padding done * correctly return one attention per layer * almost correct, attentions are not flatten one tuple per stage * tests green * doc * conversations * reshaping hidden_states * view in the test * reshape_hidden_states in Encoder and Model * new outputs with reshaped_hidden_states * conversations * doc * Update docs/source/model_doc/swin.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * conversations * fix tests * minor changes * resolved conversations * attentions one per stage * typo * typos * typos * function signature * CI * clean up tests Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
-
Joao Gante authored
-
Suraj Patil authored
* override from_pretrained * add tests * remove docstrings * fix typo * Trigger CI
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Francesco Saverio Zuppichini authored
* updated names * fit in one line * typo
-
John Ryan authored
* Distillbert type - squash * Update src/transformers/models/distilbert/modeling_distilbert.py Undo cleanup Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Update src/transformers/models/distilbert/modeling_distilbert.py Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Update src/transformers/models/distilbert/modeling_distilbert.py Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Update src/transformers/models/distilbert/modeling_distilbert.py Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Remove type Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
-
Utku Saglam authored
Co-authored-by: utku saglam <utkusaglam@utku-MacBook-Pro.local>
-
Johannes Kolbe authored
* unpack_input decorator for tf_convnext * set unpack_input as top decorator Co-authored-by: Johannes Kolbe <johannes.kolbe@tech.better.team>
-
Anton Lozhkov authored
* Minor fixes * Fix vocab union * Update examples/research_projects/xtreme-s/README.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update README * unused import Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Utku Saglam authored
Co-authored-by: utku saglam <utkusaglam@utku-MacBook-Pro.local>
-
Utku Saglam authored
Co-authored-by: utku saglam <utkusaglam@utku-MacBook-Pro.local>
-
Sanchit Gandhi authored
* Replace all deprecated `jax.ops` operations with jnp's `at` * np to jnp scores * suggested changes
-
Patrick von Platen authored
-
- 15 Mar, 2022 24 commits
-
-
Anton Lozhkov authored
* CTC+classification draft * CTC+classification draft * style * multilingual runs * Fix race condition during processor.from_pretrained * Merge covost experiments * Add README * Quality * Switch to .all configs * Fix typos
-
Sylvain Gugger authored
-
Yih-Dar authored
* fix the last element in `hidden_states` * fix missing elements in outputs for FlaxWav2Vec2EncoderLayerStableLayerNormCollection Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Dan Tegzes authored
-
Jack McDonald authored
-
Kamal Raj authored
-
Thomas Chaigneau authored
* Add type hints for FlauBERT PyTorch Base model. Others downstream tasks are inherited from XLM RoBERTa. * Add type hints for FlaubERT Tensorflow models. * fix output for TFFlaubertWithLMHeadModel
-
Kamal Raj authored
-
Kamal Raj authored
-
Suraj Patil authored
-
Sylvain Gugger authored
* First files * More files * Last files * Style
-
mowafess authored
-
Joydeep Bhattacharjee authored
* update transformer XL with tf decorator * code fixup * remove unused variables
-
Minh Chien Vu authored
Co-authored-by: ChienVM <chien_vm@detomo.co.jp>
-
Kamal Raj authored
-
Matt authored
* First attempt at TF XLA generation * Fix comments * Update XLA greedy generate with direct XLA calls * Support attention mask, prepare_inputs_for_generation no longer hardcoded for greedy * Handle position_ids correctly * make xla generate work for non xla case * force using xla generate * refactor * more fixes * finish cleaning * finish * finish * clean gpt2 tests * add gpt2 tests * correct more cases * up * finish * finish * more fixes * flake 8 stuff * final rag fix * Update src/transformers/models/rag/modeling_tf_rag.py * finish t5 as well * finish * Update src/transformers/generation_utils.py Co-authored-by:Patrick von Platen <patrick.v.platen@gmail.com>
-
Yih-Dar authored
* fix 2 pytorch vilt docstring examples * add vilt to doctest list file * remove device Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Markus Sagen authored
* Fix inconsistent example variable naming - Example code for a sequence classification in Tensorflow had spelling mistakes and incorrect and inconsistent naming - Changed variable naming to be consistent with the two other TF examples * Fix incorrect training examples
-
Sylvain Gugger authored
* Use templates for all doc building jobs * Add this branch to the doc build * Switch to main branch
-
Daniel Espejel authored
* Added Spanish translation of quicktour.mdx * Suggestions applied in the revision of the translation Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>
-
Ahmed Elnaggar authored
* Configurable Relative Position Max. Distance * fix missing config Co-authored-by: ahmed-elnaggar <ahmed.elnaggar@allianz.com>
-
marxav authored
-
Patrick von Platen authored
-
Pavel Belevich authored
-