- 28 Jul, 2022 3 commits
-
-
Nicola Procopio authored
-
Edoardo Federici authored
Left the term "fine-tuning" untranslated, since there is no accurate Italian translation and the English term is generally used. The same was done for some other terms, such as "learning rate".
-
Edoardo Federici authored
-
- 27 Jul, 2022 19 commits
-
-
Steven Liu authored
As pointed out by @NielsRogge, a feature extractor is used to prepare inputs for a model with a single modality rather than multimodal models.
-
Wang, Yi authored
* starting from 1.12, torch_ccl is renamed to oneccl_bindings_for_pytorch and must be imported before use * add doc for perf_train_cpu_many * update doc Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
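A minimal sketch of the import order this commit describes, assuming the goal is only to load whichever binding package is installed before the `ccl` backend is used. The helper name `import_ccl_bindings` is hypothetical and is not part of the commit.

```python
import importlib


def import_ccl_bindings():
    """Try the new package name first, then the pre-1.12 name.

    Hypothetical helper for illustration. Returns the imported module,
    or None when neither package is installed.
    """
    for name in ("oneccl_bindings_for_pytorch", "torch_ccl"):
        try:
            return importlib.import_module(name)
        except ImportError:
            continue
    return None


bindings = import_ccl_bindings()
if bindings is None:
    print("oneCCL bindings are not installed; the ccl backend is unavailable")
else:
    print(f"loaded {bindings.__name__}")
```

The import has to happen before a process group is created with the `ccl` backend, which is why the sketch runs it at module load time.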
-
Ritik Nandwal authored
* Add files generated using transformers-cli add-new-model-like command * Add changes for swinv2 attention and forward method * Add fixes * Add modifications for weight conversion and remaining args in swin model * Add changes for patchmerging * Add changes for SwinV2selfattention * Update conversion script * Add final fixes for the swin_v2 model * Add changes for conversion script for pretrained window size case * Add pretrained window size value from config in SwinV2Encoder class * Make fixup * Add swinv2 to models_not_in_readme to utils/check_copies.py * Modify Swinv2v2 to Swin Transformer V2 * Remove copied from, to run make fixup command * Add updates to swinv2tf from main branch * Add pretrained_window_size to config, to make tests pass * Add modified weights from nandwalritik profile for swinv2 * Update model weights from swinv2 from nandwalritik profile * Add fix for build_pr_documentation CI fix * Add fixes f...
-
Lysandre authored
-
Sanchit Gandhi authored
* [Flax] Fix incomplete batches in example scripts * fix dataloader batching * convert jnp batch idxs to np array * add missing `pad_shard_unpad` to final prediction generate step * only `pad_shard_unpad` at inference time * merge conflicts * remove incomplete batch step from eval * fix run_qa.py * add `pad_shard_unpad` to run_flax_ner.py * add `pad_shard_unpad` to run_flax_glue.py * add `pad_shard_unpad` to run_image_classification.py * make style * fix mlm flax eval batches * remove redundant imports
-
Alara Dirik authored
* fix owlvit test assertion errors * fix gpu test error * remove redundant lines * fix styling
-
Sylvain Gugger authored
* Fix sacremoses soft dependency for Transformers XL * Add function to the submodule init
-
Lysandre Debut authored
-
Sylvain Gugger authored
* Remove all uses of six * fix quality
-
Duong A. Nguyen authored
* generalize decay_mask_fn to find all layernorm params * fixup * generalising decay_mask_fn
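A rough sketch of what a generalized `decay_mask_fn` could look like, written in plain Python so it runs without Flax. The candidate name list and the `flatten`/`unflatten` helpers are assumptions for illustration; the example scripts themselves work on Flax parameter trees and may differ in detail.

```python
# Assumed substrings that mark a layer-norm module in a parameter path.
LAYER_NORM_CANDIDATES = ("layernorm", "layer_norm", "ln")


def flatten(tree, prefix=()):
    """Flatten a nested dict into {path_tuple: leaf}."""
    flat = {}
    for key, value in tree.items():
        path = prefix + (key,)
        if isinstance(value, dict):
            flat.update(flatten(value, path))
        else:
            flat[path] = value
    return flat


def unflatten(flat):
    """Rebuild a nested dict from {path_tuple: leaf}."""
    tree = {}
    for path, value in flat.items():
        node = tree
        for key in path[:-1]:
            node = node.setdefault(key, {})
        node[path[-1]] = value
    return tree


def decay_mask_fn(params):
    """Return a tree of booleans: True where weight decay should apply.

    Biases and every parameter under a layer-norm-like module are excluded.
    """
    def is_layer_norm(path):
        return any(c in key.lower() for key in path for c in LAYER_NORM_CANDIDATES)

    flat = flatten(params)
    mask = {p: (p[-1] != "bias" and not is_layer_norm(p)) for p in flat}
    return unflatten(mask)


params = {
    "encoder": {
        "dense": {"kernel": 1.0, "bias": 0.0},
        "LayerNorm": {"scale": 1.0, "bias": 0.0},
    },
    "ln_f": {"scale": 1.0, "bias": 0.0},
}
mask = decay_mask_fn(params)
```

Matching on a list of name fragments, rather than one hard-coded module name, is what lets the mask cover models that name their layer norms differently.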
-
Nouamane Tazi authored
-
Younes Belkada authored
- put gelu before 4h to h
-
Mikkel Denker authored
* Fixes torch jit tracing for the LayoutLMv2 model. PyTorch seems to reuse memory for input_shape, which caused a shape mismatch later in the forward pass. * Fixed code quality * avoid unneeded allocation of vector for shape
-
Loubna Ben Allal authored
* add info about megatron training * upload models and datasets from CodeParrot organization * upload models and datasets from CodeParrot organization * Update examples/research_projects/codeparrot/README.md * Update examples/research_projects/codeparrot/README.md * Update examples/research_projects/codeparrot/README.md * Update examples/research_projects/codeparrot/README.md * Update examples/research_projects/codeparrot/README.md * fix typo and add comment about codeparrot vs megatron Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>
-
Yanming Wang authored
-
Seunghwan Hong authored
Signed-off-by: Seunghwan Hong <seunghwan@scatterlab.co.kr>
-
NielsRogge authored
* Improve docs * Improve docs of speech one as well * Apply suggestions from code review Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Manuel R. Ciosici authored
Removes a duplicated instantiation of device. I removed the second instance of the line to maintain code alignment with the GPT-J implementation of forward.
-
NielsRogge authored
* Improve doc test * Improve code example of segmentation model * Apply suggestion * Update src/transformers/models/detr/modeling_detr.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
- 26 Jul, 2022 18 commits
-
-
Carolyn Wang authored
* add import * format
-
Matt authored
* Fix Sylvain's nits on the original PR * Update src/transformers/keras_callbacks.py * Re-add "optional" to docstring Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Ian Castillo authored
* Update index * Translate to Spanish two sections from custom_models * Translate to Spanish custom models documentation * Fixing typos and grammatical errors * Add requested changes from reviewer
-
Federico Panero authored
* work in progress: custom_models * Update custom_models.mdx * Update custom_models.mdx * Update _toctree.yml * Update _toctree.yml * Update custom_models.mdx * Update custom_models.mdx * Update _toctree.yml * Update _toctree.yml Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Federico Panero authored
* Add Italian translation of converting_tensorflow_models.mdx * Update _toctree.yml * Update converting_tensorflow_models.mdx * Update docs/source/it/_toctree.yml Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Matt authored
* Raise a TF-specific error when importing Torch classes * Update src/transformers/utils/import_utils.py * Add an inverse error for PyTorch users Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
-
Fellip Silva Alves authored
* [ fast_tokenizers.mdx ] - Added Portuguese translation of the tutorial * Delete docs/source/pt-br directory * [ fast_tokenizers.mdx ] - Continuing work on file * [ fast_tokenizers.mdx ] - Continuing work on file * Add fast tokenizers to _toctree.yml * Eliminated config and toctree.yml * Nits in fast_tokenizers.mdx * Finishing create_a_model * [ create_a_model.mdx ] finishing create a model in pt-br * [ Changing _toctree.yml ] adding create a model in pt Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>
-
Gorkem Ozkaya authored
* Update translation.mdx * update translation.mdx by running make style
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Tom Mathews authored
-
Tom Mathews authored
-
Sylvain Gugger authored
-
Jingya HUANG authored
* Ensure value and attn weights have the same dtype * Remove prints * Modify decision transformers copied from gpt2 * Nit device * Fix style Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
gilad19 authored
* Add ViltForTokenClassification e.g. for Named-Entity-Recognition (NER) * Add ViltForTokenClassification e.g. for Named-Entity-Recognition (NER) * provide classifier only text hidden states * add test_for_token_classification * Update src/transformers/models/vilt/modeling_vilt.py * Update src/transformers/models/vilt/modeling_vilt.py * Update src/transformers/models/vilt/modeling_vilt.py * Update src/transformers/models/vilt/modeling_vilt.py * add test_for_token_classification Co-authored-by: gfuchs <gfuchs@ebay.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
-
Alara Dirik authored
* fix docs and add owlvit docs test * fix minor bug in post_process, add to processor * improve owlvit code examples * fix hardcoded image size
-
Lysandre Debut authored
-