"...git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "e7e9261a202dd5623f488f1cb05007e88629f275"
  1. 29 Jun, 2022 1 commit
    • Aritra Roy Gosthipaty's avatar
      TF implementation of RegNets (#17554) · a7eba831
      Aritra Roy Gosthipaty authored
      
      
      * chore: initial commit
      
      Copied the torch implementation of regnets and porting the code to tf step by step. Also introduced an output layer which was needed for regnets.
      
      * chore: porting the rest of the modules to tensorflow
      
      did not change the documentation yet, yet to try the playground on the model
      
      * Fix initilizations (#1)
      
      * fix: code structure in few cases.
      
      * fix: code structure to align tf models.
      
      * fix: layer naming, bn layer still remains.
      
      * chore: change default epsilon and momentum in bn.
      
      * chore: styling nits.
      
      * fix: cross-loading bn params.
      
      * fix: regnet tf model, integration passing.
      
      * add: tests for TF regnet.
      
      * fix: code quality related issues.
      
      * chore: added rest of the files.
      
      * minor additions..
      
      * fix: repo consistency.
      
      * fix: regnet tf tests.
      
      * chore: reorganize dummy_tf_objects for regnet.
      
      * chore: remove checkpoint var.
      
      * chore: remov unnecessary files.
      
      * chore: run make style.
      
      * Update docs/source/en/model_doc/regnet.mdx
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * chore: PR feedback I.
      
      * fix: pt test. thanks to @ydshieh.
      
      * New adaptive pooler (#3)
      
      * feat: new adaptive pooler
      
      Co-authored-by: @Rocketknight1
      
      * chore: remove image_size argument.
      Co-authored-by: default avatarmatt <rocketknight1@gmail.com>
      Co-authored-by: default avatarmatt <rocketknight1@gmail.com>
      
      * Empty-Commit
      
      * chore: remove image_size comment.
      
      * chore: remove playground_tf.py
      
      * chore: minor changes related to spacing.
      
      * chore: make style.
      
      * Update src/transformers/models/regnet/modeling_tf_regnet.py
      Co-authored-by: default avataramyeroberts <aeroberts4444@gmail.com>
      
      * Update src/transformers/models/regnet/modeling_tf_regnet.py
      Co-authored-by: default avataramyeroberts <aeroberts4444@gmail.com>
      
      * chore: refactored __init__.
      
      * chore: copied from -> taken from./g
      
      * adaptive pool -> global avg pool, channel check.
      
      * chore: move channel check to stem.
      
      * pr comments - minor refactor and add regnets to doc tests.
      
      * Update src/transformers/models/regnet/modeling_tf_regnet.py
      Co-authored-by: default avatarNielsRogge <48327001+NielsRogge@users.noreply.github.com>
      
      * minor fix in the xlayer.
      
      * Empty-Commit
      
      * chore: removed from_pt=True.
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarmatt <rocketknight1@gmail.com>
      Co-authored-by: default avataramyeroberts <aeroberts4444@gmail.com>
      Co-authored-by: default avatarNielsRogge <48327001+NielsRogge@users.noreply.github.com>
      a7eba831
  2. 28 Jun, 2022 3 commits
  3. 27 Jun, 2022 1 commit
    • Matt's avatar
      Add a TF in-graph tokenizer for BERT (#17701) · ee0d001d
      Matt authored
      * Add a TF in-graph tokenizer for BERT
      
      * Add from_pretrained
      
      * Add proper truncation, option handling to match other tokenizers
      
      * Add proper imports and guards
      
      * Add test, fix all the bugs exposed by said test
      
      * Fix truncation of paired texts in graph mode, more test updates
      
      * Small fixes, add a (very careful) test for savedmodel
      
      * Add tensorflow-text dependency, make fixup
      
      * Update documentation
      
      * Update documentation
      
      * make fixup
      
      * Slight changes to tests
      
      * Add some docstring examples
      
      * Update tests
      
      * Update tests and add proper lowercasing/normalization
      
      * make fixup
      
      * Add docstring for padding!
      
      * Mark slow tests
      
      * make fixup
      
      * Fall back to BertTokenizerFast if BertTokenizer is unavailable
      
      * Fall back to BertTokenizerFast if BertTokenizer is unavailable
      
      * make fixup
      
      * Properly handle tensorflow-text dummies
      ee0d001d
  4. 24 Jun, 2022 2 commits
  5. 23 Jun, 2022 2 commits
  6. 21 Jun, 2022 3 commits
  7. 18 Jun, 2022 1 commit
    • Rafael Zimmer's avatar
      Added translation of index.mdx to Portuguese Issue #16824 (#17565) · 0d92798b
      Rafael Zimmer authored
      
      
      * Added translation of installation.mdx to Portuguese, as well
      as default templates of _toctree.yml and _config.py
      
      * [ build_documentation.yml ] - Updated doc_builder to build
      documentation in Portuguese.
      [ pipeline_tutorial.mdx ] - Created translation for the pipeline_tutorial.mdx.
      
      * [ build_pr_documentation.yml ] - Added pt language to pr_documentation builder.
      
      [ pipeline_tutorial.mdx ] - Grammar changes.
      
      * [ accelerate.mdx ] - Translated to Portuguese the acceleration tutorial.
      
      * [ multilingual.mdx ] - Added portuguese translation for multilingual tutorial.
      
      [ training.mdx ] - Added portuguese translation for training tutorial.
      
      * [ preprocessing.mdx ] - WIP
      
      * Update _toctree.yml
      
      * Adding Pr茅-processamento to _toctree.yml
      
      * Update accelerate.mdx
      
      * Nits and eliminate preprocessing file while it is ready
      
      * [ index.mdx ] - Translated to Portuguese the index apresentation page.
      
      * [ docs/source/pt ] - Updated _toctree.yml to match newest translations.
      
      * Fix build_pr_documentation.yml
      
      * Fix index nits
      
      * nits in _toctree
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      0d92798b
  8. 15 Jun, 2022 2 commits
  9. 14 Jun, 2022 2 commits
  10. 13 Jun, 2022 2 commits
    • Daniel Stancl's avatar
      Add `LongT5` model (#16792) · a72f1c9f
      Daniel Stancl authored
      
      
      * Initial commit
      
      * Make some fixes
      
      * Make PT model full forward pass
      
      * Drop TF & Flax implementation, fix copies etc
      
      * Add Flax model and update some corresponding stuff
      
      * Drop some TF things
      
      * Update config and flax local attn
      
      * Add encoder_attention_type to config
      
      * .
      
      * Update docs
      
      * Do some cleansing
      
      * Fix some issues -> make style; add some docs
      
      * Fix position_bias + mask addition + Update tests
      
      * Fix repo consistency
      
      * Fix model consistency by removing flax operation over attn_mask
      
      * [WIP] Add PT TGlobal LongT5
      
      * .
      
      * [WIP] Add flax tglobal model
      
      * [WIP] Update flax model to use the right attention type in the encoder
      
      * Fix flax tglobal model forward pass
      
      * Make the use of global_relative_attention_bias
      
      * Add test suites for TGlobal model
      
      * Fix minor bugs, clean code
      
      * Fix pt-flax equivalence though not convinced with correctness
      
      * Fix LocalAttn implementation to match the original impl. + update READMEs
      
      * Few updates
      
      * Update: [Flax] improve large model init and loading #16148
      
      * Add ckpt conversion script accoring to #16853 + handle torch device placement
      
      * Minor updates to conversion script.
      
      * Typo: AutoModelForSeq2SeqLM -> FlaxAutoModelForSeq2SeqLM
      
      * gpu support + dtype fix
      
      * Apply some suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * * Remove (de)parallelize stuff
      * Edit shape comments
      * Update README.md
      * make fix-copies
      
      * Remove caching logic for local & tglobal attention
      
      * Apply another batch of suggestions from code review
      
      * Add missing checkpoints
      * Format converting scripts
      * Drop (de)parallelize links from longT5 mdx
      
      * Fix converting script + revert config file change
      
      * Revert "Remove caching logic for local & tglobal attention"
      
      This reverts commit 2a619828f6ddc3e65bd9bb1725a12b77fa883a46.
      
      * Stash caching logic in Flax model
      
      * Make side relative bias used always
      
      * Drop caching logic in PT model
      
      * Return side bias as it was
      
      * Drop all remaining model parallel logic
      
      * Remove clamp statements
      
      * Move test files to the proper place
      
      * Update docs with new version of hf-doc-builder
      
      * Fix test imports
      
      * Make some minor improvements
      
      * Add missing checkpoints to docs
      * Make TGlobal model compatible with torch.onnx.export
      * Replace some np.ndarray with jnp.ndarray
      
      * Fix TGlobal for ONNX conversion + update docs
      
      * fix _make_global_fixed_block_ids and masked neg  value
      
      * update flax model
      
      * style and quality
      
      * fix imports
      
      * remove load_tf_weights_in_longt5 from init and fix copies
      
      * add slow test for TGlobal model
      
      * typo fix
      
      * Drop obsolete is_parallelizable and one warning
      
      * Update __init__ files to fix repo-consistency
      
      * fix pipeline test
      
      * Fix some device placements
      
      * [wip]: Update tests -- need to generate summaries to update expected_summary
      
      * Fix quality
      
      * Update LongT5 model card
      
      * Update (slow) summarization tests
      
      * make style
      
      * rename checkpoitns
      
      * finish
      
      * fix flax tests
      Co-authored-by: default avatarphungvanduy <pvduy23@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarpatil-suraj <surajp815@gmail.com>
      a72f1c9f
    • Sijun He's avatar
      Add Visual Question Answering (VQA) pipeline (#17286) · 66336dc1
      Sijun He authored
      
      
      * wip
      
      * rebase
      
      * all tests pass
      
      * rebase
      
      * ready for PR
      
      * address comments
      
      * fix styles
      
      * add require_torch to pipeline test
      
      * remove remote image to improve CI consistency
      
      * address comments; fix tf/flax tests
      
      * address comments; fix tf/flax tests
      
      * fix tests; add alias
      
      * repo consistency tests
      
      * Update src/transformers/pipelines/visual_question_answering.py
      Co-authored-by: default avatarNielsRogge <48327001+NielsRogge@users.noreply.github.com>
      
      * address comments
      
      * Update src/transformers/pipelines/visual_question_answering.py
      Co-authored-by: default avatarNielsRogge <48327001+NielsRogge@users.noreply.github.com>
      
      * merge
      
      * Update src/transformers/models/auto/modeling_auto.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * merge
      Co-authored-by: default avatarSijun He <sijunhe@Sijuns-MacBook-Pro.local>
      Co-authored-by: default avatarNielsRogge <48327001+NielsRogge@users.noreply.github.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      66336dc1
  11. 10 Jun, 2022 1 commit
  12. 09 Jun, 2022 4 commits
  13. 08 Jun, 2022 2 commits
  14. 07 Jun, 2022 3 commits
    • Chan Woo Kim's avatar
      M-CTC-T Model (#16402) · 119e3c0f
      Chan Woo Kim authored
      
      
      * added cbs to notebooks, made copy-paste error fix in generation_utils
      
      * initial push for mctc model
      
      * mctc feature extractor done
      
      * added processor, tokenizer and their tests for MCTC. Have added an MCTC modeling test, adjusting model code accordingly.
      
      * added processor, tokenizer and their tests for MCTC. Have added an MCTC modeling test, adjusting model code accordingly.
      
      * passing attention, now struggling to figure out how attention masks make sense here
      
      * works when excluding attention masks. ask later how one would integrate attention maskshere
      
      * bizarre configuration error (model prefix comes first in config dict json and messes up the order)
      
      * all passing but bizzarre config dict ordering issue when to_dict
      
      * passing all major tests
      
      * feature extraction, processor, tokenizer added & tests passing
      
      * style & consistency & other logistical fixes
      
      * copy paste fix
      
      * model after feature extraction working
      
      * commiting final feature extraction results; need to fix normalization
      
      * feature extraction passing tests; probably should add tests on the specific flashlight-copied functions?
      
      * delete print ; format code a bit
      
      * fixing tests
      
      * passing major tests
      
      * fixing styles
      
      * completed tokenization test with real example; not sure if these values are entirely correct.
      
      * last test fixes from local
      
      * reverting accidentally included custom setup configs
      
      * remove load tf weights; fix config error
      
      * testing couldnt import featureextractor
      
      * fix docs
      
      * fix docs
      
      * resolving comments
      
      * style fixes
      
      * style fixes
      
      * Update to MCTCConv1dSubSampler
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * relposemb fixes
      
      * conv1d name issue; expecting config fail with paraentheses
      
      * fix config issue
      
      * fix config issue
      
      * fix config issue
      
      * change everything to MCTCT
      
      * fixing naming change errors
      
      * archive list
      
      * copyrights and docs
      
      * copyrights and docs
      
      * copyrights and docs
      
      * merge resolution
      
      * move tests, fix to changed optionaldependency structure
      
      * test directories changed
      
      * fixing tests
      
      * how to avoid tf tests?
      
      * how to avoid tf tests?
      
      * tests passing locally
      
      * allow mctctprocessor imported any env
      
      * allow mctctprocessor imported any env
      
      * fixed second round of feedback, need to fix docs
      
      * doc changes not being applied
      
      * all fixed
      
      * style fix
      
      * feedback fixes
      
      * fix copies and feature extraction style fix
      
      * Update tests/models/visual_bert/test_modeling_visual_bert.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * copy paste huggingface:main visual bert
      
      * added eof newline to visual bert; all tests are passing otherwise
      
      * fix slow tests by adding attention mask
      
      * change model id to speechbrain
      
      * make fix-copies
      
      * fix readme unwanted deletes
      
      * fixing readmes, make fix-copies
      
      * consistent M-CTC-T naming
      
      * Update src/transformers/models/mctct/__init__.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * all fixed but variable naming
      
      * adjust double quotes
      
      * fixed variable names
      
      * copyright and mr quilter
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * correct slow tests
      
      * make fix-copies
      
      * Update src/transformers/models/mctct/configuration_mctct.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/mctct/configuration_mctct.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * m-ctc-t not mctct
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      119e3c0f
    • V铆tor Fr贸is's avatar
      quicktour.mdx en -> pt translation (#17074) · 706bb836
      V铆tor Fr贸is authored
      
      
      * Quicktour Portuguese Translation
      
      Translated quicktour.mdx until line 161
      
      * Finished translating quicktour.mdx
      
      Ready to upload and adjust eventual .mdx or translation mistakes.
      
      * Add _toctree.yml and fix nits
      
      * Fixed pt-br mdx syntax problem
      
      Closed <frameworkcontent> instance
      
      * Changed </frameworkcontent> line
      
      * Copied missing block from english version of quicktour.mdx
      
      * Reviwed the entire file once again. It should be working now.
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      706bb836
    • Omar U. Espejel's avatar
      b1187307
  15. 06 Jun, 2022 3 commits
    • Nicola Procopio's avatar
      Translation/italian: added pipeline_tutorial.mdx [Issue: #17459] (#17507) · 34a886fc
      Nicola Procopio authored
      * added toctree.yml file
      
      * first translation
      
      * added pipeline_tutorial.mdx translation
      
      added pipeline_tutorial.mdx
      updated _toctree.yml
      
      * updated pipeline_tutorial.mdx
      
      * updated _toctree.yml
      
      Updated preprocessing and training
      
      * updated preprocessing.mdx
      
      start translation
      
      * Update _toctree.yml
      
      * Delete preprocessing.mdx
      
      * Update _toctree.yml
      
      * updated _toctree.yml
      
      * added preprocessing
      
      * Update _toctree.yml
      
      * updated _toctree.yml
      
      * undo
      
      * Revert "undo"
      
      This reverts commit 5d38d768752dc80918bf60ada9d185f98b742520.
      
      * Revert "Revert "undo""
      
      This reverts commit 8aa0830b587f915ca7d154ebca282b782e82bd92.
      34a886fc
    • Martina Fumanelli's avatar
      Add installation.mdx Italian translation (#17530) · f6ad0e05
      Martina Fumanelli authored
      * Add the Italian translation of the file installation.mdx and edit _toctree
      
      * Add the Italian translation of the file installation.mdx and edit _toctree
      f6ad0e05
    • Jonatas Grosman's avatar
      Adding the Portuguese version of the tasks/token_classification.mdx documentation (#17492) · 4aed1dc8
      Jonatas Grosman authored
      * add tasks/token_classification pt doc structure
      
      * add tasks/token_classification pt doc translation
      
      * add tasks/token_classification pt doc translation
      4aed1dc8
  16. 03 Jun, 2022 3 commits
  17. 02 Jun, 2022 1 commit
  18. 01 Jun, 2022 2 commits
  19. 31 May, 2022 2 commits
    • Arthur's avatar
      Opt in flax and tf (#17388) · 7822a9b7
      Arthur authored
      
      
      * initial commit
      
      * add init file
      
      * update globakl init
      
      * update index and dummy objects
      
      * style
      
      * update modelling auto
      
      * fix initi typo in src/transformers
      
      * fix typo in modeling tf auto, opt was in wrong mapping name
      
      * fixed a slow test : saved_model
      
      * style
      
      * fix positionnal embedding if no position id is provided
      
      * update tf test
      
      * update test flax requirements
      
      * fixed serialization
      
      * update
      
      * update tf name to allow smooth convertion
      
      * update flax tests
      
      * style
      
      * fix test typo
      
      * fix tf typo test
      
      * add xla for generate support in causal LM
      
      * fixed bug
      
      * cleaned tf tests
      
      * style
      
      * removed from PT for slow tests
      
      * fix typp
      
      * opt test as slow
      
      * trying to fix GPT2 undefined
      
      * correct documentation and add to test doc
      
      * update tf doc
      
      * fix doc
      
      * fake commit
      
      * Apply suggestions from code review
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      
      * update test based on review
      
      * merged main layer for functionning test
      
      * fixup + quality
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * update long comment
      
      * make fix copies
      Co-authored-by: default avatarArthur <arthur@huggingface.co>
      Co-authored-by: default avatarJoao Gante <joaofranciscocardosogante@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      7822a9b7
    • Martina Fumanelli's avatar
      Setup for Italian translation and add quicktour.mdx translation (#17472) · dfc38463
      Martina Fumanelli authored
      
      
      * Setup for Italian translation and add first document
      
      - Add 'it' folder for files translated into Italian
      - Add _config.py and _toctree.yml files
      - Add translation of quicktour.mdx
      
      * Fix style issue of italian documentation files
      
      * Add 'it' to the languages section in the .github/workflows
      
      * Remove - installation from _toctree for Italian
      
      * Translation for index file
      
      - Add index to _toctree.yml
      - Add translation of index.mdx
      
      * Fix typo in docs/source/it/index.mdx
      
      * Translate code comments in docs/source/it/_config.py
      Co-authored-by: default avatarMartina Fumanelli <martinafumanelli@Martinas-MBP.homenet.telecomitalia.it>
      dfc38463