1. 31 May, 2022 1 commit
    • Ritik Nandwal's avatar
      Added XLM onnx config (#17030) · 5af38953
      Ritik Nandwal authored
      * Add onnx configuration for xlm
      
      * Add supported features for xlm
      
      * Add xlm to models exportable with onnx
      
      * Add xlm architecture to test file
      
      * Modify docs
      
      * Make code quality fixes
      5af38953
  2. 18 May, 2022 1 commit
  3. 16 May, 2022 1 commit
  4. 09 May, 2022 2 commits
  5. 06 May, 2022 1 commit
  6. 26 Apr, 2022 1 commit
  7. 25 Apr, 2022 1 commit
  8. 22 Apr, 2022 1 commit
  9. 18 Apr, 2022 1 commit
  10. 12 Apr, 2022 2 commits
  11. 08 Apr, 2022 1 commit
    • NielsRogge's avatar
      Add TAPEX (#16473) · 4ef0abb7
      NielsRogge authored
      
      
      * Add TapexTokenizer
      
      * Improve docstrings and provide option to provide answer
      
      * Remove option for pretokenized inputs
      
      * Add TAPEX to README
      
      * Fix copies
      
      * Remove option for pretokenized inputs
      
      * Initial commit: add tapex fine-tuning examples on both table-based question answering and table-based fact verification.
      
      * - Draft a README file for running the script and introducing some background.
      - Remove unused code lines in tabfact script.
      - Disable the deafult `pad_to_max_length` option which is memory-consuming.
      
      * * Support `as_target_tokenizer` function for TapexTokenizer.
      * Fix the do_lower_case behaviour of TapexTokenizer.
      * Add unit tests for target scenarios and cased/uncased scenarios for both source and target.
      
      * * Replace the label BartTokenizer with TapexTokenizer's as_target_tokenizer function.
      * Fix typos in tapex example README.
      
      * * fix the evaluation script - remove the property `task_name`
      
      * * Make the label space more clear for tabfact tasks
      
      * * Using a new fine-tuning script for tapex-base on tabfact.
      
      * * Remove the lowercase code outside the tokenizer - we use the tokenizer to control whether do_lower_case
      * Guarantee the hyper-parameter can be run without out-of-memory on 16GB card and report the new reproduced number on wikisql
      
      * * Remove the default tokenizer_name option.
      * Provide evaluation command.
      
      * * Support for WikiTableQuestion dataset.
      
      * Fix a typo in README.
      
      * * Fix the datasets's key name in WikiTableQuestions
      
      * Run make fixup and move test to folder
      
      * Fix quality
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Apply some more suggestions from code review
      
      * Improve docstrings
      
      * Overwrite failing test
      
      * Improve comment in example scripts
      
      * Fix rebase
      
      * Add TAPEX to Auto mapping
      
      * Add TAPEX to auto config mappings
      
      * Put TAPEX higher than BART in auto mapping
      
      * Add TAPEX to doc tests
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MBP.localdomain>
      Co-authored-by: default avatarSivilTaram <qianlxc@outlook.com>
      Co-authored-by: default avatarNiels Rogge <nielsrogge@nielss-mbp.home>
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      4ef0abb7
  12. 04 Apr, 2022 1 commit
  13. 01 Apr, 2022 1 commit
  14. 25 Mar, 2022 2 commits
  15. 23 Mar, 2022 2 commits
  16. 22 Mar, 2022 1 commit
  17. 21 Mar, 2022 1 commit
  18. 15 Mar, 2022 1 commit
  19. 09 Mar, 2022 1 commit
    • lewtun's avatar
      Add ONNX export for ViT (#15658) · 50dd314d
      lewtun authored
      
      
      * Add ONNX support for ViT
      
      * Refactor to use generic preprocessor
      
      * Add vision dep to tests
      
      * Extend ONNX slow tests to ViT
      
      * Add dummy image generator
      
      * Use model_type to determine modality
      
      * Add deprecation warnings for tokenizer argument
      
      * Add warning when overwriting the preprocessor
      
      * Add optional args to docstrings
      
      * Add minimum PyTorch version to OnnxConfig
      
      * Refactor OnnxConfig class variables from CONSTANT_NAME to snake_case
      
      * Add reasonable value for default atol
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      50dd314d
  20. 02 Mar, 2022 1 commit
  21. 01 Mar, 2022 1 commit
  22. 18 Feb, 2022 1 commit
    • Gunjan Chhablani's avatar
      Add PLBart (#13269) · ae1f8350
      Gunjan Chhablani authored
      * Init PLBART
      
      * Add missing configuration file
      
      * Add conversion script and configurationf ile
      
      * Fix style
      
      * Update modeling and conversion scripts
      
      * Fix scale embedding in config
      
      * Add comment
      
      * Fix conversion script
      
      * Add classification option to conversion script
      
      * Fix vocab size in config doc
      
      * Add tokenizer files from MBart50
      
      * Allow no lang code in regular tokenizer
      
      * Add PLBart Tokenizer Converters
      
      * Remove mask from multi tokenizer
      
      * Remove mask from multi tokenizer
      
      * Change from MBart-50 to MBart tokenizer
      
      * Fix names and modify src/tgt behavior
      
      * Fix imports for tokenizer
      
      * Remove <mask> from multi tokenizer
      
      * Fix style
      
      * Change tokenizer_class to processor_class
      
      * Add attribute map to config class
      
      * Update modeling file to modified MBart code
      
      * Update configuration file to MBart style configuration
      
      * Fix tokenizer
      
      * Separate tokenizers
      
      * Fix error in tokenization auto
      
      * Copy MBart tests
      
      * Replace with MBart tokenization tests
      
      * Fix style
      
      * Fix language code in multi tokenizer
      
      * Fix configuration docs
      
      * Add entry for plbart_multi in transformers init
      
      * Add dummy objects and fix imports
      
      * Fix modeling tests
      
      * Add TODO in config
      
      * Fix copyright year
      
      * Fix modeling docs and test
      
      * Fix some tokenization tests and style
      
      * Add changes from review
      
      * Fix copies
      
      * Fix docs
      
      * Fix docs
      
      * Fix style
      
      * Fix year
      
      * Add changes from review
      
      * Remove extra changes
      
      * Fix base tokenizer and doc
      
      * Fix style
      
      * Fix modeling and slow tokenizer tests
      
      * Remove Multi-tokenizer Converter and Tests
      
      * Delete QA model and Multi Tokenizer dummy objects
      
      * Fix repo consistency and code quality issues
      
      * Fix example documentation
      
      * Fix style
      
      * Remove PLBartTokenizer from type checking in init
      
      * Fix consistency issue
      
      * Add changes from review
      
      * Fix style
      
      * Remove PLBartTokenizerFast
      
      * Remove FastTokenizer converter
      
      * Fix AutoTokenzier mapping
      
      * Add plbart to toctree and fix consistency issues
      
      * Add language codes tokenizer test
      
      * Fix styling and doc issues
      
      * Add fixes for failing tests
      
      * Fix copies
      
      * Fix failing modeling test
      
      * Change assert to assertTrue in modeling tests
      ae1f8350
  23. 10 Feb, 2022 2 commits
  24. 08 Feb, 2022 1 commit
  25. 07 Feb, 2022 1 commit
  26. 29 Jan, 2022 1 commit
  27. 12 Jan, 2022 1 commit
    • lewtun's avatar
      Add ONNX configuration classes to docs (#15121) · 021f2ea9
      lewtun authored
      * Add ONNX classes to main package
      
      * Remove permalinks from ONNX guide
      
      * Fix ToC entry
      
      * Revert "Add ONNX classes to main package"
      
      This reverts commit eb794a5b00d66b0b4eab234987301676d8357630.
      
      * Add ONNX classes to main doc
      
      * Fix syntax highlighting in doc
      
      * Fix text
      
      * Add FeaturesManager to doc
      
      * Use paths to reference ONNX classes
      
      * Add FeaturesManager to init
      
      * Add missing ONNX paths
      021f2ea9
  28. 11 Jan, 2022 2 commits
    • lewtun's avatar
      Update ONNX docs (#14904) · 16f0b7d7
      lewtun authored
      
      
      * Remove docs for deprecated ONNX export
      
      * Tidy up the CLI help messages
      
      * Revamp ONNX docs
      
      * Update auto-config table
      
      * Use DistilBERT as example for consistency
      
      * Wrap up first pass at ONNX docs
      
      * Fix table check
      
      * Add tweaks and introduction
      
      * Add cross-ref
      
      * Fix missing import
      
      * Fix style
      
      * Add permalinks to ONNX configs
      
      * Clarify role of OrderedDict
      
      * Update docs/source/serialization.mdx
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Add doctest syntax to code blocks
      
      * Remove permalinks
      
      * Revert "Remove permalinks"
      
      This reverts commit 099701daf0db27823457867938efdb2d4f22a7c1.
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      16f0b7d7
    • Virus's avatar
      Adds IBERT to models exportable with ONNX (#14868) · c4fa908f
      Virus authored
      * Add IBertOnnxConfig and tests
      
      * add all the supported features for IBERT and remove outputs in IbertOnnxConfig
      
      * use OnnxConfig
      
      * fix codestyle
      
      * remove serialization.rst
      
      * codestyle
      c4fa908f
  29. 07 Jan, 2022 1 commit
  30. 28 Dec, 2021 1 commit
    • Sylvain Gugger's avatar
      Doc styler examples (#14953) · b5e2b183
      Sylvain Gugger authored
      * Fix bad examples
      
      * Add black formatting to style_doc
      
      * Use first nonempty line
      
      * Put it at the right place
      
      * Don't add spaces to empty lines
      
      * Better templates
      
      * Deal with triple quotes in docstrings
      
      * Result of style_doc
      
      * Enable mdx treatment and fix code examples in MDXs
      
      * Result of doc styler on doc source files
      
      * Last fixes
      
      * Break copy from
      b5e2b183
  31. 23 Dec, 2021 1 commit
    • lewtun's avatar
      Add ONNX support for MarianMT models (#14586) · 6b655cc6
      lewtun authored
      * First commit to add MarianMT to ONNX
      
      * Now MarianModel.forward() automatically generates decoder_input_ids, like BartModel.forward()
      
      * Adjusted MarianOnnxConfig.inputs and outputs to work with seq2seq-lm feature
      
      * Style fix
      
      * Added support for other features for already supported models
      
      * Partial support for causal and seq2seq models
      
      * Partial support for causal and seq2seq models
      
      * Add default task for MarianMT ONNX
      
      * Remove automatic creation of decoder_input_ids
      
      * Extend inputs and outputs for MarianMT ONNX config
      
      * Add MarianMT to ONNX unit tests
      
      * Refactor
      
      * OnnxSeq2SeqConfigWithPast to support seq2seq models
      
      * Parameterized the onnx tests
      
      * Restored run_mlm.py
      
      * Restored run_mlm.py
      
      * [WIP] BART update
      
      * BART and MBART
      
      * Add past_key_values and fix dummy decoder inputs
      
      Using a sequence length of 1 in generate_dummy_outputs() produces large discrepancies, presumably due to some hidden optimisations.
      
      * Refactor MarianOnnxConfig to remove custom past_key_values logic
      
      * Fix quality
      
      * Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"
      
      This reverts commit 0f4e39c5.
      
      * is_torch_available test to avoid failing imports
      
      * sorting parameterize parameters to solve ERROR gw0 gw1
      
      * tests fix
      
      * tests fix
      
      * GPT2 with past fix
      
      * Fixed stateful class attribute change that was breaking things when converting multiple models sequentially
      
      * Removed onnx file
      
      * Refactor Marian export to account for base changes
      
      * Fix copies
      
      * Implemented suggestions
      
      * Extend support for causal LM
      
      * Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"
      
      This reverts commit 0f4e39c5.
      
      * is_torch_available test to avoid failing imports
      
      * sorting parameterize parameters to solve ERROR gw0 gw1
      
      * tests fix
      
      * tests fix
      
      * GPT2 with past fix
      
      * Fixed stateful class attribute change that was breaking things when converting multiple models sequentially
      
      * Removed onnx file
      
      * Implemented suggestions
      
      * Fixed __init__ to resolve conflict with master
      
      * Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"
      
      This reverts commit 0f4e39c5
      
      .
      
      * is_torch_available test to avoid failing imports
      
      * sorting parameterize parameters to solve ERROR gw0 gw1
      
      * tests fix
      
      * tests fix
      
      * GPT2 with past fix
      
      * Fixed stateful class attribute change that was breaking things when converting multiple models sequentially
      
      * Removed onnx file
      
      * Implemented suggestions
      
      * Fixed __init__ to resolve conflict with master
      
      * Remove commented import
      
      * Remove ONNX model
      
      * Remove redundant class method
      
      * Tidy up imports
      
      * Fix quality
      
      * Refactor dummy input function
      
      * Add copied from statements to Marian config functions
      
      * Remove false copied from comments
      
      * Fix copy from comment
      Co-authored-by: default avatarMassimiliano Bruni <massimiliano.bruni@hcl.com>
      Co-authored-by: default avatarMichael Benayoun <mickbenayoun@gmail.com>
      6b655cc6
  32. 22 Dec, 2021 1 commit