1. 13 Jun, 2023 1 commit
    • Sylvain Gugger's avatar
      Tied params cleanup (#24211) · 695928e1
      Sylvain Gugger authored
      * First test
      
      * Add info for all models
      
      * style
      
      * Repo consistency
      
      * Fix last model and cleanup prints
      
      * Repo consistency
      
      * Use consistent function for detecting tied weights
      695928e1
  2. 18 May, 2023 1 commit
  3. 06 Feb, 2023 1 commit
    • Sylvain Gugger's avatar
      Update quality tooling for formatting (#21480) · 6f79d264
      Sylvain Gugger authored
      * Result of black 23.1
      
      * Update target to Python 3.7
      
      * Switch flake8 to ruff
      
      * Configure isort
      
      * Configure isort
      
      * Apply isort with line limit
      
      * Put the right black version
      
      * adapt black in check copies
      
      * Fix copies
      6f79d264
  4. 23 Jan, 2023 1 commit
  5. 09 Nov, 2022 1 commit
    • Nicolas Patry's avatar
      Attempting to test automatically the `_keys_to_ignore`. (#20042) · bac2d29a
      Nicolas Patry authored
      
      
      * Attempting to test automatically the `_keys_to_ignore`.
      
      * Style.
      
      * First fix pass.
      
      * Moving test on its own.
      
      * Another batch.
      
      * Second round removing BatchNorm
      
      * Fixing layoutlmv{2,3} + support older Python.
      
      * Disable miss missing warning.
      
      * Removing dodgy additions.
      
      * Big pass.
      
      * mbart.
      
      * More corrections.
      
      * Fixup.
      
      * Updating test_correct_missing_keys
      
      * Add escape hatch for when the head has no extra params so doesn't need
      
      the missing keys check.
      
      * Fixing test.
      
      * Greener.
      
      * Green ! (except for weird splinter bug).
      
      * Adding a test about `named_parameters` usage.
      
      * Shorten message.
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * After rebase modifications.
      
      * More explicit condition checking.
      
      * Fixing slow tests issues.
      
      * Remove extra pdb.
      
      * Remove print.
      
      * Attempt to make failure consistent + fixing roc_bert.
      
      * Removing the seed  (all tests passing with it).
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      bac2d29a
  6. 14 Sep, 2022 1 commit
  7. 03 Aug, 2022 1 commit
    • LSinev's avatar
      Fix torch version comparisons (#18460) · 02b176c4
      LSinev authored
      Comparisons like
      version.parse(torch.__version__) > version.parse("1.6")
      are True for torch==1.6.0+cu101 or torch==1.6.0+cpu
      
      version.parse(version.parse(torch.__version__).base_version) are preferred (and available in pytorch_utils.py
      02b176c4
  8. 23 Jun, 2022 1 commit
  9. 03 May, 2022 1 commit
  10. 12 Apr, 2022 1 commit
    • Anmol Joshi's avatar
      Moved functions to pytorch_utils.py (#16625) · a315988b
      Anmol Joshi authored
      * Moved functions to pytorch_utils.py
      
      * isort formatting
      
      * Reverted tf changes
      
      * isort, make fix-copies
      
      * documentation fix
      
      * Fixed Conv1D import
      
      * Reverted research examples file
      
      * backward compatibility for pytorch_utils
      
      * missing import
      
      * isort fix
      a315988b
  11. 25 Mar, 2022 1 commit
  12. 23 Mar, 2022 1 commit
    • Sylvain Gugger's avatar
      Reorganize file utils (#16264) · 4975002d
      Sylvain Gugger authored
      * Split file_utils in several submodules
      
      * Fixes
      
      * Add back more objects
      
      * More fixes
      
      * Who exactly decided to import that from there?
      
      * Second suggestion to code with code review
      
      * Revert wront move
      
      * Fix imports
      
      * Adapt all imports
      
      * Adapt all imports everywhere
      
      * Revert this import, will fix in a separate commit
      4975002d
  13. 22 Mar, 2022 1 commit
  14. 15 Mar, 2022 1 commit
  15. 09 Feb, 2022 1 commit
  16. 26 Jan, 2022 1 commit
  17. 24 Jan, 2022 1 commit
  18. 11 Jan, 2022 1 commit
  19. 27 Dec, 2021 1 commit
    • Sylvain Gugger's avatar
      Doc styler v2 (#14950) · 87e6e4fe
      Sylvain Gugger authored
      * New doc styler
      
      * Fix issue with args at the start
      
      * Code sample fixes
      
      * Style code examples in MDX
      
      * Fix more patterns
      
      * Typo
      
      * Typo
      
      * More patterns
      
      * Do without black for now
      
      * Get more info in error
      
      * Docstring style
      
      * Re-enable check
      
      * Quality
      
      * Fix add_end_docstring decorator
      
      * Fix docstring
      87e6e4fe
  20. 21 Dec, 2021 1 commit
    • Sylvain Gugger's avatar
      Convert docstrings of modeling files (#14850) · 7af80f66
      Sylvain Gugger authored
      * Convert file_utils docstrings to Markdown
      
      * Test on BERT
      
      * Return block indent
      
      * Temporarily disable doc styler
      
      * Remove from quality checks as well
      
      * Remove doc styler mess
      
      * Remove check from circleCI
      
      * Fix typo
      
      * Convert file_utils docstrings to Markdown
      
      * Test on BERT
      
      * Return block indent
      
      * Temporarily disable doc styler
      
      * Remove from quality checks as well
      
      * Remove doc styler mess
      
      * Remove check from circleCI
      
      * Fix typo
      
      * Let's go on all other model files
      
      * Add templates too
      
      * Styling and quality
      7af80f66
  21. 18 Nov, 2021 2 commits
  22. 15 Oct, 2021 1 commit
  23. 22 Sep, 2021 1 commit
  24. 01 Sep, 2021 1 commit
    • Hamid Shojanazeri's avatar
      Fix for the issue of device-id getting hardcoded for token_type_ids during... · 53ee995a
      Hamid Shojanazeri authored
      Fix for the issue of device-id getting hardcoded for token_type_ids during Tracing for ConvBert (#12287)
      
      * added token_type_ids buffer to fix the issue #5664
      
      * Handling the case that position_id buffer is not registered
      
      * added token_type_ids buffer to fix the issue #5664
      
      * modified to support device conversion when the model is traced
      53ee995a
  25. 03 Aug, 2021 1 commit
  26. 26 Jul, 2021 1 commit
  27. 14 Jun, 2021 1 commit
  28. 08 Jun, 2021 1 commit
  29. 07 Jun, 2021 1 commit
    • Fran莽ois Lagunas's avatar
      Fixes bug that appears when using QA bert and distilation. (#12026) · f8bd8c6c
      Fran莽ois Lagunas authored
      * Fixing bug that appears when using distilation (and potentially other uses).
      During backward pass Pytorch complains with:
      RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation
      This happens because the QA model code modifies the start_positions and end_positions input tensors, using clamp_ function: as a consequence the teacher and the student both modifies the inputs, and backward pass fails.
      
      * Fixing all models QA clamp_ bug.
      f8bd8c6c
  30. 01 Jun, 2021 1 commit
  31. 20 May, 2021 1 commit
  32. 04 May, 2021 1 commit
  33. 26 Apr, 2021 1 commit
  34. 31 Mar, 2021 1 commit
  35. 05 Mar, 2021 2 commits
    • Daniel Hug's avatar
      Refactoring checkpoint names for multiple models (#10527) · 90ecc296
      Daniel Hug authored
      * Refactor checkpoint name in ALBERT and ALBERT_tf
      
      * Refactor checkpoint name in BART and BART_tf
      
      * Refactor checkpoint name in BERT generation
      
      * Refactor checkpoint name in Blenderbot_tf
      
      * Refactor checkpoint name in Blenderbot_small_tf
      
      * Refactor checkpoint name in ConvBERT AND CONVBERT_TF
      
      * Refactor checkpoint name in CTRL AND CTRL_TF
      
      * Refactor checkpoint name in DistilBERT AND DistilBERT_TF
      
      * Refactor checkpoint name in DistilBERT redo
      
      * Refactor checkpoint name in Electra and Electra_tf
      
      * Refactor checkpoint name in FlauBERT and FlauBERT_tf
      
      * Refactor checkpoint name in FSMT
      
      * Refactor checkpoint name in GPT2 and GPT2_tf
      
      * Refactor checkpoint name in IBERT
      
      * Refactor checkpoint name in LED and LED_tf
      
      * Refactor checkpoint name in Longformer and Longformer_tf
      
      * Refactor checkpoint name in Lxmert and Lxmert_tf
      
      * Refactor checkpoint name in Marian_tf
      
      * Refactor checkpoint name in MBART and MBART_tf
      
      * Refactor checkpoint name in MobileBERT and MobileBERT_tf
      
      * Refactor checkpoint name in mpnet and mpnet_tf
      
      * Refactor checkpoint name in openai and openai_tf
      
      * Refactor checkpoint name in pegasus_tf
      
      * Refactor checkpoint name in reformer
      
      * Refactor checkpoint name in Roberta and Roberta_tf
      
      * Refactor checkpoint name in SqueezeBert
      
      * Refactor checkpoint name in Transformer_xl and Transformer_xl_tf
      
      * Refactor checkpoint name in XLM and XLM_tf
      
      * Refactor checkpoint name in XLNET and XLNET_tf
      
      * Refactor checkpoint name in BERT_tf
      
      * run make tests, style, quality, fixup
      90ecc296
    • Sylvain Gugger's avatar
      Fix embeddings for PyTorch 1.8 (#10549) · 7da995c0
      Sylvain Gugger authored
      * Fix embeddings for PyTorch 1.8
      
      * Try with PyTorch 1.8.0
      
      * Fix embeddings init
      
      * Fix copies
      
      * Typo
      
      * More typos
      7da995c0
  36. 27 Jan, 2021 1 commit
  37. 19 Jan, 2021 2 commits