1. 31 May, 2022 1 commit
  2. 25 May, 2022 1 commit
  3. 24 May, 2022 4 commits
    • Jason Phang's avatar
      [WIP] Adding GPT-NeoX-20B (#16659) · 71e60272
      Jason Phang authored
      
      
      * initial
      
      * first try
      
      * working 20B
      
      * 20B tokenizers
      
      * Docs
      
      * Import fixes for missing classes
      
      * Update docs, fixup
      
      * black formatting
      
      * isort
      
      * flake
      
      * dummy objects
      
      * documentation
      
      * Documentation yml
      
      * more docs
      
      * tweaks for tests
      
      * tokenization auto
      
      * fix neox tests
      
      * test
      
      * test
      
      * einsum
      
      * address PR feedback
      
      * Documentation
      
      * Update README.md
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_neox/__init__.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/models/gpt_neox/configuration_gpt_neox.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Remove undefined LaTeX syntax
      
      * Update to full url to avoid confusion about if that's supposed to refer to the Hub
      
      * fix auto
      
      * move tests
      
      * documentation fix
      
      * more doc fixes
      
      * test refactor
      
      * fix import
      
      * fix import
      
      * fix import
      
      * fix import
      
      * fix import
      
      * style fixes
      
      * More modeling fixes
      Co-authored-by: default avatarJason Phang <zp489@gr057.hpc.nyu.edu>
      Co-authored-by: default avatarStella Biderman <stellabiderman@gmail.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      71e60272
    • NielsRogge's avatar
      Clean up CLIP tests (#17380) · 374a2f69
      NielsRogge authored
      
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      374a2f69
    • Nicolas Patry's avatar
      Enabling `imageGPT` auto feature extractor. (#16871) · d9809298
      Nicolas Patry authored
      
      
      * Enablign `imageGPT` auto feature extractor.
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      
      * Small updates.
      
      * Update after rebase to use `input_ids` instead of `pixel_values`.
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      d9809298
    • NielsRogge's avatar
      Add LayoutLMv3 (#17060) · 31ee80d5
      NielsRogge authored
      
      
      * Make forward pass work
      
      * More improvements
      
      * Remove unused imports
      
      * Remove timm dependency
      
      * Improve loss calculation of token classifier
      
      * Fix most tests
      
      * Add docs
      
      * Add model integration test
      
      * Make all tests pass
      
      * Add LayoutLMv3FeatureExtractor
      
      * Improve integration test + make fixup
      
      * Add example script
      
      * Fix style
      
      * Add LayoutLMv3Processor
      
      * Fix style
      
      * Add option to add visual labels
      
      * Make more tokenizer tests pass
      
      * Fix more tests
      
      * Make more tests pass
      
      * Fix bug and improve docs
      
      * Fix import of processors
      
      * Improve docstrings
      
      * Fix toctree and improve docs
      
      * Fix auto tokenizer
      
      * Move tests to model folder
      
      * Move tests to model folder
      
      * change default behavior add_prefix_space
      
      * add prefix space for fast
      
      * add_prefix_spcae set to True for Fast
      
      * no space before `unique_no_split` token
      
      * add test to hightligh special treatment of added tokens
      
      * fix `test_batch_encode_dynamic_overflowing` by building a long enough example
      
      * fix `test_full_tokenizer` with add_prefix_token
      
      * Fix tokenizer integration test
      
      * Make the code more readable
      
      * Add tests for LayoutLMv3Processor
      
      * Fix style
      
      * Add model to README and update init
      
      * Apply suggestions from code review
      
      * Replace asserts by value errors
      
      * Add suggestion by @ducviet00
      
      * Add model to doc tests
      
      * Simplify script
      
      * Improve README
      
      * a step ahead to fix
      
      * Update pair_input_test
      
      * Make all tokenizer tests pass - phew
      
      * Make style
      
      * Add LayoutLMv3 to CI job
      
      * Fix auto mapping
      
      * Fix CI job name
      
      * Make all processor tests pass
      
      * Make tests of LayoutLMv2 and LayoutXLM consistent
      
      * Add copied from statements to fast tokenizer
      
      * Add copied from statements to slow tokenizer
      
      * Remove add_visual_labels attribute
      
      * Fix tests
      
      * Add link to notebooks
      
      * Improve docs of LayoutLMv3Processor
      
      * Fix reference to section
      Co-authored-by: default avatarSaulLu <lucilesaul.com@gmail.com>
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      31ee80d5
  4. 23 May, 2022 1 commit
    • Michael Benayoun's avatar
      Traced models serialization and torchscripting fix (#17206) · 2e7e4280
      Michael Benayoun authored
      * Fix torch.jit.script and pickling issues
      
      * Fix get_attr issues
      
      * Fix import in function
      
      * Fix GPT-J and T5 tracing for torch=1.11
      
      * Gate graph surgery on torch version
      
      * Modeling minor changes to enable TorchScripting
      
      * Model serialization / deserialization test
      
      * Remove _assert_is_none users
      2e7e4280
  5. 19 May, 2022 2 commits
  6. 18 May, 2022 2 commits
  7. 17 May, 2022 5 commits
  8. 16 May, 2022 5 commits
  9. 13 May, 2022 2 commits
  10. 12 May, 2022 3 commits
  11. 11 May, 2022 3 commits
  12. 10 May, 2022 2 commits
    • Leon Derczynski's avatar
      MobileBERT tokenizer tests (#16896) · 4a419d49
      Leon Derczynski authored
      
      
      * unhardcode pretrained model path, make it a class var
      
      * add tests for mobilebert tokenizer
      
      * allow tempfiles for vocab & merge similarity test to autodelete
      
      * add explanatory comments
      
      * remove unused imports, let make style do its.. thing
      
      * remove inheritance and use BERT tok tests for MobileBERT
      
      * Update tests/mobilebert/test_tokenization_mobilebert.py
      Co-authored-by: default avatarSaulLu <55560583+SaulLu@users.noreply.github.com>
      
      * amend class names, remove unused import, add fix for mobilebert's hub pathname
      
      * unhardcode pretrained model path, make it a class var
      
      * add tests for mobilebert tokenizer
      
      * allow tempfiles for vocab & merge similarity test to autodelete
      
      * add explanatory comments
      
      * remove unused imports, let make style do its.. thing
      
      * remove inheritance and use BERT tok tests for MobileBERT
      
      * Update tests/mobilebert/test_tokenization_mobilebert.py
      Co-authored-by: default avatarSaulLu <55560583+SaulLu@users.noreply.github.com>
      
      * amend class names, remove unused import, add fix for mobilebert's hub pathname
      
      * amend paths for model tests being in models/ subdir of /tests
      
      * explicitly rm test from prev path
      Co-authored-by: default avatarSaulLu <55560583+SaulLu@users.noreply.github.com>
      4a419d49
    • Jason Phang's avatar
      Add DebertaV2ForMultipleChoice (#17135) · 48a8f3da
      Jason Phang authored
      48a8f3da
  13. 09 May, 2022 1 commit
    • ghlai9665's avatar
      LayoutLMv2Processor: ensure 1-to-1 mapping between images and samples in case... · e9fd583c
      ghlai9665 authored
      LayoutLMv2Processor: ensure 1-to-1 mapping between images and samples in case of overflowing tokens (#17092)
      
      * add get_overflowing_images function to ensure 1-to-1 mapping between samples and images in LayoutLMv2Processor
      
      * make style
      
      * add test for overflowing_tokens, change assert to ValueError, avoiding unrelated formatting changes
      
      * change line length by passing --preview into black
      e9fd583c
  14. 04 May, 2022 5 commits
  15. 03 May, 2022 1 commit
    • Yih-Dar's avatar
      Move test model folders (#17034) · 19420fd9
      Yih-Dar authored
      
      
      * move test model folders (TODO: fix imports and others)
      
      * fix (potentially partially) imports (in model test modules)
      
      * fix (potentially partially) imports (in tokenization test modules)
      
      * fix (potentially partially) imports (in feature extraction test modules)
      
      * fix import utils.test_modeling_tf_core
      
      * fix path ../fixtures/
      
      * fix imports about generation.test_generation_flax_utils
      
      * fix more imports
      
      * fix fixture path
      
      * fix get_test_dir
      
      * update module_to_test_file
      
      * fix get_tests_dir from wrong transformers.utils
      
      * update config.yml (CircleCI)
      
      * fix style
      
      * remove missing imports
      
      * update new model script
      
      * update check_repo
      
      * update SPECIAL_MODULE_TO_TEST_MAP
      
      * fix style
      
      * add __init__
      
      * update self-scheduled
      
      * fix add_new_model scripts
      
      * check one way to get location back
      
      * python setup.py build install
      
      * fix import in test auto
      
      * update self-scheduled.yml
      
      * update slack notification script
      
      * Add comments about artifact names
      
      * fix for yolos
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      19420fd9