1. 05 Oct, 2022 1 commit
    • Add sudachi and jumanpp tokenizers for bert_japanese (#19043) · 2f53ab57
      r-terada authored
      * add sudachipy and jumanpp tokenizers for bert_japanese
      
      * use ImportError instead of ModuleNotFoundError in SudachiTokenizer and JumanppTokenizer
      
      * put test cases of test_tokenization_bert_japanese in one line
      
      * add require_sudachi and require_jumanpp decorator for testing
      
      * add sudachi and pyknp(jumanpp) to dependencies
      
      * remove sudachi_dict_small and sudachi_dict_full from dependencies
      
      * empty commit for ci
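The optional-dependency handling named in the commit above (catching `ImportError` and gating tests behind `require_*` decorators) can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the code from the commit: `is_package_available`, `require_package`, and `load_backend` are hypothetical names, and the install hint is an assumption.

```python
import importlib
import importlib.util
import unittest


def is_package_available(name):
    """Return True if a top-level package can be located without importing it."""
    return importlib.util.find_spec(name) is not None


def require_package(name):
    """Hypothetical generic form of a require_* decorator: skip unless `name` is installed."""
    return unittest.skipUnless(is_package_available(name), f"test requires {name}")


def load_backend(name):
    """Import an optional backend, re-raising any import failure with an install hint."""
    try:
        return importlib.import_module(name)
    except ImportError as error:
        # ModuleNotFoundError is a subclass of ImportError, so catching the base class
        # also covers import failures other than a missing module, such as a compiled
        # extension that fails to load.
        raise ImportError(
            f"{name} is required for this tokenizer; try `pip install {name}`."
        ) from error
```

A test decorated with `@require_package("sudachipy")` would then be reported as skipped, rather than failed, on machines without that package.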
  2. 29 Sep, 2022 1 commit
    • Fix test fetching for examples (#19237) · 655f72a6
      Sylvain Gugger authored
      * Fix test fetching for examples
      
      * Fake example modif
      
      * Debug statements
      
      * Typo
      
      * You need to persist the file...
      
      * Revert change in example
      
      * Remove debug statements
  3. 28 Sep, 2022 1 commit
  4. 16 Sep, 2022 1 commit
    • Organize test jobs (#19058) · 820cb97a
      Sylvain Gugger authored
      * Tests conditional run
      
      * Syntax
      
      * Deps
      
      * Try early exit
      
      * Another way
      
      * Test with no tests to run
      
      * Test all
      
      * Typo
      
      * Try this way
      
      * With tests to run
      
      * Mostly finished
      
      * Typo
      
      * With a modification in one file only
      
      * No change, no tests
      
      * Final cleanup
      
      * Address review comments
  5. 15 Sep, 2022 1 commit
  6. 14 Sep, 2022 1 commit
  7. 25 Aug, 2022 1 commit
    • Determine framework automatically before ONNX export (#18615) · fbf382c8
      Craig Chan authored
      
      
      * Automatic detection for framework to use when exporting to ONNX
      
      * Log message change
      
      * Incorporating PR comments, adding unit test
      
      * Adding tf for pip install for run_tests_onnxruntime CI
      
      * Restoring past changes to circleci yaml and test_onnx_v2.py, tests moved to tests/onnx/test_features.py
      
      * Fixup
      
      * Adding test to fetcher
      
      * Updating circleci config to log more
      
      * Changing test class name
      
      * Comment typo fix in tests/onnx/test_features.py
      Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
      
      * Moving torch_str/tf_str to self.framework_pt/tf
      
      * Remove -rA flag in circleci config
      Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
  8. 23 Aug, 2022 1 commit
  9. 18 Aug, 2022 1 commit
  10. 10 Aug, 2022 1 commit
    • TF Examples Rewrite (#18451) · 6eb51450
      Matt authored
      
      
      * Finished QA example
      
      * Dodge a merge conflict
      
      * Update text classification and LM examples
      
      * Update NER example
      
      * New Keras metrics WIP, fix NER example
      
      * Update NER example
      
      * Update MC, summarization and translation examples
      
      * Add XLA warnings when shapes are variable
      
      * Make sure batch_size is consistently scaled by num_replicas
      
      * Add PushToHubCallback to all models
      
      * Add docs links for KerasMetricCallback
      
      * Add docs links for prepare_tf_dataset and jit_compile
      
      * Correct inferred model names
      
      * Don't assume the dataset has 'lang'
      
      * Don't assume the dataset has 'lang'
      
      * Write metrics in text classification
      
      * Add 'framework' to TrainingArguments and TFTrainingArguments
      
      * Export metrics in all examples and add tests
      
      * Fix training args for Flax
      
      * Update command line args for translation test
      
      * make fixup
      
      * Fix accidentally running other tests in fp16
      
      * Remove do_train/do_eval from run_clm.py
      
      * Remove do_train/do_eval from run_mlm.py
      
      * Add tensorflow tests to circleci
      
      * Fix circleci
      
      * Update examples/tensorflow/language-modeling/run_mlm.py
      Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update examples/tensorflow/test_tensorflow_examples.py
      Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update examples/tensorflow/translation/run_translation.py
      Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update examples/tensorflow/token-classification/run_ner.py
      Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
      
      * Fix save path for tests
      
      * Fix some model card kwargs
      
      * Explain the magical -1000
      
      * Actually enable tests this time
      
      * Skip text classification PR until we fix shape inference
      
      * make fixup
      Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
  11. 08 Aug, 2022 1 commit
  12. 26 Jul, 2022 1 commit
  13. 20 Jul, 2022 1 commit
  14. 19 Jul, 2022 1 commit
  15. 07 Jul, 2022 1 commit
  16. 02 Jun, 2022 1 commit
  17. 01 Jun, 2022 1 commit
  18. 24 May, 2022 1 commit
    • Add LayoutLMv3 (#17060) · 31ee80d5
      NielsRogge authored
      
      
      * Make forward pass work
      
      * More improvements
      
      * Remove unused imports
      
      * Remove timm dependency
      
      * Improve loss calculation of token classifier
      
      * Fix most tests
      
      * Add docs
      
      * Add model integration test
      
      * Make all tests pass
      
      * Add LayoutLMv3FeatureExtractor
      
      * Improve integration test + make fixup
      
      * Add example script
      
      * Fix style
      
      * Add LayoutLMv3Processor
      
      * Fix style
      
      * Add option to add visual labels
      
      * Make more tokenizer tests pass
      
      * Fix more tests
      
      * Make more tests pass
      
      * Fix bug and improve docs
      
      * Fix import of processors
      
      * Improve docstrings
      
      * Fix toctree and improve docs
      
      * Fix auto tokenizer
      
      * Move tests to model folder
      
      * Move tests to model folder
      
      * change default behavior add_prefix_space
      
      * add prefix space for fast
      
      * add_prefix_space set to True for Fast
      
      * no space before `unique_no_split` token
      
      * add test to highlight special treatment of added tokens
      
      * fix `test_batch_encode_dynamic_overflowing` by building a long enough example
      
      * fix `test_full_tokenizer` with add_prefix_token
      
      * Fix tokenizer integration test
      
      * Make the code more readable
      
      * Add tests for LayoutLMv3Processor
      
      * Fix style
      
      * Add model to README and update init
      
      * Apply suggestions from code review
      
      * Replace asserts by value errors
      
      * Add suggestion by @ducviet00
      
      * Add model to doc tests
      
      * Simplify script
      
      * Improve README
      
      * a step ahead to fix
      
      * Update pair_input_test
      
      * Make all tokenizer tests pass - phew
      
      * Make style
      
      * Add LayoutLMv3 to CI job
      
      * Fix auto mapping
      
      * Fix CI job name
      
      * Make all processor tests pass
      
      * Make tests of LayoutLMv2 and LayoutXLM consistent
      
      * Add copied from statements to fast tokenizer
      
      * Add copied from statements to slow tokenizer
      
      * Remove add_visual_labels attribute
      
      * Fix tests
      
      * Add link to notebooks
      
      * Improve docs of LayoutLMv3Processor
      
      * Fix reference to section
      Co-authored-by: SaulLu <lucilesaul.com@gmail.com>
      Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
  19. 16 May, 2022 1 commit
  20. 12 May, 2022 1 commit
  21. 09 May, 2022 1 commit
  22. 04 May, 2022 2 commits
  23. 03 May, 2022 1 commit
    • Move test model folders (#17034) · 19420fd9
      Yih-Dar authored
      
      
      * move test model folders (TODO: fix imports and others)
      
      * fix (potentially partially) imports (in model test modules)
      
      * fix (potentially partially) imports (in tokenization test modules)
      
      * fix (potentially partially) imports (in feature extraction test modules)
      
      * fix import utils.test_modeling_tf_core
      
      * fix path ../fixtures/
      
      * fix imports about generation.test_generation_flax_utils
      
      * fix more imports
      
      * fix fixture path
      
      * fix get_test_dir
      
      * update module_to_test_file
      
      * fix get_tests_dir from wrong transformers.utils
      
      * update config.yml (CircleCI)
      
      * fix style
      
      * remove missing imports
      
      * update new model script
      
      * update check_repo
      
      * update SPECIAL_MODULE_TO_TEST_MAP
      
      * fix style
      
      * add __init__
      
      * update self-scheduled
      
      * fix add_new_model scripts
      
      * check one way to get location back
      
      * python setup.py build install
      
      * fix import in test auto
      
      * update self-scheduled.yml
      
      * update slack notification script
      
      * Add comments about artifact names
      
      * fix for yolos
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
  24. 02 May, 2022 1 commit
  25. 30 Apr, 2022 1 commit
  26. 15 Apr, 2022 1 commit
  27. 14 Apr, 2022 1 commit
  28. 08 Apr, 2022 1 commit
  29. 28 Mar, 2022 1 commit
  30. 23 Mar, 2022 1 commit
  31. 11 Mar, 2022 1 commit
  32. 09 Mar, 2022 1 commit
    • Add ONNX export for ViT (#15658) · 50dd314d
      lewtun authored
      
      
      * Add ONNX support for ViT
      
      * Refactor to use generic preprocessor
      
      * Add vision dep to tests
      
      * Extend ONNX slow tests to ViT
      
      * Add dummy image generator
      
      * Use model_type to determine modality
      
      * Add deprecation warnings for tokenizer argument
      
      * Add warning when overwriting the preprocessor
      
      * Add optional args to docstrings
      
      * Add minimum PyTorch version to OnnxConfig
      
      * Refactor OnnxConfig class variables from CONSTANT_NAME to snake_case
      
      * Add reasonable value for default atol
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
  33. 18 Feb, 2022 1 commit
  34. 10 Jan, 2022 1 commit
    • Make OpenAIGPTTokenizer work with SpaCy 2.x and 3.x (#15019) · a54961c5
      cody-moveworks authored
      * Make OpenAIGPTTokenizer work with SpaCy 3.x
      
      SpaCy 3.x introduced an API change to creating the tokenizer that
      breaks OpenAIGPTTokenizer. The old API for creating the tokenizer in
      SpaCy 2.x no longer works under SpaCy 3.x, but the new API for creating
      the tokenizer in SpaCy 3.x DOES work under SpaCy 2.x. Switching to the
      new API should allow OpenAIGPTTokenizer to work under both SpaCy 2.x and
      SpaCy 3.x versions.
      
      * Add is_spacy_available and is_ftfy_available methods to file utils
      
      * Add spacy and ftfy unittest decorator to testing utils
      
      * Add tests for OpenAIGPTTokenizer that require spacy and ftfy
      
      * Modify CircleCI config to run tests that require spacy and ftfy
      
      * Remove unneeded unittest decorators and reuse test code
      
      * Run make fixup
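The SpaCy API change described in the commit above can be illustrated with a short sketch. It assumes the fix amounts to reading the tokenizer from the pipeline object instead of building it through `Defaults`. `make_word_tokenizer` is a hypothetical helper, and the whitespace fallback exists only so the sketch runs without SpaCy installed; neither is part of the commit.

```python
import importlib.util


def make_word_tokenizer():
    """Return a callable that maps a string to a list of token strings."""
    if importlib.util.find_spec("spacy") is None:
        # Assumption for this sketch only: fall back to whitespace splitting
        # when SpaCy is not installed.
        return str.split

    from spacy.lang.en import English

    nlp = English()
    # SpaCy 2.x-only construction, which no longer works under 3.x:
    #     tokenizer = nlp.Defaults.create_tokenizer(nlp)
    # The attribute below is available under both 2.x and 3.x.
    tokenizer = nlp.tokenizer
    return lambda text: [token.text for token in tokenizer(text)]


tokens = make_word_tokenizer()("Hello world")
```

Reading `nlp.tokenizer` keeps a single code path for both major versions, which matches the compatibility goal stated in the commit message.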
  35. 27 Dec, 2021 1 commit
    • Doc styler v2 (#14950) · 87e6e4fe
      Sylvain Gugger authored
      * New doc styler
      
      * Fix issue with args at the start
      
      * Code sample fixes
      
      * Style code examples in MDX
      
      * Fix more patterns
      
      * Typo
      
      * Typo
      
      * More patterns
      
      * Do without black for now
      
      * Get more info in error
      
      * Docstring style
      
      * Re-enable check
      
      * Quality
      
      * Fix add_end_docstring decorator
      
      * Fix docstring
  36. 21 Dec, 2021 1 commit
    • Convert docstrings of modeling files (#14850) · 7af80f66
      Sylvain Gugger authored
      * Convert file_utils docstrings to Markdown
      
      * Test on BERT
      
      * Return block indent
      
      * Temporarily disable doc styler
      
      * Remove from quality checks as well
      
      * Remove doc styler mess
      
      * Remove check from circleCI
      
      * Fix typo
      
      * Convert file_utils docstrings to Markdown
      
      * Test on BERT
      
      * Return block indent
      
      * Temporarily disable doc styler
      
      * Remove from quality checks as well
      
      * Remove doc styler mess
      
      * Remove check from circleCI
      
      * Fix typo
      
      * Let's go on all other model files
      
      * Add templates too
      
      * Styling and quality
  37. 17 Dec, 2021 1 commit
  38. 09 Dec, 2021 1 commit
  39. 08 Dec, 2021 1 commit