1. 16 May, 2022 1 commit
  2. 13 May, 2022 3 commits
  3. 12 May, 2022 4 commits
  4. 11 May, 2022 5 commits
  5. 10 May, 2022 6 commits
    • Leon Derczynski's avatar
      MobileBERT tokenizer tests (#16896) · 4a419d49
      Leon Derczynski authored
      
      
      * unhardcode pretrained model path, make it a class var
      
      * add tests for mobilebert tokenizer
      
      * allow tempfiles for vocab & merge similarity test to autodelete
      
      * add explanatory comments
      
      * remove unused imports, let make style do its.. thing
      
      * remove inheritance and use BERT tok tests for MobileBERT
      
      * Update tests/mobilebert/test_tokenization_mobilebert.py
      Co-authored-by: default avatarSaulLu <55560583+SaulLu@users.noreply.github.com>
      
      * amend class names, remove unused import, add fix for mobilebert's hub pathname
      
      * unhardcode pretrained model path, make it a class var
      
      * add tests for mobilebert tokenizer
      
      * allow tempfiles for vocab & merge similarity test to autodelete
      
      * add explanatory comments
      
      * remove unused imports, let make style do its.. thing
      
      * remove inheritance and use BERT tok tests for MobileBERT
      
      * Update tests/mobilebert/test_tokenization_mobilebert.py
      Co-authored-by: default avatarSaulLu <55560583+SaulLu@users.noreply.github.com>
      
      * amend class names, remove unused import, add fix for mobilebert's hub pathname
      
      * amend paths for model tests being in models/ subdir of /tests
      
      * explicitly rm test from prev path
      Co-authored-by: default avatarSaulLu <55560583+SaulLu@users.noreply.github.com>
      4a419d49
    • Jason Phang's avatar
      Add DebertaV2ForMultipleChoice (#17135) · 48a8f3da
      Jason Phang authored
      48a8f3da
    • Nicolas Brousse's avatar
      Add MLFLOW_FLATTEN_PARAMS support in MLflowCallback (#17148) · e99f0efe
      Nicolas Brousse authored
      * add support for MLFLOW_FLATTEN_PARAMS
      
      * ensure key is str
      
      * fix style and update warning msg
      
      * Empty commit to trigger CI
      
      * fix bug in check_inits.py
      
      * add unittest for flatten_dict utils
      
      * fix 'NoneType' object is not callable on __del__
      
      * add generic flatten_dict unittest to SPECIAL_MODULE_TO_TEST_MAP
      
      * fix style
      e99f0efe
    • Stas Bekman's avatar
      missing file (#17164) · 976835d5
      Stas Bekman authored
      976835d5
    • Stas Bekman's avatar
      [Deepspeed] add many more models to the model zoo test (#12695) · f8615044
      Stas Bekman authored
      * model zoo take 2
      
      * add deberta
      
      * new param for zero2
      
      * doc update
      
      * doc update
      
      * add layoutlm
      
      * bump deepspeed
      
      * add deberta-v2, funnel, longformer
      
      * new models
      
      * style
      
      * add t5_v1
      
      * update TAPAS status
      
      * reorg problematic models
      
      * move doc to another PR
      
      * style
      
      * fix checkpoint check test
      
      * making progress on more models running
      
      * cleanup
      
      * new version
      
      * cleanup
      f8615044
    • Nicolas Patry's avatar
      LogSumExp trick `question_answering` pipeline. (#17143) · 6d80c92c
      Nicolas Patry authored
      * LogSumExp trick `question_answering` pipeline.
      
      * Adding a failing test.
      6d80c92c
  6. 09 May, 2022 3 commits
  7. 06 May, 2022 1 commit
  8. 05 May, 2022 1 commit
  9. 04 May, 2022 6 commits
  10. 03 May, 2022 4 commits
    • Sylvain Gugger's avatar
      Fix RNG reload in resume training from epoch checkpoint (#17055) · 1c9fcd0e
      Sylvain Gugger authored
      * Fix RNG reload in resume training from epoch checkpoint
      
      * Fix test
      1c9fcd0e
    • Sylvain Gugger's avatar
      Make Trainer compatible with sharded checkpoints (#17053) · a8fa2f91
      Sylvain Gugger authored
      * Make Trainer compatible with sharded checkpoints
      
      * Add doc
      a8fa2f91
    • Yih-Dar's avatar
      Move test model folders (#17034) · 19420fd9
      Yih-Dar authored
      
      
      * move test model folders (TODO: fix imports and others)
      
      * fix (potentially partially) imports (in model test modules)
      
      * fix (potentially partially) imports (in tokenization test modules)
      
      * fix (potentially partially) imports (in feature extraction test modules)
      
      * fix import utils.test_modeling_tf_core
      
      * fix path ../fixtures/
      
      * fix imports about generation.test_generation_flax_utils
      
      * fix more imports
      
      * fix fixture path
      
      * fix get_test_dir
      
      * update module_to_test_file
      
      * fix get_tests_dir from wrong transformers.utils
      
      * update config.yml (CircleCI)
      
      * fix style
      
      * remove missing imports
      
      * update new model script
      
      * update check_repo
      
      * update SPECIAL_MODULE_TO_TEST_MAP
      
      * fix style
      
      * add __init__
      
      * update self-scheduled
      
      * fix add_new_model scripts
      
      * check one way to get location back
      
      * python setup.py build install
      
      * fix import in test auto
      
      * update self-scheduled.yml
      
      * update slack notification script
      
      * Add comments about artifact names
      
      * fix for yolos
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      19420fd9
    • Sanchit Gandhi's avatar
      [FlaxBert] Add ForCausalLM (#16995) · cd9274d0
      Sanchit Gandhi authored
      * [FlaxBert] Add ForCausalLM
      
      * make style
      
      * fix output attentions
      
      * Add RobertaForCausalLM
      
      * remove comment
      
      * fix fx-to-pt model loading
      
      * remove comment
      
      * add modeling tests
      
      * add enc-dec model tests
      
      * add big_bird
      
      * add electra
      
      * make style
      
      * make repo-consitency
      
      * add to docs
      
      * remove roberta test
      
      * quality
      
      * amend cookiecutter
      
      * fix attention_mask bug in flax bert model tester
      
      * tighten pt-fx thresholds to 1e-5
      
      * add 'copied from' statements
      
      * amend 'copied from' statements
      
      * amend 'copied from' statements
      
      * quality
      cd9274d0
  11. 02 May, 2022 3 commits
  12. 29 Apr, 2022 2 commits
  13. 28 Apr, 2022 1 commit