1. 01 Nov, 2021 1 commit
  2. 29 Oct, 2021 1 commit
  3. 28 Oct, 2021 1 commit
    • Add SegFormer (#14019) · 1dc96a76
      NielsRogge authored
      
      
      * First draft
      
      * Make style & quality
      
      * Improve conversion script
      
      * Add print statement to see actual slice
      
      * Make absolute tolerance smaller
      
      * Fix image classification models
      
      * Add post_process_semantic method
      
      * Disable padding
      
      * Improve conversion script
      
      * Rename to ForSemanticSegmentation, add integration test, remove post_process methods
      
      * Improve docs
      
      * Fix code quality
      
      * Fix feature extractor tests
      
      * Fix tests for image classification model
      
      * Delete file
      
      * Add is_torch_available to feature extractor
      
      * Improve documentation of feature extractor methods
      
      * Apply suggestions from @sgugger's code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Apply some more suggestions of code review
      
      * Rebase with master
      
      * Fix rebase issues
      
      * Make sure model only outputs hidden states when the user wants to
      
      * Apply suggestions from code review
      
      * Add pad method
      
      * Support padding of 2d images
      
      * Add print statement
      
      * Add print statement
      
      * Move padding method to SegformerFeatureExtractor
      
      * Fix issue
      
      * Add casting of segmentation maps
      
      * Add test for padding
      
      * Add small note about padding
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
  4. 26 Oct, 2021 1 commit
  5. 22 Oct, 2021 1 commit
    • Translate README.md to Korean (#14015) · 9f53f049
      Yeoun Yi authored
      * Create README_ko.md
      
      * Update README.md
      
      * Update README_zh-hans.md
      
      * Update README_zh-hant.md
      
      * Update README_ko.md
      
      * Update check_copies.py
      
      * Update README_ko.md
      
      * typo
      
      * match with readme_ko
  6. 15 Oct, 2021 1 commit
  7. 12 Oct, 2021 1 commit
    • Add TFEncoderDecoderModel + Add cross-attention to some TF models (#13222) · 8b240a06
      Yih-Dar authored
      
      
      * Add cross attentions to TFGPT2Model
      
      * Add TFEncoderDecoderModel
      
      * Add TFBaseModelOutputWithPoolingAndCrossAttentions
      
      * Add cross attentions to TFBertModel
      
      * Fix past or past_key_values argument issue
      
      * Fix generation
      
      * Fix save and load
      
      * Add some checks and comments
      
      * Clean the code that deals with past keys/values
      
      * Add kwargs to processing_inputs
      
      * Add serving_output to TFEncoderDecoderModel
      
      * Some cleaning + fix use_cache value issue
      
      * Fix tests + add bert2bert/bert2gpt2 tests
      
      * Fix more tests
      
      * Ignore crossattention.bias when loading GPT2 weights into TFGPT2
      
      * Fix return_dict_in_generate in tf generation
      
      * Fix is_token_logit_eos_token bug in tf generation
      
      * Finalize the tests after fixing some bugs
      
      * Fix another is_token_logit_eos_token bug in tf generation
      
      * Add/Update docs
      
      * Add TFBertEncoderDecoderModelTest
      
      * Clean test script
      
      * Add TFEncoderDecoderModel to the library
      
      * Add cross attentions to TFRobertaModel
      
      * Add TFRobertaEncoderDecoderModelTest
      
      * make style
      
      * Change the way of position_ids computation
      
      * bug fix
      
      * Fix copies in tf_albert
      
      * Remove some copied from and apply some fix-copies
      
      * Remove some copied
      
      * Add cross attentions to some other TF models
      
      * Remove encoder_hidden_states from TFLayoutLMModel.call for now
      
      * Make style
      
      * Fix TFRemBertForCausalLM
      
      * Revert the change to longformer + Remove copies
      
      * Revert the change to albert and convbert + Remove copies
      
      * make quality
      
      * make style
      
      * Add TFRembertEncoderDecoderModelTest
      
      * make quality and fix-copies
      
      * test TFRobertaForCausalLM
      
      * Fixes for failed tests
      
      * Fixes for failed tests
      
      * fix more tests
      
      * Fixes for failed tests
      
      * Fix Auto mapping order
      
      * Fix TFRemBertEncoder return value
      
      * fix tf_rembert
      
      * Check copies are OK
      
      * Fix missing TFBaseModelOutputWithPastAndCrossAttentions is not defined
      
      * Add TFEncoderDecoderModelSaveLoadTests
      
      * fix tf weight loading
      
      * check the change of use_cache
      
      * Revert the change
      
      * Add missing test_for_causal_lm for TFRobertaModelTest
      
      * Try cleaning past
      
      * fix _reorder_cache
      
      * Revert some files to original versions
      
      * Keep as many copies as possible
      
      * Apply suggested changes - Use raise ValueError instead of assert
      
      * Move import to top
      
      * Fix wrong require_torch
      
      * Replace more assert by raise ValueError
      
      * Add test_pt_tf_model_equivalence (the test won't pass for now)
      
      * add test for loading/saving
      
      * finish
      
      * finish
      
      * Remove test_pt_tf_model_equivalence
      
      * Update tf modeling template
      
      * Remove pooling, added in the prev. commit, from MainLayer
      
      * Update tf modeling test template
      
      * Move inputs["use_cache"] = False to modeling_tf_utils.py
      
      * Fix torch.Tensor in the comment
      
      * fix use_cache
      
      * Fix missing use_cache in ElectraConfig
      
      * Add a note to from_pretrained
      
      * Fix style
      
      * Change test_encoder_decoder_save_load_from_encoder_decoder_from_pt
      
      * Fix TFMLP (in TFGPT2) activation issue
      
      * Fix None past_key_values value in serving_output
      
      * Don't call get_encoderdecoder_model in TFEncoderDecoderModelTest.test_configuration_tie until we have a TF checkpoint on Hub
      
      * Apply review suggestions - style for cross_attns in serving_output
      
      * Apply review suggestions - change assert + docstrings
      
      * break the error message to respect the char limit
      
      * deprecate the argument past
      
      * fix docstring style
      
      * Update the encoder-decoder rst file
      
      * fix Unknown interpreted text role "method"
      
      * fix typo
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
  8. 07 Oct, 2021 1 commit
  9. 06 Oct, 2021 1 commit
  10. 29 Sep, 2021 2 commits
  11. 27 Sep, 2021 1 commit
  12. 21 Sep, 2021 2 commits
    • [FLAX] Question Answering Example (#13649) · 78807d86
      Kamal Raj authored
      * flax qa example
      
      * Updated README:  Added Large model
      
      * added utils_qa.py FULL_COPIES
      
      * Updates:
      1. Copyright Year updated
      2. added dtype arg
      3. passing seed and dtype to load model
      4. Check eval flag before running eval
      
      * updated README
      
      * updated code comment
    • beit-flax (#13515) · a2dec768
      Kamal Raj authored
      * beit-flax
      
      * updated FLAX_BEIT_MLM_DOCSTRING
      
      * removed bool_masked_pos from classification
      
      * updated Copyright
      
      * code refactoring: x -> embeddings
      
      * updated test: rm from_pt
      
      * Update docs/source/model_doc/beit.rst
      
      * model code dtype updates and
      other changes according to review
      
      * relative_position_bias
      revert back to pytorch design
  13. 16 Sep, 2021 2 commits
  14. 14 Sep, 2021 1 commit
  15. 13 Sep, 2021 1 commit
  16. 08 Sep, 2021 1 commit
  17. 01 Sep, 2021 1 commit
  18. 31 Aug, 2021 1 commit
    • Tests fetcher tests (#13340) · 8b2de0e4
      Sylvain Gugger authored
      * Incorporate tests dependencies in tests_fetcher
      
      * Harder modif
      
      * Debug
      
      * Loop through all files
      
      * Last modules
      
      * Remove debug statement
  19. 30 Aug, 2021 1 commit
  20. 27 Aug, 2021 1 commit
    • Add Wav2Vec2 & Hubert ForSequenceClassification (#13153) · b6f332ec
      Anton Lozhkov authored
      * Add hubert classifier + tests
      
      * Add hubert classifier + tests
      
      * Dummies for all classification tests
      
      * Wav2Vec2 classifier + ER test
      
      * Fix hubert integration tests
      
      * Add hubert IC
      
      * Pass tests for all classification tasks on Hubert
      
      * Pass all tests + copies
      
      * Move models to the SUPERB org
  21. 25 Aug, 2021 1 commit
  22. 23 Aug, 2021 1 commit
    • Make Flax GPT2 working with cross attention (#13008) · 2e20c0f3
      Yih-Dar authored
      
      
      * make flax gpt2 working with cross attention
      
      * Remove encoder->decoder projection layer
      
      * A draft (incomplete) for FlaxEncoderDecoderModel
      
      * Add the method from_encoder_decoder_pretrained + the docstrings
      
      * Fix the mistakes of using EncoderDecoderModel
      
      * Fix style
      
      * Add FlaxEncoderDecoderModel to the library
      
      * Fix cyclic imports
      
      * Add FlaxEncoderDecoderModel to modeling_flax_auto.py
      
      * Remove question comments
      
      * add tests for FlaxEncoderDecoderModel
      
      * add flax_encoder_decoder to the lists of ignored entries in check_repo.py
      
      * fix missing required positional arguments
      
      * Remove **kwargs when creating FlaxEncoderDecoderModel in from_encoder_decoder_pretrained()
      
      Also fix generation eos/pad tokens issue
      
      * Fix: Use sequences from the generated_output
      
      * Change a check from assert to raise ValueError
      
      * Fix examples and token ids issues
      
      * Fix missing all_cross_attentions when outputting tuple in modeling_gpt2
      
      * Remove the changes in configuration docstrings.
      
      * allow for bert 2 gpt2
      
      * make fix-copies
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Change remaining examples to bert2gpt2
      
      * Change the test to Bert2GPT2
      
      * Fix examples
      
      * Fix import
      
      * Fix unpack bug
      
      * Rename to FlaxEncoderDecoderModelTest and change the test to bert2gpt2
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Fix: NotImplentedError -> NotImplementedError
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * up
      
      * finalize
      Co-authored-by: ydshieh <ydshieh@user.noreply>
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
  23. 16 Aug, 2021 1 commit
  24. 12 Aug, 2021 3 commits
  25. 10 Aug, 2021 2 commits
  26. 06 Aug, 2021 1 commit
    • [WIP] Disentangle auto modules from other modeling files (#13023) · 9870093f
      Sylvain Gugger authored
      * Initial work
      
      * All auto models
      
      * All tf auto models
      
      * All flax auto models
      
      * Tokenizers
      
      * Add feature extractors
      
      * Fix typos
      
      * Fix other typo
      
      * Use the right config
      
      * Remove old mapping names and update logic in AutoTokenizer
      
      * Update check_table
      
      * Fix copies and check_repo script
      
      * Fix last test
      
      * Add back name
      
      * clean up
      
      * Update template
      
      * Update template
      
      * Forgot a )
      
      * Use alternative to fixup
      
      * Fix TF model template
      
      * Address review comments
      
      * Address review comments
      
      * Style
  27. 04 Aug, 2021 2 commits
    • Add BEiT (#12994) · 83e5a106
      NielsRogge authored
      
      
      * First pass
      
      * Make conversion script work
      
      * Improve conversion script
      
      * Fix bug, conversion script working
      
      * Improve conversion script, implement BEiTFeatureExtractor
      
      * Make conversion script work based on URL
      
      * Improve conversion script
      
      * Add tests, add documentation
      
      * Fix bug in conversion script
      
      * Fix another bug
      
      * Add support for converting masked image modeling model
      
      * Add support for converting masked image modeling
      
      * Fix bug
      
      * Add print statement for debugging
      
      * Fix another bug
      
      * Make conversion script finally work for masked image modeling models
      
      * Move id2label for datasets to JSON files on the hub
      
      * Make sure id's are read in as integers
      
      * Add integration tests
      
      * Make style & quality
      
      * Fix test, add BEiT to README
      
      * Apply suggestions from @sgugger's review
      
      * Apply suggestions from code review
      
      * Make quality
      
      * Replace nielsr by microsoft in tests, add docs
      
      * Rename BEiT to Beit
      
      * Minor fix
      
      * Fix docs of BeitForMaskedImageModeling
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
    • [Flax] Correctly Add MT5 (#12988) · a317e6c3
      Patrick von Platen authored
      
      
      * finish PR
      
      * finish mt5
      
      * push
      
      * up
      
      * Update tests/test_modeling_flax_mt5.py
      Co-authored-by: Suraj Patil <surajp815@gmail.com>
  28. 26 Jul, 2021 1 commit
  29. 15 Jul, 2021 1 commit
  30. 14 Jul, 2021 1 commit
    • Only test the files impacted by changes in the diff (#12644) · 084873b0
      Sylvain Gugger authored
      
      
      * Base test
      
      * More test
      
      * Fix mistake
      
      * Add a docstring change
      
      * Add doc ignore
      
      * Add changes
      
      * Add recursive dep search
      
      * Add recursive dep search
      
      * save
      
      * Finalize test mapping
      
      * Fix bug
      
      * Print prettier
      
      * Ignore comments and empty lines
      
      * Make script runnable from anywhere
      
      * Need dev install
      
      * Like that
      
      * Adapt
      
      * Add as artifact
      
      * Try on torch tests
      
      * Fix yaml error
      
      * Install GitPython
      
      * Apply everywhere
      
      * Be more defensive
      
      * Revert to all tests if something is wrong
      
      * Install GitPython
      
      * Test if there are tests before launching.
      
      * Fixes
      
      * Fixes
      
      * Fixes
      
      * Fixes
      
      * Bash syntax is horrible
      
      * Be less stupid
      
      * Try differently
      
      * Typo
      
      * Typo
      
      * Typo
      
      * Style
      
      * Better name
      
      * Escape quotes
      
      * Ignore black unhelpful re-formatting
      
      * Not a docstring
      
      * Deal with inits in dependency map
      
      * Run all tests once PR is merged.
      
      * Add last job
      
      * Apply suggestions from code review
      Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
      
      * Stronger dependencies gather
      
      * Ignore empty lines too!
      
      * Clean up
      
      * Fix quality
      Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
  31. 13 Jul, 2021 1 commit
  32. 09 Jul, 2021 1 commit
    • Add TFHubertModel (#12206) · fb65f65e
      Will Rice authored
      * TFHubert
      
      * Update with TFWav2Vec Bug Fixes
      
      * Add OOV Error
      
      * Feedback changes
      
      * Fix kwargs call
  33. 01 Jul, 2021 1 commit