1. 30 Nov, 2022 1 commit
  2. 28 Nov, 2022 1 commit
    • Matt's avatar
      More TF int dtype fixes (#20384) · de4159a3
      Matt authored
      * Add a test to ensure int dummy inputs are int64
      
      * Move the test into the existing int64 test and update a lot of existing dummies
      
      * Fix remaining dummies
      
      * Fix remaining dummies
      
      * Test for int64 serving sigs as well
      
      * Update core tests to use tf.int64
      
      * Add better messages to the assertions
      
      * Update all serving sigs to int64
      
      * More sneaky hiding tf.int32s
      
      * Add an optional int32 signature in save_pretrained
      
      * make fixup
      
      * Add Amy's suggestions
      
      * Switch all serving sigs back to tf.int32
      
      * Switch all dummies to tf.int32
      
      * Adjust tests to check for tf.int32 instead of tf.int64
      
      * Fix base dummy_inputs dtype
      
      * Start casting to tf.int32 in input_processing
      
      * Change dtype for unpack_inputs test
      
      * Add proper tf.int32 test
      
      * Make the alternate serving signature int64
      de4159a3
  3. 09 Nov, 2022 1 commit
  4. 18 Oct, 2022 1 commit
  5. 11 Oct, 2022 1 commit
    • Joao Gante's avatar
      馃毃馃毃馃毃 TF: Remove `TFWrappedEmbeddings` (breaking: TF embedding initialization... · 462cd641
      Joao Gante authored
      馃毃馃毃馃毃  TF: Remove `TFWrappedEmbeddings` (breaking: TF embedding initialization updated for encoder-decoder models) (#19263)
      
      * added test
      
      * correct embedding init
      
      * some changes in blenderbot (incomplete)
      
      * update blenderbot (diff to be used as reference)
      
      * update blenderbot_small
      
      * update LED
      
      * update marian
      
      * update T5 and remove TFWrappedEmbeddings
      
      * nullcontext() -> ContextManagers()
      
      * fix embedding init
      462cd641
  6. 22 Sep, 2022 1 commit
  7. 15 Sep, 2022 1 commit
    • Matt's avatar
      Update serving signatures and make sure we actually use them (#19034) · 2322eb8e
      Matt authored
      * Override save() to use the serving signature as the default
      
      * Replace int32 with int64 in all our serving signatures
      
      * Remember one very important line so as not to break every test at once
      
      * Dtype fix for TFLED
      
      * dtype fix for shift_tokens_right in general
      
      * Dtype fixes in mBART and RAG
      
      * Fix dtypes for test_unpack_inputs
      
      * More dtype fixes
      
      * Yet more mBART + RAG dtype fixes
      
      * Yet more mBART + RAG dtype fixes
      
      * Add a check that the model actually has a serving method
      2322eb8e
  8. 14 Sep, 2022 2 commits
  9. 12 Sep, 2022 1 commit
  10. 07 Sep, 2022 1 commit
  11. 06 Aug, 2022 1 commit
  12. 03 Aug, 2022 1 commit
    • LSinev's avatar
      Fix torch version comparisons (#18460) · 02b176c4
      LSinev authored
      Comparisons like
      version.parse(torch.__version__) > version.parse("1.6")
      are True for torch==1.6.0+cu101 or torch==1.6.0+cpu
      
      version.parse(version.parse(torch.__version__).base_version) are preferred (and available in pytorch_utils.py
      02b176c4
  13. 01 Aug, 2022 1 commit
  14. 11 Jul, 2022 1 commit
  15. 01 Jul, 2022 1 commit
    • Sanchit Gandhi's avatar
      [Flax] Add remat (gradient checkpointing) (#17843) · 485bbe79
      Sanchit Gandhi authored
      * [Flax] Add remat (gradient checkpointing)
      
      * fix variable naming in test
      
      * flip: checkpoint using a method
      
      * fix naming
      
      * fix class naming
      
      * apply PVP's suggestions from code review
      
      * make fix-copies
      
      * fix big-bird, electra, roberta
      
      * cookie-cutter
      
      * fix flax big-bird
      
      * move test to common
      485bbe79
  16. 29 Jun, 2022 1 commit
  17. 20 Jun, 2022 2 commits
  18. 13 Jun, 2022 1 commit
  19. 07 Jun, 2022 1 commit
    • Sylvain Gugger's avatar
      Add examples telemetry (#17552) · 3cab9027
      Sylvain Gugger authored
      * Add examples telemetry
      
      * Alternative approach
      
      * Add to all other examples
      
      * Add to templates as well
      
      * Put framework separately
      
      * Same for TensorFlow
      3cab9027
  20. 16 May, 2022 1 commit
  21. 12 May, 2022 1 commit
  22. 10 May, 2022 1 commit
  23. 09 May, 2022 1 commit
  24. 03 May, 2022 3 commits
    • Pavel Belevich's avatar
    • Yih-Dar's avatar
      Move test model folders (#17034) · 19420fd9
      Yih-Dar authored
      
      
      * move test model folders (TODO: fix imports and others)
      
      * fix (potentially partially) imports (in model test modules)
      
      * fix (potentially partially) imports (in tokenization test modules)
      
      * fix (potentially partially) imports (in feature extraction test modules)
      
      * fix import utils.test_modeling_tf_core
      
      * fix path ../fixtures/
      
      * fix imports about generation.test_generation_flax_utils
      
      * fix more imports
      
      * fix fixture path
      
      * fix get_test_dir
      
      * update module_to_test_file
      
      * fix get_tests_dir from wrong transformers.utils
      
      * update config.yml (CircleCI)
      
      * fix style
      
      * remove missing imports
      
      * update new model script
      
      * update check_repo
      
      * update SPECIAL_MODULE_TO_TEST_MAP
      
      * fix style
      
      * add __init__
      
      * update self-scheduled
      
      * fix add_new_model scripts
      
      * check one way to get location back
      
      * python setup.py build install
      
      * fix import in test auto
      
      * update self-scheduled.yml
      
      * update slack notification script
      
      * Add comments about artifact names
      
      * fix for yolos
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      19420fd9
    • Sanchit Gandhi's avatar
      [FlaxBert] Add ForCausalLM (#16995) · cd9274d0
      Sanchit Gandhi authored
      * [FlaxBert] Add ForCausalLM
      
      * make style
      
      * fix output attentions
      
      * Add RobertaForCausalLM
      
      * remove comment
      
      * fix fx-to-pt model loading
      
      * remove comment
      
      * add modeling tests
      
      * add enc-dec model tests
      
      * add big_bird
      
      * add electra
      
      * make style
      
      * make repo-consitency
      
      * add to docs
      
      * remove roberta test
      
      * quality
      
      * amend cookiecutter
      
      * fix attention_mask bug in flax bert model tester
      
      * tighten pt-fx thresholds to 1e-5
      
      * add 'copied from' statements
      
      * amend 'copied from' statements
      
      * amend 'copied from' statements
      
      * quality
      cd9274d0
  25. 02 May, 2022 1 commit
  26. 25 Apr, 2022 1 commit
  27. 19 Apr, 2022 1 commit
    • Suraj Patil's avatar
      [Flax] improve large model init and loading (#16148) · d3bd9ac7
      Suraj Patil authored
      
      
      * begin do_init
      
      * add params_shape_tree
      
      * raise error if params are accessed when do_init is False
      
      * don't allow do_init=False when keys are missing
      
      * make shape tree a property
      
      * assign self._params at the end
      
      * add test for do_init
      
      * add do_init arg to all flax models
      
      * fix param setting
      
      * disbale do_init for composite models
      
      * update test
      
      * add do_init in FlaxBigBirdForMultipleChoice
      
      * better names and errors
      
      * improve test
      
      * style
      
      * add a warning when do_init=False
      
      * remove extra if
      
      * set params after _required_params
      
      * add test for from_pretrained
      
      * do_init => _do_init
      
      * chage warning to info
      
      * fix typo
      
      * add params in init_weights
      
      * add params to gpt neo init
      
      * add params to init_weights
      
      * update do_init test
      
      * Trigger CI
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * update template
      
      * trigger CI
      
      * style
      
      * style
      
      * fix template
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      d3bd9ac7
  28. 12 Apr, 2022 1 commit
    • Anmol Joshi's avatar
      Moved functions to pytorch_utils.py (#16625) · a315988b
      Anmol Joshi authored
      * Moved functions to pytorch_utils.py
      
      * isort formatting
      
      * Reverted tf changes
      
      * isort, make fix-copies
      
      * documentation fix
      
      * Fixed Conv1D import
      
      * Reverted research examples file
      
      * backward compatibility for pytorch_utils
      
      * missing import
      
      * isort fix
      a315988b
  29. 05 Apr, 2022 2 commits
    • Matt's avatar
      Adding new train_step logic to make things less confusing for users (#15994) · 43540052
      Matt authored
      
      
      * Adding new train_step logic to make things less confusing for users
      
      * DO NOT ASK WHY WE NEED THAT SUBCLASS
      
      * Metrics now working, at least for single-output models with type annotations!
      
      * Updates and TODOs for the new train_step
      
      * Make fixup
      
      * Temporary test workaround until T5 has types
      
      * Temporary test workaround until T5 has types
      
      * I think this actually works! Needs a lot of tests though
      
      * MAke style/quality
      
      * Revert changes to T5 tests
      
      * Deleting the aforementioned unmentionable subclass
      
      * Deleting the aforementioned unmentionable subclass
      
      * Adding a Keras API test
      
      * Style fixes
      
      * Removing unneeded TODO and comments
      
      * Update test_step too
      
      * Stop trying to compute metrics with the dummy_loss, patch up test
      
      * Make style
      
      * make fixup
      
      * Docstring cleanup
      
      * make fixup
      
      * make fixup
      
      * Stop expanding 1D input tensors when using dummy loss
      
      * Adjust T5 test given the new compile()
      
      * make fixup
      
      * Skipping test for convnext
      
      * Removing old T5-specific Keras test now that we have a common one
      
      * make fixup
      
      * make fixup
      
      * Only skip convnext test on CPU
      
      * Update src/transformers/modeling_tf_utils.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/modeling_tf_utils.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Avoiding TF import issues
      
      * make fixup
      
      * Update compile() to support TF 2.3
      
      * Skipping model.fit() on template classes for now
      
      * Skipping model.fit() on template class tests for now
      
      * Replace ad-hoc solution with find_labels
      
      * make fixup
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      43540052
    • SaulLu's avatar
      add a template to add missing tokenization test (#16553) · 02214cb3
      SaulLu authored
      
      
      * add a template to add missing tokenization test
      
      * add cookiecutter setting
      
      * improve doc
      
      * Update templates/adding_a_missing_tokenization_test/README.md
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      02214cb3
  30. 04 Apr, 2022 1 commit
  31. 01 Apr, 2022 1 commit
  32. 30 Mar, 2022 1 commit
  33. 25 Mar, 2022 1 commit
  34. 23 Mar, 2022 2 commits