1. 11 Apr, 2023 1 commit
  2. 24 Feb, 2023 1 commit
  3. 14 Feb, 2023 1 commit
  4. 07 Feb, 2023 2 commits
    • Cleanup quality (#21493) · 67d07487
      Sylvain Gugger authored
      * Remove mentions of flake8/isort
      
      * Clean up inits
      
      * Deal with all other inits
      
      * Last special rule for dummy files
      67d07487
    • [CI ] Remove `past` in favor of `past_key_values` (#21443) · 12eb528b
      Arthur authored
      * fix past renamed to past_key_value
      
      * update more `past` that were skipped
      
      * fixup
      
      * remove changes made to rag
      
      * refactor `_reorder_cache` to use `past_key_values`
      
      * fix git `prepare_inputs_for_generation` to pass tests when false is needed in use_cache
      12eb528b
  5. 19 Jan, 2023 1 commit
  6. 09 Jan, 2023 1 commit
  7. 08 Jan, 2023 1 commit
    • Replace `past` with `past_key_values` (#20944) · f0577df6
      Arthur authored
      * start cleanup
      
      * more updates
      
      * more models are affected
      
      * more updates
      
      * update generation utils
      
      * style
      
      * revert change that removed reorder cache
      
      * update generation utils
      
      * style
      
      * style
      
      * remove reorder cache
      f0577df6
  8. 03 Jan, 2023 1 commit
  9. 27 Dec, 2022 1 commit
  10. 08 Dec, 2022 1 commit
  11. 05 Dec, 2022 1 commit
  12. 30 Nov, 2022 1 commit
  13. 28 Nov, 2022 1 commit
    • More TF int dtype fixes (#20384) · de4159a3
      Matt authored
      * Add a test to ensure int dummy inputs are int64
      
      * Move the test into the existing int64 test and update a lot of existing dummies
      
      * Fix remaining dummies
      
      * Fix remaining dummies
      
      * Test for int64 serving sigs as well
      
      * Update core tests to use tf.int64
      
      * Add better messages to the assertions
      
      * Update all serving sigs to int64
      
      * More sneaky hiding tf.int32s
      
      * Add an optional int32 signature in save_pretrained
      
      * make fixup
      
      * Add Amy's suggestions
      
      * Switch all serving sigs back to tf.int32
      
      * Switch all dummies to tf.int32
      
      * Adjust tests to check for tf.int32 instead of tf.int64
      
      * Fix base dummy_inputs dtype
      
      * Start casting to tf.int32 in input_processing
      
      * Change dtype for unpack_inputs test
      
      * Add proper tf.int32 test
      
      * Make the alternate serving signature int64
      de4159a3
  14. 09 Nov, 2022 1 commit
  15. 18 Oct, 2022 1 commit
  16. 11 Oct, 2022 1 commit
    • 🚨🚨🚨 TF: Remove `TFWrappedEmbeddings` (breaking: TF embedding initialization... · 462cd641
      Joao Gante authored
      🚨🚨🚨 TF: Remove `TFWrappedEmbeddings` (breaking: TF embedding initialization updated for encoder-decoder models) (#19263)
      
      * added test
      
      * correct embedding init
      
      * some changes in blenderbot (incomplete)
      
      * update blenderbot (diff to be used as reference)
      
      * update blenderbot_small
      
      * update LED
      
      * update marian
      
      * update T5 and remove TFWrappedEmbeddings
      
      * nullcontext() -> ContextManagers()
      
      * fix embedding init
      462cd641
  17. 22 Sep, 2022 1 commit
  18. 15 Sep, 2022 1 commit
    • Update serving signatures and make sure we actually use them (#19034) · 2322eb8e
      Matt authored
      * Override save() to use the serving signature as the default
      
      * Replace int32 with int64 in all our serving signatures
      
      * Remember one very important line so as not to break every test at once
      
      * Dtype fix for TFLED
      
      * dtype fix for shift_tokens_right in general
      
      * Dtype fixes in mBART and RAG
      
      * Fix dtypes for test_unpack_inputs
      
      * More dtype fixes
      
      * Yet more mBART + RAG dtype fixes
      
      * Yet more mBART + RAG dtype fixes
      
      * Add a check that the model actually has a serving method
      2322eb8e
  19. 14 Sep, 2022 2 commits
  20. 12 Sep, 2022 1 commit
  21. 07 Sep, 2022 1 commit
  22. 06 Aug, 2022 1 commit
  23. 03 Aug, 2022 1 commit
    • Fix torch version comparisons (#18460) · 02b176c4
      LSinev authored
      Comparisons like
      version.parse(torch.__version__) > version.parse("1.6")
      are True for torch==1.6.0+cu101 or torch==1.6.0+cpu.
      
      Comparing against version.parse(version.parse(torch.__version__).base_version) is preferred (and available in pytorch_utils.py).
      02b176c4
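The pitfall described in this commit message can be reproduced without torch installed. The following is a minimal sketch, assuming the `packaging` library is available; the helper name `is_newer_than` and the hard-coded version strings are illustrative and are not taken from the commit itself.

```python
from packaging import version


def is_newer_than(installed: str, minimum: str) -> bool:
    """Strictly-greater check that ignores local build tags such as +cu101.

    Hypothetical helper for illustration only. `base_version` keeps just the
    epoch and release segments, so pre/dev/post/local parts are dropped too.
    """
    base = version.parse(version.parse(installed).base_version)
    return base > version.parse(minimum)


# Naive comparison: a local version label sorts after the plain release,
# so 1.6.0+cu101 is reported as newer than 1.6.
naive = version.parse("1.6.0+cu101") > version.parse("1.6")

# Comparison on the base version: 1.6.0 is not newer than 1.6.
fixed = is_newer_than("1.6.0+cu101", "1.6")

print(naive, fixed)
```

Because `base_version` also discards pre-release and dev segments, a nightly build such as `1.7.0.dev20200801` would be treated as `1.7.0` by this check, which may or may not be what a caller wants.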
  24. 01 Aug, 2022 1 commit
  25. 11 Jul, 2022 1 commit
  26. 01 Jul, 2022 1 commit
    • [Flax] Add remat (gradient checkpointing) (#17843) · 485bbe79
      Sanchit Gandhi authored
      * [Flax] Add remat (gradient checkpointing)
      
      * fix variable naming in test
      
      * flip: checkpoint using a method
      
      * fix naming
      
      * fix class naming
      
      * apply PVP's suggestions from code review
      
      * make fix-copies
      
      * fix big-bird, electra, roberta
      
      * cookie-cutter
      
      * fix flax big-bird
      
      * move test to common
      485bbe79
  27. 29 Jun, 2022 1 commit
  28. 20 Jun, 2022 2 commits
  29. 13 Jun, 2022 1 commit
  30. 07 Jun, 2022 1 commit
    • Add examples telemetry (#17552) · 3cab9027
      Sylvain Gugger authored
      * Add examples telemetry
      
      * Alternative approach
      
      * Add to all other examples
      
      * Add to templates as well
      
      * Put framework separately
      
      * Same for TensorFlow
      3cab9027
  31. 16 May, 2022 1 commit
  32. 12 May, 2022 1 commit
  33. 10 May, 2022 1 commit
  34. 09 May, 2022 1 commit
  35. 03 May, 2022 3 commits
    • Pavel Belevich's avatar
    • Move test model folders (#17034) · 19420fd9
      Yih-Dar authored
      
      
      * move test model folders (TODO: fix imports and others)
      
      * fix (potentially partially) imports (in model test modules)
      
      * fix (potentially partially) imports (in tokenization test modules)
      
      * fix (potentially partially) imports (in feature extraction test modules)
      
      * fix import utils.test_modeling_tf_core
      
      * fix path ../fixtures/
      
      * fix imports about generation.test_generation_flax_utils
      
      * fix more imports
      
      * fix fixture path
      
      * fix get_test_dir
      
      * update module_to_test_file
      
      * fix get_tests_dir from wrong transformers.utils
      
      * update config.yml (CircleCI)
      
      * fix style
      
      * remove missing imports
      
      * update new model script
      
      * update check_repo
      
      * update SPECIAL_MODULE_TO_TEST_MAP
      
      * fix style
      
      * add __init__
      
      * update self-scheduled
      
      * fix add_new_model scripts
      
      * check one way to get location back
      
      * python setup.py build install
      
      * fix import in test auto
      
      * update self-scheduled.yml
      
      * update slack notification script
      
      * Add comments about artifact names
      
      * fix for yolos
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      19420fd9
    • [FlaxBert] Add ForCausalLM (#16995) · cd9274d0
      Sanchit Gandhi authored
      * [FlaxBert] Add ForCausalLM
      
      * make style
      
      * fix output attentions
      
      * Add RobertaForCausalLM
      
      * remove comment
      
      * fix fx-to-pt model loading
      
      * remove comment
      
      * add modeling tests
      
      * add enc-dec model tests
      
      * add big_bird
      
      * add electra
      
      * make style
      
      * make repo-consistency
      
      * add to docs
      
      * remove roberta test
      
      * quality
      
      * amend cookiecutter
      
      * fix attention_mask bug in flax bert model tester
      
      * tighten pt-fx thresholds to 1e-5
      
      * add 'copied from' statements
      
      * amend 'copied from' statements
      
      * amend 'copied from' statements
      
      * quality
      cd9274d0