  1. 09 Sep, 2022 1 commit
  2. 07 Sep, 2022 1 commit
  3. 06 Sep, 2022 1 commit
  4. 01 Sep, 2022 1 commit
  5. 25 Aug, 2022 1 commit
  6. 24 Aug, 2022 3 commits
  7. 22 Aug, 2022 1 commit
  8. 18 Aug, 2022 3 commits
  9. 17 Aug, 2022 1 commit
  10. 16 Aug, 2022 1 commit
    • Update run_translation_no_trainer.py (#18637) · 25e651a2
      zhoutang776 authored
      * Update run_translation_no_trainer.py
      
      Fix an error in selecting the `no_decay` parameters, and make small modifications for when the user resumes training from a checkpoint.
      
      * Fix `no_decay` and `resume_step` issues
      
      1. Change the `no_decay` list.
      2. If users resume training from a provided checkpoint, `resume_step` is not initialized properly when `args.gradient_accumulation_steps != 1`.
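The two points in this commit message can be illustrated with a minimal, framework-free sketch. It is not the code from the commit: the `NO_DECAY` substrings, the helper names, and the assumption that the checkpoint counter records optimizer updates (rather than dataloader batches) are all hypothetical.

```python
from typing import List, Sequence, Tuple

# Hypothetical substrings; the list actually changed by the commit is not shown in the log.
NO_DECAY = ("bias", "LayerNorm.weight", "layer_norm.weight")


def split_decay_groups(
    param_names: Sequence[str], no_decay: Sequence[str] = NO_DECAY
) -> Tuple[List[str], List[str]]:
    """Split parameter names into (weight-decayed, not weight-decayed) by substring match."""
    decayed = [n for n in param_names if not any(nd in n for nd in no_decay)]
    exempt = [n for n in param_names if any(nd in n for nd in no_decay)]
    return decayed, exempt


def resume_position(
    completed_updates: int, gradient_accumulation_steps: int, batches_per_epoch: int
) -> Tuple[int, int]:
    """Convert a saved optimizer-update count into (starting_epoch, batches_to_skip).

    Assumes the checkpoint counts optimizer updates, so each update corresponds to
    `gradient_accumulation_steps` dataloader batches.
    """
    batches_seen = completed_updates * gradient_accumulation_steps
    starting_epoch, resume_step = divmod(batches_seen, batches_per_epoch)
    return starting_epoch, resume_step
```

Under these assumptions, ignoring the accumulation factor would place the resume point too early whenever `gradient_accumulation_steps != 1`, which matches the symptom the message describes.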
  11. 14 Aug, 2022 1 commit
    • Flax Remat for LongT5 (#17994) · d6eeb871
      Karim Foda authored
      
      
      * [Flax] Add remat (gradient checkpointing)
      
      * fix variable naming in test
      
      * flip: checkpoint using a method
      
      * fix naming
      
      * fix class naming
      
      * apply PVP's suggestions from code review
      
      * add gradient_checkpointing to examples
      
      * Add gradient_checkpointing to run_mlm_flax
      
      * Add remat to longt5
      
      * Add gradient checkpointing test longt5
      
      * Fix args errors
      
      * Fix remaining tests
      
      * Make fixup & quality fixes
      
      * replace kwargs
      
      * remove unnecessary kwargs
      
      * Make fixup changes
      
      * revert long_t5_flax changes
      
      * Remove return_dict and copy to LongT5
      
      * Remove test_gradient_checkpointing
      Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
  12. 11 Aug, 2022 2 commits
  13. 10 Aug, 2022 1 commit
    • TF Examples Rewrite (#18451) · 6eb51450
      Matt authored
      
      
      * Finished QA example
      
      * Dodge a merge conflict
      
      * Update text classification and LM examples
      
      * Update NER example
      
      * New Keras metrics WIP, fix NER example
      
      * Update NER example
      
      * Update MC, summarization and translation examples
      
      * Add XLA warnings when shapes are variable
      
      * Make sure batch_size is consistently scaled by num_replicas
      
      * Add PushToHubCallback to all models
      
      * Add docs links for KerasMetricCallback
      
      * Add docs links for prepare_tf_dataset and jit_compile
      
      * Correct inferred model names
      
      * Don't assume the dataset has 'lang'
      
      * Write metrics in text classification
      
      * Add 'framework' to TrainingArguments and TFTrainingArguments
      
      * Export metrics in all examples and add tests
      
      * Fix training args for Flax
      
      * Update command line args for translation test
      
      * make fixup
      
      * Fix accidentally running other tests in fp16
      
      * Remove do_train/do_eval from run_clm.py
      
      * Remove do_train/do_eval from run_mlm.py
      
      * Add tensorflow tests to circleci
      
      * Fix circleci
      
      * Update examples/tensorflow/language-modeling/run_mlm.py
      Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update examples/tensorflow/test_tensorflow_examples.py
      Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update examples/tensorflow/translation/run_translation.py
      Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
      
      * Update examples/tensorflow/token-classification/run_ner.py
      Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
      
      * Fix save path for tests
      
      * Fix some model card kwargs
      
      * Explain the magical -1000
      
      * Actually enable tests this time
      
      * Skip text classification PR until we fix shape inference
      
      * make fixup
      Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
  14. 08 Aug, 2022 3 commits
    • Update no_trainer.py scripts to include accelerate gradient accumulation wrapper (#18473) · a765b68a
      Rasmus Arpe Fogh Jensen authored
      * Added accelerate gradient accumulation wrapper to run_image_classification_no_trainer.py example script
      
      * make fixup changes
      
      * PR comments
      
      * changed input to Accelerator based on PR comment, ran make fixup
      
      * Added comment explaining the sync_gradients statement
      
      * Fixed lr scheduler max steps
      
      * Changed run_clm_no_trainer.py script to use accelerate gradient accum wrapper
      
      * Fixed all scripts except wav2vec2 pretraining to use accelerate gradient accum wrapper
      
      * Added accelerate gradient accum wrapper for wav2vec2_pretraining_no_trainer.py script
      
      * make fixup and lr_scheduler step inserted back into run_qa_beam_search_no_trainer.py
      
      * removed changes to run_wav2vec2_pretraining_no_trainer.py script and fixed using wrong constant in qa_beam_search_no_trainer.py script
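As a rough illustration of the bookkeeping that a gradient accumulation wrapper takes over, the sketch below computes on which batches gradients would be synchronized and the optimizer stepped. It is plain Python and does not call Accelerate. In Accelerate itself this is typically expressed with `Accelerator(gradient_accumulation_steps=...)`, the `accelerator.accumulate(model)` context manager, and the `accelerator.sync_gradients` flag. The function names here are hypothetical, and stepping on the final partial group of an epoch is an assumption of this sketch.

```python
import math
from typing import List


def sync_schedule(num_batches: int, accumulation_steps: int) -> List[bool]:
    """For each batch index, report whether the optimizer would step after it."""
    flags = []
    for i in range(num_batches):
        at_boundary = (i + 1) % accumulation_steps == 0  # a full group has accumulated
        at_end = i == num_batches - 1  # assumption: flush a partial group at epoch end
        flags.append(at_boundary or at_end)
    return flags


def updates_per_epoch(num_batches: int, accumulation_steps: int) -> int:
    """Number of optimizer updates per epoch, useful when sizing an LR schedule."""
    return math.ceil(num_batches / accumulation_steps)
```

The update count matters for the learning-rate scheduler: a schedule sized in dataloader batches rather than optimizer updates (or the reverse) ends at the wrong point, which is the kind of mismatch the "Fixed lr scheduler max steps" item refers to.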
    • Fix compatibility with 1.12 (#17925) · 70b0d4e1
      Sylvain Gugger authored
      
      
      * Fix compatibility with 1.12
      
      * Remove pin from examples requirements
      
      * Update torch scatter version
      
      * fix torch.onnx.symbolic_opset12 import
      
      * Reject bad version
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
    • regisss's avatar
      88a0ce57
  15. 06 Aug, 2022 2 commits
  16. 04 Aug, 2022 2 commits
  17. 03 Aug, 2022 2 commits
  18. 02 Aug, 2022 1 commit
  19. 01 Aug, 2022 6 commits
  20. 29 Jul, 2022 1 commit
  21. 28 Jul, 2022 3 commits
    • Migrate metrics used in flax examples to Evaluate (#18348) · da503ea0
      Vijay S Kalmath authored
      Currently, the Flax examples use the `load_metric` function from the
      Datasets library; this commit migrates the call to the `load` function
      from the Evaluate library.
    • Migrate metric to Evaluate library for tensorflow examples (#18327) · a2586795
      Vijay S Kalmath authored
      * Migrate metric to Evaluate library in tf examples
      
      Currently, the TensorFlow examples use the `load_metric` function from the
      Datasets library; this commit migrates the call to the `load` function
      from the Evaluate library.
      
      Fix for #18306
      
      * Migrate `metric` to Evaluate for all tf examples
      
      Currently, the TensorFlow examples use the `load_metric` function from the
      Datasets library; this commit migrates the call to the `load` function
      from the Evaluate library.
    • Fix codeparrot deduplication - ignore whitespaces (#18023) · 286a18fa
      Loubna Ben Allal authored
      * ignore whitespace for hash
      
      * reformat code
      
      * Update README.md
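The idea of ignoring whitespace when hashing files for exact deduplication can be sketched with the standard library alone. This is an illustration under assumptions, not the script changed by the commit: the function names, the choice of MD5, and the keep-first policy are all hypothetical.

```python
import hashlib
import re
from typing import Iterable, List

# Matches any run of whitespace (spaces, tabs, newlines).
WHITESPACE = re.compile(r"\s+")


def content_hash(text: str) -> str:
    """Hash a file's content after removing all whitespace."""
    stripped = WHITESPACE.sub("", text)
    return hashlib.md5(stripped.encode("utf-8")).hexdigest()


def deduplicate(files: Iterable[str]) -> List[str]:
    """Keep the first file seen for each whitespace-insensitive hash."""
    seen = set()
    unique = []
    for content in files:
        digest = content_hash(content)
        if digest not in seen:
            seen.add(digest)
            unique.append(content)
    return unique
```

With this scheme, two files that differ only in indentation or line breaks collapse to one entry, at the cost of also merging the rare pair whose meaning depends on whitespace.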
  22. 27 Jul, 2022 2 commits
    • Dev version · c89a592e
      Lysandre authored
    • [Flax] Fix incomplete batches in example scripts (#17863) · 7490a97c
      Sanchit Gandhi authored
      * [Flax] Fix incomplete batches in example scripts
      
      * fix dataloader batching
      
      * convert jnp batch idxs to np array
      
      * add missing `pad_shard_unpad` to final prediction generate step
      
      * only `pad_shard_unpad` at inference time
      
      * merge conflicts
      
      * remove incomplete batch step from eval
      
      * fix run_qa.py
      
      * add `pad_shard_unpad` to run_flax_ner.py
      
      * add `pad_shard_unpad` to run_flax_glue.py
      
      * add `pad_shard_unpad` to run_image_classification.py
      
      * make style
      
      * fix mlm flax eval batches
      
      * remove redundant imports
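The pad, shard, and unpad pattern named in this commit can be pictured with a pure-Python sketch. It does not call the Flax helper of that name. It only mimics the idea on lists: pad an incomplete batch to a multiple of the device count, split it into equal per-device shards, apply the function, and drop the padded outputs. The wrapper name and the choice to pad by repeating the last element are assumptions made for illustration.

```python
from typing import Callable, List, Sequence


def pad_shard_unpad_sketch(
    fn: Callable[[List[int]], List[int]], num_devices: int
) -> Callable[[Sequence[int]], List[int]]:
    """Wrap `fn` so it can process batches not divisible by `num_devices`."""

    def wrapped(batch: Sequence[int]) -> List[int]:
        n = len(batch)
        if n == 0:
            return []
        per_device = -(-n // num_devices)  # ceiling division
        pad = per_device * num_devices - n
        # Hypothetical padding choice: repeat the last element.
        padded = list(batch) + [batch[-1]] * pad
        shards = [
            padded[d * per_device : (d + 1) * per_device] for d in range(num_devices)
        ]
        outputs = [fn(shard) for shard in shards]  # stands in for per-device execution
        flat = [y for out in outputs for y in out]
        return flat[:n]  # discard results computed on padding

    return wrapped
```

Because padding is stripped from the outputs, every real example is evaluated exactly once, which is why the pattern suits inference and evaluation, where dropping the final incomplete batch would skew the metrics.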