  1. 23 Mar, 2022 1 commit
  2. 12 Mar, 2022 1 commit
    • [Deepspeed] add support for bf16 mode (#14569) · 580dd87c
      Stas Bekman authored
      
      
      * [WIP] add support for bf16 mode
      
      * prep for bf16
      
      * prep for bf16
      
      * fix; zero2/bf16 is ok
      
      * check bf16 is available
      
      * test fixes
      
      * enable zero3_bf16
      
      * config files
      
      * docs
      
      * split stage_dtype; merge back to non-dtype-specific config file
      
      * fix doc
      
      * cleanup
      
      * cleanup
      
      * bfloat16 => bf16 to match the PR changes
      
      * s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/
      
      * test fixes/skipping
      
      * move
      
      * fix
      
      * Update docs/source/main_classes/deepspeed.mdx
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * backticks
      
      * cleanup
      
      * cleanup
      
      * cleanup
      
      * new version
      
      * add note about grad accum in bf16
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
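The two `s/old/new/` renames listed in the commit message above can be sketched as a small Python helper. This is only an illustration under stated assumptions: the function name `apply_renames` and the sample string are hypothetical, and unlike a plain `sed` `s///` without the `g` flag it replaces every whole-word occurrence, not just the first one per line.

```python
import re

# The (old, new) identifier pairs taken verbatim from the commit message.
RENAMES = [
    ("zero_gather_fp16_weights_on_model_save", "zero_gather_16bit_weights_on_model_save"),
    ("save_fp16_model", "save_16bit_model"),
]


def apply_renames(text: str) -> str:
    """Replace each old identifier with its new name (hypothetical helper)."""
    for old, new in RENAMES:
        # \b keeps the match to whole identifiers, so longer names that
        # merely contain the old one are left alone.
        text = re.sub(rf"\b{re.escape(old)}\b", new, text)
    return text


# Hypothetical input line, used only to show the effect of the renames.
sample = "if zero_gather_fp16_weights_on_model_save(): save_fp16_model(out_dir)"
migrated = apply_renames(sample)
```

Code that still calls the old names could be run through a helper like this once; whether the old names remain available as aliases is not stated in the commit message.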
  3. 09 Feb, 2022 1 commit
  4. 10 Jan, 2022 1 commit
  5. 11 Nov, 2021 1 commit
  6. 22 Oct, 2021 1 commit
  7. 14 Oct, 2021 1 commit
  8. 08 Aug, 2021 1 commit
  9. 30 Jul, 2021 1 commit
    • fix typo in gradient_checkpointing arg (#12855) · 5c673efa
      21jun authored
      help for `ModelArguments.gradient_checkpointing` should be
      "If True, use gradient checkpointing to save memory
      at the expense of slower backward pass."
      not "Whether to freeze the feature extractor layers of the model."
      (which is duplicated from `freeze_feature_extractor` arg)
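The fix described in the commit message above can be pictured with a minimal dataclass sketch. The two help strings are quoted from the commit message; the `metadata={"help": ...}` layout, the default values, and the `HELP` lookup are assumptions made for illustration and are not taken from the patched file.

```python
from dataclasses import dataclass, field, fields


@dataclass
class ModelArguments:
    # Help text that had been duplicated onto gradient_checkpointing by mistake.
    freeze_feature_extractor: bool = field(
        default=True,  # assumed default, for illustration only
        metadata={"help": "Whether to freeze the feature extractor layers of the model."},
    )
    # Corrected help text, as given in the commit message.
    gradient_checkpointing: bool = field(
        default=False,  # assumed default, for illustration only
        metadata={
            "help": "If True, use gradient checkpointing to save memory "
            "at the expense of slower backward pass."
        },
    )


# Map each argument name to its help string to confirm they now differ.
HELP = {f.name: f.metadata["help"] for f in fields(ModelArguments)}
```

A check such as `len(set(HELP.values())) == len(HELP)` would flag this kind of copy-paste duplication between argument descriptions.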
  10. 23 Jul, 2021 1 commit
  11. 15 Jul, 2021 1 commit
  12. 25 Jun, 2021 1 commit
  13. 14 Jun, 2021 1 commit
  14. 09 Jun, 2021 2 commits
  15. 08 Jun, 2021 1 commit
  16. 12 May, 2021 1 commit
  17. 14 Apr, 2021 1 commit
  18. 30 Mar, 2021 1 commit
  19. 22 Mar, 2021 2 commits
  20. 21 Mar, 2021 4 commits
  21. 19 Mar, 2021 3 commits
  22. 18 Mar, 2021 6 commits
  23. 05 Mar, 2021 1 commit
  24. 01 Mar, 2021 1 commit
    • Add Fine-Tuning for Wav2Vec2 (#10145) · 0234de84
      Patrick von Platen authored
      
      
      * add encode labels function to tokenizer
      
      * start adding finetuning
      
      * init dropout
      
      * upload
      
      * correct convert script
      
      * apply changes
      
      * fix second typo
      
      * make first dummy training run
      
      * adapt convert script
      
      * push config for comparison
      
      * remove conf
      
      * finish training
      
      * adapt data collator
      
      * add research folder
      
      * update according to fairseq feedback
      
      * some minor corrections
      
      * refactor masking indices a bit
      
      * some minor changes
      
      * clean tokenizer
      
      * finish clean-up
      
      * remove previous logic
      
      * update run script
      
      * correct training
      
      * finish changes
      
      * finish model
      
      * correct bug
      
      * fix training a bit more
      
      * add some tests
      
      * finish gradient checkpointing
      
      * finish example
      
      * correct gradient checkpointing
      
      * improve tokenization method
      
      * revert changes in tokenizer
      
      * revert general change
      
      * adapt fine-tuning
      
      * update
      
      * save intermediate test
      
      * Update README.md
      
      * finish finetuning
      
      * delete conversion script
      
      * Update src/transformers/models/wav2vec2/configuration_wav2vec2.py
      
      * Update src/transformers/models/wav2vec2/processing_wav2vec2.py
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
      
      * finish wav2vec2 script
      
      * finish wav2vec2 fine-tuning
      
      * finalize test
      
      * correct test
      
      * adapt tests
      
      * finish
      
      * remove test file
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>