1. 01 May, 2024 10 commits
  2. 30 Apr, 2024 1 commit
    • Anton Vlasjuk's avatar
      Fix seq2seq collator padding (#30556) · 9112520b
      Anton Vlasjuk authored
      * fix seq2seq data collator to respect the given padding strategy
      
      further added tests for the seq2seq data collator in the style of the `data_collator_for_token_classification` (pt, tf, np)
      
      * formatting and change bool equals "==" to "is"
      
      * add missed return types in tests
      
      * update numpy test as it can handle unequal shapes, not like pt or tf
      9112520b
  3. 26 Apr, 2024 1 commit
  4. 24 Apr, 2024 1 commit
  5. 18 Apr, 2024 2 commits
  6. 17 Apr, 2024 2 commits
  7. 15 Apr, 2024 2 commits
  8. 10 Apr, 2024 1 commit
    • Pavel Iakubovskii's avatar
      Fix and simplify semantic-segmentation example (#30145) · 56d001b2
      Pavel Iakubovskii authored
      * Remove unused augmentation
      
      * Fix pad_if_smaller() and remove unused augmentation
      
      * Add indentation
      
      * Fix requirements
      
      * Update dataset use instructions
      
      * Replace transforms with albumentations
      
      * Replace identity transform with None
      
      * Fixing formatting
      
      * Fixed comment place
      56d001b2
  9. 09 Apr, 2024 1 commit
  10. 08 Apr, 2024 2 commits
  11. 05 Apr, 2024 1 commit
  12. 02 Apr, 2024 1 commit
  13. 30 Mar, 2024 1 commit
  14. 25 Mar, 2024 1 commit
  15. 21 Mar, 2024 1 commit
  16. 20 Mar, 2024 1 commit
  17. 15 Mar, 2024 2 commits
  18. 12 Mar, 2024 3 commits
  19. 11 Mar, 2024 2 commits
    • Yitong Huang's avatar
      Make torch xla available on GPU (#29334) · 873d9bb3
      Yitong Huang authored
      
      
      * add USE_TORCH_XLA env
      
      * rename torch_tpu to torch_xla
      
      * better is_torch_xla_available; fix some fsdp and performance issues
      
      * fix format
      
      * fix bug when pjrt_device is cpu
      
      * fix bug
      
      * fix the deprecation handling
      
      ---------
      Co-authored-by: default avataranw90 <ang868@gmail.com>
      Co-authored-by: default avatarwangang.wa <wangang.wa@alibaba-inc.com>
      873d9bb3
    • Tanay Mehta's avatar
      Add Fill-in-the-middle training objective example - PyTorch (#27464) · 6d67837f
      Tanay Mehta authored
      * add: initial script to train clm fim
      
      * fix: if training model from scratch, new tokens will be added and embeddings resized
      
      * fix: fixed attention_mask errors when generating FIM data
      
      * fix: file formatted using black
      
      * add: run_fim_no_trainer.py and fixed some comments in run_fim.py
      
      * add: added fim examples to the README.md and ran code fixup
      
      * fix: little bug in both fim training scripts
      
      * fix: remove comment from notebook and added a note on fim related params
      
      * fix: minor typo in README
      
      * add: suggested minor changes to README and run_fim.py
      
      * add: gradient_accumulation_steps and gradient_checkpointing args
      
      * add: improved model embedding resizing
      
      * add: pad_to_multiple_of and attn_implementation params
      
      * add: requested minor changes
      
      * add: deepspeed zero compatibility
      
      * add: resize embeddings layer with zero3 support for fim model initialization
      6d67837f
  20. 21 Feb, 2024 1 commit
  21. 19 Feb, 2024 2 commits
  22. 16 Feb, 2024 1 commit