1. 11 Mar, 2024 1 commit
    • Tanay Mehta's avatar
      Add Fill-in-the-middle training objective example - PyTorch (#27464) · 6d67837f
      Tanay Mehta authored
      * add: initial script to train clm fim
      
      * fix: if training model from scratch, new tokens will be added and embeddings resized
      
      * fix: fixed attention_mask errors when generating FIM data
      
      * fix: file formatted using black
      
      * add: run_fim_no_trainer.py and fixed some comments in run_fim.py
      
      * add: added fim examples to the README.md and ran code fixup
      
      * fix: little bug in both fim training scripts
      
      * fix: remove comment from notebook and added a note on fim related params
      
      * fix: minor typo in README
      
      * add: suggested minor changes to README and run_fim.py
      
      * add: gradient_accumulation_steps and gradient_checkpointing args
      
      * add: improved model embedding resizing
      
      * add: pad_to_multiple_of and attn_implementation params
      
      * add: requested minor changes
      
      * add: deepspeed zero compatibility
      
      * add: resize embeddings layer with zero3 support for fim model initialization
      6d67837f
  2. 16 Feb, 2024 1 commit
  3. 22 Mar, 2023 1 commit
  4. 30 Jan, 2023 1 commit
  5. 23 Mar, 2022 1 commit
  6. 10 Feb, 2022 1 commit
  7. 22 Sep, 2021 1 commit
  8. 14 Jun, 2021 1 commit
  9. 25 May, 2021 1 commit
  10. 21 Apr, 2021 1 commit
  11. 06 Apr, 2021 1 commit
  12. 19 Mar, 2021 1 commit
  13. 01 Feb, 2021 1 commit
  14. 22 Dec, 2020 1 commit
  15. 11 Dec, 2020 1 commit
  16. 19 Nov, 2020 1 commit
    • Matthias's avatar
      fix small typo (#8644) · a79a96dd
      Matthias authored
      Fixed a small typo on the XLNet and permutation language modelling section
      a79a96dd
  17. 18 Nov, 2020 1 commit
  18. 04 Nov, 2020 2 commits
  19. 02 Nov, 2020 1 commit
  20. 30 Oct, 2020 1 commit
  21. 29 Oct, 2020 1 commit
  22. 26 Oct, 2020 1 commit
  23. 22 Oct, 2020 1 commit
  24. 29 Jul, 2020 1 commit
  25. 25 May, 2020 1 commit
  26. 07 May, 2020 2 commits