"tests/models/mega/test_modeling_mega.py" did not exist on "88ef8893cd649cc2b4adb9885aba88c750118cff"
  1. 26 May, 2022 1 commit
  2. 23 May, 2022 1 commit
  3. 20 May, 2022 1 commit
  4. 12 May, 2022 2 commits
  5. 10 May, 2022 1 commit
    • Stas Bekman's avatar
      [Deepspeed] add many more models to the model zoo test (#12695) · f8615044
      Stas Bekman authored
      * model zoo take 2
      
      * add deberta
      
      * new param for zero2
      
      * doc update
      
      * doc update
      
      * add layoutlm
      
      * bump deepspeed
      
      * add deberta-v2, funnel, longformer
      
      * new models
      
      * style
      
      * add t5_v1
      
      * update TAPAS status
      
      * reorg problematic models
      
      * move doc to another PR
      
      * style
      
      * fix checkpoint check test
      
      * making progress on more models running
      
      * cleanup
      
      * new version
      
      * cleanup
      f8615044
  6. 09 May, 2022 1 commit
  7. 04 May, 2022 1 commit
  8. 02 May, 2022 2 commits
  9. 29 Apr, 2022 1 commit
  10. 28 Apr, 2022 1 commit
  11. 17 Apr, 2022 1 commit
  12. 15 Apr, 2022 1 commit
  13. 06 Apr, 2022 1 commit
  14. 01 Apr, 2022 1 commit
  15. 28 Mar, 2022 1 commit
  16. 24 Mar, 2022 1 commit
  17. 23 Mar, 2022 1 commit
  18. 18 Mar, 2022 1 commit
  19. 12 Mar, 2022 1 commit
    • Stas Bekman's avatar
      [Deepspeed] add support for bf16 mode (#14569) · 580dd87c
      Stas Bekman authored
      
      
      * [WIP] add support for bf16 mode
      
      * prep for bf16
      
      * prep for bf16
      
      * fix; zero2/bf16 is ok
      
      * check bf16 is available
      
      * test fixes
      
      * enable zero3_bf16
      
      * config files
      
      * docs
      
      * split stage_dtype; merge back to non-dtype-specific config file
      
      * fix doc
      
      * cleanup
      
      * cleanup
      
      * bfloat16 => bf16 to match the PR changes
      
      * s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/
      
      * test fixes/skipping
      
      * move
      
      * fix
      
      * Update docs/source/main_classes/deepspeed.mdx
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * backticks
      
      * cleanup
      
      * cleanup
      
      * cleanup
      
      * new version
      
      * add note about grad accum in bf16
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      580dd87c
  20. 03 Mar, 2022 1 commit
  21. 01 Mar, 2022 1 commit
  22. 18 Feb, 2022 1 commit
  23. 15 Feb, 2022 1 commit
  24. 09 Feb, 2022 1 commit
  25. 28 Jan, 2022 1 commit
  26. 27 Jan, 2022 2 commits
  27. 18 Jan, 2022 1 commit
  28. 17 Jan, 2022 1 commit
  29. 14 Jan, 2022 1 commit
  30. 30 Dec, 2021 1 commit
    • Nicolas Patry's avatar
      Enabling `tokenizers` upgrade. (#14941) · 08cb5718
      Nicolas Patry authored
      * Enabling `tokenizers` upgrade.
      
      * Moved ugly comment.
      
      * Tokenizers==0.11.1 needs an update to keep borrow checker
      
      happy in highly contiguous calls.
      
      * Support both 0.11.1 and 0.11.0
      08cb5718
  31. 22 Dec, 2021 2 commits
  32. 17 Dec, 2021 1 commit
  33. 16 Dec, 2021 1 commit
  34. 15 Dec, 2021 2 commits
  35. 09 Dec, 2021 1 commit