"tests/models/vscode:/vscode.git/clone" did not exist on "56b03c96b865a40811f4eb2942e71aaab4cd38c2"
  1. 09 May, 2021 1 commit
  2. 07 May, 2021 2 commits
  3. 06 May, 2021 1 commit
  4. 05 May, 2021 1 commit
  5. 04 May, 2021 2 commits
    • Sylvain Gugger's avatar
      Reproducible checkpoint (#11582) · 6b241e0e
      Sylvain Gugger authored
      * Set generator in dataloader
      
      * Use generator in all random samplers
      
      * Checkpoint all RNG states
      
      * Final version
      
      * Quality
      
      * Test
      
      * Address review comments
      
      * Quality
      
      * Remove debug util
      
      * Add python and numpy RNGs
      
      * Split states in different files in distributed
      
      * Quality
      
      * local_rank for TPUs
      
      * Only use generator when accepted
      
      * Add test
      
      * Set seed to avoid flakiness
      
      * Make test less flaky
      
      * Quality
      6b241e0e
    • Patrick von Platen's avatar
      [FlaxRoberta] Add FlaxRobertaModels & adapt run_mlm_flax.py (#11470) · 084a187d
      Patrick von Platen authored
      
      
      * add flax roberta
      
      * make style
      
      * correct initialiazation
      
      * modify model to save weights
      
      * fix copied from
      
      * fix copied from
      
      * correct some more code
      
      * add more roberta models
      
      * Apply suggestions from code review
      
      * merge from master
      
      * finish
      
      * finish docs
      Co-authored-by: default avatarPatrick von Platen <patrick@huggingface.co>
      084a187d
  6. 03 May, 2021 1 commit
  7. 30 Apr, 2021 4 commits
  8. 29 Apr, 2021 1 commit
  9. 26 Apr, 2021 4 commits
  10. 23 Apr, 2021 5 commits
  11. 22 Apr, 2021 2 commits
  12. 21 Apr, 2021 3 commits
  13. 20 Apr, 2021 2 commits
    • Sylvain Gugger's avatar
      Update to use datasets remove_cloumns method (#11343) · f1b938fd
      Sylvain Gugger authored
      * Update to use datasets remove_cloumns method
      
      * Quality
      f1b938fd
    • rajvi-k's avatar
      Added translation example script (#11196) · bfd83c17
      rajvi-k authored
      * initial changes
      
      * modified evaluation
      
      * updated evaluation
      
      * updated evaluation on text translation example script
      
      * added translation example script
      
      * Formatted translation example script
      
      * Reformatted translation example
      
      * Fixed evaluation bug and added support for other tokenisers
      
      * Fixed evaluation bug and added support for other tokenisers
      
      * Added translation example script
      
      * Formatted summarization example script
      
      * Removed typos from summarization example script
      bfd83c17
  14. 14 Apr, 2021 2 commits
  15. 13 Apr, 2021 1 commit
  16. 12 Apr, 2021 2 commits
  17. 09 Apr, 2021 3 commits
  18. 08 Apr, 2021 3 commits
    • Stas Bekman's avatar
      [tests] relocate core integration tests (#11146) · 66446909
      Stas Bekman authored
      * relocate core integration tests
      
      * add sys.path context manager
      
      * cleanup
      
      * try
      
      * try2
      
      * fix path
      
      * doc
      
      * style
      
      * add dep
      
      * add 2 more deps
      66446909
    • Andrea Cappelli's avatar
      Run mlm pad to multiple for fp16 (#11128) · 6c40e497
      Andrea Cappelli authored
      * Add mlm collator pad to multiple option (#10627)
      
      * Use padding to 8x in run mlm (#10627)
      6c40e497
    • Stas Bekman's avatar
      [DeepSpeed] ZeRO Stage 3 (#10753) · c6d66484
      Stas Bekman authored
      
      
      * synced gpus
      
      * fix
      
      * fix
      
      * need to use t5-small for quality tests
      
      * notes
      
      * complete merge
      
      * fix a disappearing std stream problem
      
      * start zero3 tests
      
      * wip
      
      * tune params
      
      * sorting out the pre-trained model loading
      
      * reworking generate loop wip
      
      * wip
      
      * style
      
      * fix tests
      
      * split the tests
      
      * refactor tests
      
      * wip
      
      * parameterized
      
      * fix
      
      * workout the resume from non-ds checkpoint pass + test
      
      * cleanup
      
      * remove no longer needed code
      
      * split getter/setter functions
      
      * complete the docs
      
      * suggestions
      
      * gpus and their compute capabilities link
      
      * Apply suggestions from code review
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      
      * style
      
      * remove invalid paramgd
      
      * automatically configure zero3 params that rely on hidden size
      
      * make _get_resized_embeddings zero3-aware
      
      * add test exercising resize_token_embeddings()
      
      * add docstring
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      c6d66484