1. 22 Nov, 2021 1 commit
  2. 09 Nov, 2021 1 commit
  3. 30 Sep, 2021 1 commit
    • [examples/flax] use Repository API for push_to_hub (#13672) · 7db2a79b
      Suraj Patil authored
      * use Repository for push_to_hub
      
      * update readme
      
      * update other flax scripts
      
      * update readme
      
      * update qa example
      
      * fix push_to_hub call
      
      * fix typo
      
      * fix more typos
      
      * update readme
      
      * use absolute path to get repo name
      
      * fix glue script
  4. 10 Sep, 2021 1 commit
  5. 28 Aug, 2021 1 commit
  6. 27 Aug, 2021 1 commit
  7. 09 Aug, 2021 2 commits
  8. 06 Aug, 2021 1 commit
  9. 30 Jul, 2021 1 commit
  10. 27 Jul, 2021 1 commit
  11. 20 Jul, 2021 2 commits
  12. 14 Jul, 2021 1 commit
  13. 13 Jul, 2021 1 commit
  14. 09 Jul, 2021 1 commit
  15. 08 Jul, 2021 1 commit
  16. 07 Jul, 2021 3 commits
  17. 06 Jul, 2021 1 commit
  18. 05 Jul, 2021 3 commits
  19. 29 Jun, 2021 1 commit
  20. 28 Jun, 2021 2 commits
  21. 25 Jun, 2021 1 commit
  22. 15 Jun, 2021 1 commit
  23. 14 Jun, 2021 3 commits
  24. 11 Jun, 2021 1 commit
    • Flax CLM script (#12023) · 15b498f3
      Suraj Patil authored
      * first draft
      
      * max_seq_length => block_size
      
      * fix arg names
      
      * fix typos
      
      * fix loss calculation
      
      * add max examples, fix train eval steps, metrics
      
      * optimizer mask
      
      * fix perplexity, metric logging
      
      * fix logging
      
      * data_collator => data_loader
      
      * refactor loss_fn
      
      * support single GPU
      
      * pass distributed to write_metric
      
      * fix jitting
      
      * fix single device training
      
      * fix single device metrics
      
      * close inner progress bars once finished
      
      * add overwrite_cache arg
      
      * fix dataset caching issue
      
      * add more logs
      
      * few small fixes
      
      * address Nicholas' suggestions
      
      * fix docstr
      
      * address Patrick's suggestions
      
      * make flake happy
      
      * pass new new_dropout_rng to apply_gradients
      
      * reset train metrics after every epoch
      
      * remove distributed logic, small fixes
  25. 09 Jun, 2021 1 commit
  26. 03 Jun, 2021 1 commit
  27. 24 May, 2021 1 commit
  28. 19 May, 2021 1 commit
  29. 04 May, 2021 1 commit
  30. 23 Apr, 2021 1 commit
  31. 21 Apr, 2021 1 commit