1. 23 May, 2022 1 commit
  2. 16 May, 2022 2 commits
  3. 12 May, 2022 1 commit
  4. 04 May, 2022 1 commit
  5. 21 Apr, 2022 1 commit
  6. 11 Apr, 2022 1 commit
    • Jia LI's avatar
      Jia multi gpu eval (#16428) · 4868a830
      Jia LI authored
      
      
      * add simple multi gpu complet
      
      * add human_eval_multi_gpu
      
      * use copy strategy to distribute across gpu, to avoid padding
      
      * add doc string
      
      * update code style
      
      * use task id to arrange output
      
      * truncate input to avoid zero pad
      
      * Stop the copy mechanism
      
      * update style
      
      * restore copies to scale better in distributed mode
      
      * update style
      
      * replace human eval
      
      * Apply suggestions from code review
      
      1. Tokenize all input at the same time
      2. use attention_mask to get the input length
      3. other small fixes
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      
      * correct typo and update docstring
      
      * update code style
      
      * remove num sample division constraint
      
      * remove max len calculation
      
      * use accelerator.gather once to speed up
      
      * use accelerate set_seed; update accelerate version
      
      * correct gather bug
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      4868a830
  7. 12 Jan, 2022 1 commit
  8. 23 Dec, 2021 1 commit
  9. 13 Dec, 2021 1 commit
  10. 02 Dec, 2021 1 commit
    • Leandro von Werra's avatar
      Add CodeParrot 馃 codebase (#14536) · 43f953cc
      Leandro von Werra authored
      
      
      * add readme skeleton
      
      * update readme
      
      * add initialization script
      
      * add deduplication script
      
      * add codeparrot training script
      
      * add code generation evaluation
      
      * add validation loss script
      
      * add requirements
      
      * update readme
      
      * tweak readme
      
      * make style
      
      * add highlights to readme
      
      * add CLIs to scripts
      
      * add tokenizer training script
      
      * add docstring to constant length dataset
      
      * fix defaults in arguments
      
      * update readme with cli
      
      * move image to hub
      
      * tweaks of readme
      
      * fix cli commands
      
      * add author
      
      * explain env variables
      
      * fix formatting
      
      * Update examples/research_projects/codeparrot/README.md
      Co-authored-by: default avatarlewtun <lewis.c.tunstall@gmail.com>
      
      * Apply suggestions from code review
      Co-authored-by: default avatarlewtun <lewis.c.tunstall@gmail.com>
      
      * replace generic with gpt2 tokenizer
      Co-authored-by: default avatarlewtun <lewis.c.tunstall@gmail.com>
      43f953cc