1. 11 Oct, 2021 4 commits
  2. 08 Oct, 2021 3 commits
  3. 07 Oct, 2021 2 commits
  4. 06 Oct, 2021 2 commits
  5. 05 Oct, 2021 5 commits
  6. 04 Oct, 2021 2 commits
    • Bram Vanroy's avatar
      Update no_* argument (HfArgumentParser) (#13865) · 12b4d66a
      Bram Vanroy authored
      * update no_* argument
      
      Changes the order so that the no_* argument is created after the original argument AND sets the default for this no_* argument to False
      
      * import copy
      
      * update test
      
      * make style
      
      * Use kwargs to set default=False
      
      * make style
      12b4d66a
    • Sidd Karamcheti's avatar
      Add Mistral GPT-2 Stability Tweaks (#13573) · 3a8de58c
      Sidd Karamcheti authored
      
      
      * Add layer-wise scaling
      
      * Add reorder & upcasting argument
      
      * Add OpenAI GPT-2 weight initialization scheme
      
      * start `layer_idx` count at zero for consistency
      
      * disentangle attn and reordered and upscaled attn function
      
      * rename `scale_attn_by_layer` to `scale_attn_by_layer_id`
      
      * make autocast from amp compatible with pytorch<1.6
      
      * fix docstring
      
      * style fixes
      
      * Add fixes from PR feedback, style tweaks
      
      * Fix doc whitespace
      
      * Reformat
      
      * First pass scale_attn_by_layer_idx and reorder_and_upcast_attn tests
      
      * Rename scale_attn_by_layer_idx, add tip
      
      * Remove extra newline
      
      * add test for weight initialization
      
      * update code format
      
      * add assert check weights are fp32
      
      * remove assert
      
      * Fix incorrect merge
      
      * Fix shape mismatch in baddbmm
      
      * Add generation test for Mistral flags
      Co-authored-by: default avatarleandro <leandro.vonwerra@spoud.io>
      Co-authored-by: default avatarKeshav Santhanam <keshav2@stanford.edu>
      Co-authored-by: default avatarJ38 <jebolton@stanford.edu>
      3a8de58c
  7. 30 Sep, 2021 2 commits
  8. 29 Sep, 2021 2 commits
  9. 26 Sep, 2021 1 commit
  10. 25 Sep, 2021 1 commit
  11. 24 Sep, 2021 2 commits
  12. 23 Sep, 2021 1 commit
  13. 22 Sep, 2021 4 commits
  14. 21 Sep, 2021 8 commits
  15. 20 Sep, 2021 1 commit