1. 30 Jan, 2024 1 commit
  2. 29 Jan, 2024 13 commits
  3. 28 Jan, 2024 1 commit
  4. 27 Jan, 2024 2 commits
  5. 26 Jan, 2024 12 commits
  6. 25 Jan, 2024 6 commits
  7. 24 Jan, 2024 5 commits
    • [docs] Fix doc format (#28684) · f40b87de
      Steven Liu authored
      * fix hfoptions
      
      * revert changes to other files
      
      * fix
    • improve efficient training on CPU documentation (#28646) · 8278b153
      Fanli Lin authored
      * update doc
      
      * revert
      
      * typo fix
      
      * refine
      
      * add dtypes
      
      * Update docs/source/en/perf_train_cpu.md
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * Update docs/source/en/perf_train_cpu.md
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * Update docs/source/en/perf_train_cpu.md
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * no comma
      
      * use avx512-vnni
      
      ---------
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
    • Improved type hinting for all attention parameters (#28479) · 5d29530e
      nakranivaibhav authored
      * Changed type hinting for all attention inputs to 'Optional[Tuple[torch.FloatTensor,...]] = None'
      
      * Fixed the ruff formatting issue
      
      * fixed type hinting for all hidden_states to 'Optional[Tuple[torch.FloatTensor, ...]] = None'
      
      * Changed type hinting in these 12 scripts modeling_dpr.py,modeling_nat.py,idefics/vision.py,modeling_tf_dpr.py,modeling_luke.py,modeling_swin.py,modeling_tf_swin.py,modeling_blip.py,modeling_tf_blip.py,modeling_donut_swin.py,modeling_dinat.py,modeling_swinv2.py
      
      * test fail update
      
      * fixed type hinting for these 15 scripts modeling_xlnet.py,modeling_tf_xlnet.py,modeling_led.py,modeling_tf_led.py,modeling_rwkv.py,modeling_dpt.py,modeling_tf_cvt.py,modeling_clip.py,modeling_flax_clip.py,modeling_tf_clip.py,modeling_longformer.py,modeling_tf_longformer.py,modeling_siglip.py,modeling_clap.py,modeling_git.py
      
      * Changed type hinting in these 12 scripts modeling_dpr.py,modeling_nat.py,idefics/vision.py,modeling_tf_dpr.py,modeling_luke.py,modeling_swin.py,modeling_tf_swin.py,modeling_blip.py,modeling_tf_blip.py,modeling_donut_swin.py,modeling_dinat.py,modeling_swinv2.py
      
      * test fail update
      
      * Removed the myvenv file
      
      * Fixed type hinting for these 8 scripts modeling_tvlt.py,modeling_sam.py,modeling_tf_sam.py,modeling_tvp.py,modeling_rag.py,modeling_tf_rag.py,modeling_tf_xlm.py,modeling_xlm.py
    • [docs] DeepSpeed (#28542) · 738ec75c
      Steven Liu authored
      * config
      
      * optim
      
      * pre deploy
      
      * deploy
      
      * save weights, memory, troubleshoot, non-Trainer
      
      * done
    • Add back in generation types (#28681) · bb6aa8bc
      amyeroberts authored