1. 15 Mar, 2023 2 commits
    • Kashif Rasul's avatar
      T5Attention support for cross-attention (#2654) · cf4227cd
      Kashif Rasul authored
      
      
      * fix AttnProcessor2_0
      
      Fix use of AttnProcessor2_0 for cross attention with mask
      
      * added scale_qk and out_bias flags
      
      * fixed for xformers
      
      * check if it has scale argument
      
      * Update cross_attention.py
      
      * check torch version
      
      * fix sliced attn
      
      * style
      
      * set scale
      
      * fix test
      
      * fixed addedKV processor
      
      * revert back AttnProcessor2_0
      
      * if missing if
      
      * fix inner_dim
      
      ---------
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      cf4227cd
    • Henrik Forstén's avatar
      Controlnet training (#2545) · 79eb3d07
      Henrik Forstén authored
      * Controlnet training code initial commit
      
      Works with circle dataset: https://github.com/lllyasviel/ControlNet/blob/main/docs/train.md
      
      
      
      * Script for adding a controlnet to existing model
      
      * Fix control image transform
      
      Control image should be in 0..1 range.
      
      * Add license header and remove more unused configs
      
      * controlnet training readme
      
      * Allow nonlocal model in add_controlnet.py
      
      * Formatting
      
      * Remove unused code
      
      * Code quality
      
      * Initialize controlnet in training script
      
      * Formatting
      
      * Address review comments
      
      * doc style
      
      * explicit constructor args and submodule names
      
      * hub dataset
      
      NOTE -  not tested
      
      * empty prompts
      
      * add conditioning image
      
      * rename
      
      * remove instance data dir
      
      * image_transforms -> -1,1 . conditioning_image_transformers -> 0, 1
      
      * nits
      
      * remove local rank config
      
      I think this isn't necessary in any of our training scripts
      
      * validation images
      
      * proportion_empty_prompts typo
      
      * weight copying to controlnet bug
      
      * call log validation fix
      
      * fix
      
      * gitignore wandb
      
      * fix progress bar and resume from checkpoint iteration
      
      * initial step fix
      
      * log multiple images
      
      * fix
      
      * fixes
      
      * tracker project name configurable
      
      * misc
      
      * add controlnet requirements.txt
      
      * update docs
      
      * image labels
      
      * small fixes
      
      * log validation using existing models for pipeline
      
      * fix for deepspeed saving
      
      * memory usage docs
      
      * Update examples/controlnet/train_controlnet.py
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      
      * Update examples/controlnet/train_controlnet.py
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      
      * Update examples/controlnet/README.md
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      
      * Update examples/controlnet/README.md
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      
      * Update examples/controlnet/README.md
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      
      * Update examples/controlnet/README.md
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      
      * Update examples/controlnet/README.md
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      
      * Update examples/controlnet/README.md
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      
      * Update examples/controlnet/README.md
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      
      * Update examples/controlnet/README.md
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      
      * remove extra is main process check
      
      * link to dataset in intro paragraph
      
      * remove unnecessary paragraph
      
      * note on deepspeed
      
      * Update examples/controlnet/README.md
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * assert -> value error
      
      * weights and biases note
      
      * move images out of git
      
      * remove .gitignore
      
      ---------
      Co-authored-by: default avatarWilliam Berman <WLBberman@gmail.com>
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      79eb3d07
  2. 14 Mar, 2023 5 commits
  3. 13 Mar, 2023 3 commits
  4. 10 Mar, 2023 3 commits
  5. 09 Mar, 2023 8 commits
  6. 08 Mar, 2023 1 commit
  7. 07 Mar, 2023 3 commits
  8. 06 Mar, 2023 4 commits
  9. 05 Mar, 2023 1 commit
  10. 04 Mar, 2023 1 commit
  11. 03 Mar, 2023 8 commits
  12. 02 Mar, 2023 1 commit