1. 23 Mar, 2023 14 commits
    • Steven Liu's avatar
      [docs] Add Colab notebooks and Spaces (#2713) · 1870fb05
      Steven Liu authored
      * add colab notebook and spaces
      
      * fix image link
      1870fb05
    • YiYi Xu's avatar
      Flax controlnet (#2727) · df91c447
      YiYi Xu authored
      
      
      * add contronet flax
      
      ---------
      Co-authored-by: default avataryiyixuxu <yixu310@gmail,com>
      df91c447
    • Pedro Cuenca's avatar
      Skip `mps` in text-to-video tests (#2792) · aa0531fa
      Pedro Cuenca authored
      * Skip mps in text-to-video tests.
      
      * style
      
      * Skip UNet3D mps tests.
      aa0531fa
    • Haofan Wang's avatar
      Update train_text_to_image_lora.py (#2767) · dc5b4e23
      Haofan Wang authored
      * Update train_text_to_image_lora.py
      
      * Update train_text_to_image_lora.py
      
      * Update train_text_to_image_lora.py
      
      * Update train_text_to_image_lora.py
      
      * format
      dc5b4e23
    • Sayak Paul's avatar
      [Docs] small fixes to the text to video doc. (#2787) · 0d7aac3e
      Sayak Paul authored
      * small fixes to the text to video doc.
      
      * add: Spaces link.
      
      * add: warning on research-only model.
      0d7aac3e
    • Nipun Jindal's avatar
      [2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipeline (#2779) · 055c90f5
      Nipun Jindal authored
      
      
      [2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipelines
      Co-authored-by: default avatarnjindal <njindal@adobe.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      055c90f5
    • Kashif Rasul's avatar
      Music Spectrogram diffusion pipeline (#1044) · 2ef9bdd7
      Kashif Rasul authored
      
      
      * initial TokenEncoder and ContinuousEncoder
      
      * initial modules
      
      * added ContinuousContextTransformer
      
      * fix copy paste error
      
      * use numpy for get_sequence_length
      
      * initial terminal relative positional encodings
      
      * fix weights keys
      
      * fix assert
      
      * cross attend style: concat encodings
      
      * make style
      
      * concat once
      
      * fix formatting
      
      * Initial SpectrogramPipeline
      
      * fix input_tokens
      
      * make style
      
      * added mel output
      
      * ignore weights for config
      
      * move mel to numpy
      
      * import pipeline
      
      * fix class names and import
      
      * moved models to models folder
      
      * import ContinuousContextTransformer and SpectrogramDiffusionPipeline
      
      * initial spec diffusion converstion script
      
      * renamed config to t5config
      
      * added weight loading
      
      * use arguments instead of t5config
      
      * broadcast noise time to batch dim
      
      * fix call
      
      * added scale_to_features
      
      * fix weights
      
      * transpose laynorm weight
      
      * scale is a vector
      
      * scale the query outputs
      
      * added comment
      
      * undo scaling
      
      * undo depth_scaling
      
      * inital get_extended_attention_mask
      
      * attention_mask is none in self-attention
      
      * cleanup
      
      * manually invert attention
      
      * nn.linear need bias=False
      
      * added T5LayerFFCond
      
      * remove to fix conflict
      
      * make style and dummy
      
      * remove unsed variables
      
      * remove predict_epsilon
      
      * Move accelerate to a soft-dependency (#1134)
      
      * finish
      
      * finish
      
      * Update src/diffusers/modeling_utils.py
      
      * Update src/diffusers/pipeline_utils.py
      Co-authored-by: default avatarAnton Lozhkov <anton@huggingface.co>
      
      * more fixes
      
      * fix
      Co-authored-by: default avatarAnton Lozhkov <anton@huggingface.co>
      
      * fix order
      
      * added initial midi to note token data pipeline
      
      * added int to int tokenizer
      
      * remove duplicate
      
      * added logic for segments
      
      * add melgan to pipeline
      
      * move autoregressive gen into pipeline
      
      * added note_representation_processor_chain
      
      * fix dtypes
      
      * remove immutabledict req
      
      * initial doc
      
      * use np.where
      
      * require note_seq
      
      * fix typo
      
      * update dependency
      
      * added note-seq to test
      
      * added is_note_seq_available
      
      * fix import
      
      * added toc
      
      * added example usage
      
      * undo for now
      
      * moved docs
      
      * fix merge
      
      * fix imports
      
      * predict first segment
      
      * avoid un-needed copy to and from cpu
      
      * make style
      
      * Copyright
      
      * fix style
      
      * add test and fix inference steps
      
      * remove bogus files
      
      * reorder models
      
      * up
      
      * remove transformers dependency
      
      * make work with diffusers cross attention
      
      * clean more
      
      * remove @
      
      * improve further
      
      * up
      
      * uP
      
      * Apply suggestions from code review
      
      * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
      
      * loop over all tokens
      
      * make style
      
      * Added a section on the model
      
      * fix formatting
      
      * grammer
      
      * formatting
      
      * make fix-copies
      
      * Update src/diffusers/pipelines/__init__.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/diffusers/pipelines/spectrogram_diffusion/pipeline_spectrogram_diffusion.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * added callback ad optional ionnx
      
      * do not squeeze batch dim
      
      * clean up more
      
      * upload
      
      * convert jax to nnumpy
      
      * make style
      
      * fix warning
      
      * make fix-copies
      
      * fix warning
      
      * add initial fast tests
      
      * add initial pipeline_params
      
      * eval mode due to dropout
      
      * skip batch tests as pipeline runs on a single file
      
      * make style
      
      * fix relative path
      
      * fix doc tests
      
      * Update src/diffusers/models/t5_film_transformer.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/diffusers/models/t5_film_transformer.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Update docs/source/en/api/pipelines/spectrogram_diffusion.mdx
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * add MidiProcessor
      
      * format
      
      * fix org
      
      * Apply suggestions from code review
      
      * Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
      
      * make style
      
      * pin protobuf to <4
      
      * fix formatting
      
      * white space
      
      * tensorboard needs protobuf
      
      ---------
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarAnton Lozhkov <anton@huggingface.co>
      2ef9bdd7
    • Naoki Ainoya's avatar
      Rename 'CLIPFeatureExtractor' class to 'CLIPImageProcessor' (#2732) · 14e3a28c
      Naoki Ainoya authored
      The 'CLIPFeatureExtractor' class name has been renamed to 'CLIPImageProcessor' in order to comply with future deprecation. This commit includes the necessary changes to the affected files.
      14e3a28c
    • Mishig's avatar
      [doc wip] literalinclude (#2718) · 8e35ef01
      Mishig authored
      8e35ef01
    • Patrick von Platen's avatar
      [UNet3DModel] Fix with attn processor (#2790) · a8315ce1
      Patrick von Platen authored
      * [UNet3DModel] Fix attn processor
      
      * make style
      a8315ce1
    • Sayak Paul's avatar
      0d633a42
    • Sayak Paul's avatar
      [Examples] InstructPix2Pix instruct training script (#2478) · 9dc84448
      Sayak Paul authored
      
      
      * add: initial implementation of the pix2pix instruct training script.
      
      * shorten cli arg.
      
      * fix: main process check.
      
      * fix: dataset column names.
      
      * simplify tokenization.
      
      * proper placement of null conditions.
      
      * apply styling.
      
      * remove debugging message for conditioning do.
      
      * complete license.
      
      * add: requirements.tzt
      
      * wandb column name order.
      
      * fix: augmentation.
      
      * change: dataset_id.
      
      * fix: convert_to_np() call.
      
      * fix: reshaping.
      
      * fix: final ema copy.
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * address PR comments.
      
      * add: readme details.
      
      * config fix.
      
      * downgrade version.
      
      * reduce image width in the readme.
      
      * note on hyperparameters during generation.
      
      * add: output images.
      
      * update readme.
      
      * minor edits to readme.
      
      * debugging statement.
      
      * explicitly placement of the pipeline.
      
      * bump minimum diffusers version.
      
      * fix: device attribute error.
      
      * weight dtype.
      
      * debugging.
      
      * add dtype inform.
      
      * add seoarate te and vae.
      
      * add: explicit casting/
      
      * remove casting.
      
      * up.
      
      * up 2.
      
      * up 3.
      
      * autocast.
      
      * disable mixed-precision in the final inference.
      
      * debugging information.
      
      * autocasting.
      
      * add: instructpix2pix training section to the docs.
      
      * Empty-Commit
      
      ---------
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      9dc84448
    • Sayak Paul's avatar
      add: section on multiple controlnets. (#2762) · c681ad1a
      Sayak Paul authored
      
      
      * add: section on multiple controlnets.
      Co-authored-by: default avatarWilliam Berman <WLBberman@gmail.com>
      
      * fix: docs.
      
      * fix: docs.
      
      ---------
      Co-authored-by: default avatarWilliam Berman <WLBberman@gmail.com>
      c681ad1a
    • Haofan Wang's avatar
      Support for Offset Noise in examples (#2753) · e0d8c9ef
      Haofan Wang authored
      * add noise offset
      
      * make style
      e0d8c9ef
  2. 22 Mar, 2023 3 commits
    • Pedro Cuenca's avatar
      `mps`: remove warmup passes (#2771) · 92e1164e
      Pedro Cuenca authored
      * Remove warmup passes in mps tests.
      
      * Update mps docs: no warmup pass in PyTorch 2
      
      * Update imports.
      92e1164e
    • Patrick von Platen's avatar
      [MS Text To Video] Add first text to video (#2738) · ca1a2229
      Patrick von Platen authored
      
      
      * [MS Text To Video} Add first text to video
      
      * upload
      
      * make first model example
      
      * match unet3d params
      
      * make sure weights are correcctly converted
      
      * improve
      
      * forward pass works, but diff result
      
      * make forward work
      
      * fix more
      
      * finish
      
      * refactor video output class.
      
      * feat: add support for a video export utility.
      
      * fix: opencv availability check.
      
      * run make fix-copies.
      
      * add: docs for the model components.
      
      * add: standalone pipeline doc.
      
      * edit docstring of the pipeline.
      
      * add: right path to TransformerTempModel
      
      * add: first set of tests.
      
      * complete fast tests for text to video.
      
      * fix bug
      
      * up
      
      * three fast tests failing.
      
      * add: note on slow tests
      
      * make work with all schedulers
      
      * apply styling.
      
      * add slow tests
      
      * change file name
      
      * update
      
      * more correction
      
      * more fixes
      
      * finish
      
      * up
      
      * Apply suggestions from code review
      
      * up
      
      * finish
      
      * make copies
      
      * fix pipeline tests
      
      * fix more tests
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPedro Cuenca <pedro@huggingface.co>
      
      * apply suggestions
      
      * up
      
      * revert
      
      ---------
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: default avatarPedro Cuenca <pedro@huggingface.co>
      ca1a2229
    • Steven Liu's avatar
      [docs] Clarify purpose of reproducibility docs (#2756) · 7fe88613
      Steven Liu authored
      * clarify purpose of repro docs
      
      * apply feedback
      7fe88613
  3. 21 Mar, 2023 10 commits
  4. 20 Mar, 2023 6 commits
  5. 18 Mar, 2023 3 commits
  6. 17 Mar, 2023 3 commits
  7. 16 Mar, 2023 1 commit