    [Pipelines] Adds pix2pix zero (#2334) · fd3d5502
    Sayak Paul authored
    * add: support for BLIP generation.
    
    * add: support for editing synthetic images.
    
    * remove unnecessary comments.
    
    * add inits and run make fix-copies.
    
    * version change of diffusers.
    
    * fix: condition for loading the captioner.
    
    * default conditions_input_image to False.
    
    * guidance_amount -> cross_attention_guidance_amount
    
    * fix inputs to check_inputs()
    
    * fix: attribute.
    
    * fix: prepare_attention_mask() call.
    
    * debugging.
    
    * better placement of references.
    
    * remove torch.no_grad() decorations.
    
    * put torch.no_grad() context before the first denoising loop.
    
    * detach() latents before decoding them.
    
    * put decoding in a torch.no_grad() context.
    
    * add reconstructed image for debugging.
    
    * no_grad()
    
    * apply formatting.
    
    * address one-off suggestions from the draft PR.
    
    * back to torch.no_grad() and add more elaborate comments.
    
    * refactor prepare_unet() per Patrick's suggestions.
    
    * more elaborate description for .
    
    * formatting.
    
    * add docstrings to the methods specific to pix2pix zero.
    
    * suspecting a redundant noise prediction.
    
    * needed for gradient computation chain.
    
    * less hacks.
    
    * fix: attention mask handling within the processor.
    
    * remove attention reference map computation.
    
    * fix: cross attn args.
    
    * fix: processor.
    
    * store attention maps.
    
    * fix: attention processor.
    
    * update docs and better treatment to xa args.
    
    * update the final noise computation call.
    
    * change xa args call.
    
    * remove xa args option from the pipeline.
    
    * add: docs.
    
    * first test.
    
    * fix: url call.
    
    * fix: argument call.
    
    * remove image conditioning for now.
    
    * 🚨 add: fast tests.
    
    * explicit placement of the xa attn weights.
    
    * add: slow tests 🐢
    
    * fix: tests.
    
    * edited direction embedding should be on the same device as prompt_embeds.
    
    * debugging message.
    
    * debugging.
    
    * add pix2pix zero pipeline for a non-deterministic test.
    
    * debugging.
    
    * remove debugging message.
    
    * make caption generation _
    
    * address comments (part I).
    
    * address PR comments (part II)
    
    * fix: DDPM test assertion.
    
    * refactor doc.
    
    * address PR comments (part III).
    
    * fix: type annotation for the scheduler.
    
    * apply styling.
    
    * skip_mps and add note on embeddings in the docs.
__init__.py 4.32 KB