    [Experimental] Diffusion LoRA DPO training (#6422) · 2a97067b
    Sayak Paul authored
    * add: experimental script for diffusion dpo training.
    
    * random_crop cli.
    
    * fix: caption tokenization.
    
    * fix: pixel_values index.
    
    * fix: grad?
    
    * debug
    
    * fix: reduction.
    
    * fixes in the loss calculation.
    
    * style
    
    * fix: unwrap call.
    
    * fix: validation inference.
    
    * add: initial sdxl script
    
    * debug
    
    * make sure images in the tuple are of same res
    
    * fix model_max_length
    
    * report print
    
    * boom
    
    * fix: numerical issues.
    
    * fix: resolution
    
    * comment about resize.
    
    * change the order of the training transformation.
    
    * save call.
    
    * debug
    
    * remove print
    
    * manually detaching necessary?
    
    * use the same vae for validation.
    
    * add: readme.
train_diffusion_dpo.py 37.6 KB