examples/research_projects/diffusion_dpo/train_diffusion_dpo.py · 2a97067b8483db0cd832511a1d7df9b870a3e500 · OpenDAS / diffusers

[Experimental] Diffusion LoRA DPO training (#6422) · 2a97067b

Sayak Paul authored Jan 05, 2024

* add: experimental script for diffusion dpo training.

* random_crop cli.

* fix: caption tokenization.

* fix: pixel_values index.

* fix: grad?

* debug

* fix: reduction.

* fixes in the loss calculation.

* style

* fix: unwrap call.

* fix: validation inference.

* add: initial sdxl script

* debug

* make sure images in the tuple are of same res

* fix model_max_length

* report print

* boom

* fix: numerical issues.

* fix: resolution

* comment about resize.

* change the order of the training transformation.

* save call.

* debug

* remove print

* manually detaching necessary?

* use the same vae for validation.

* add: readme.

2a97067b

train_diffusion_dpo.py 37.6 KB

Replace train_diffusion_dpo.py