• glide-the's avatar
    fix: CogVideox train dataset _preprocess_data crop video (#9574) · 66eef9a6
    glide-the authored
    
    
    * Removed int8 to float32 conversion (`* 2.0 - 1.0`) from `train_transforms` as it caused image overexposure.
    
    Added `_resize_for_rectangle_crop` function to enable video cropping functionality. The cropping mode can be configured via `video_reshape_mode`, supporting options: ['center', 'random', 'none'].
    
    * The number 127.5 may experience precision loss during division operations.
    
    * wandb request pil image Type
    
    * Resizing bug
    
    * del jupyter
    
    * make style
    
    * Update examples/cogvideo/README.md
    
    * make style
    
    ---------
    
    Co-authored-by: --unset <--unset>
    Co-authored-by: default avatarAryan <aryan@huggingface.co>
    66eef9a6
README.md 16.1 KB