• Yuxuan Zhang's avatar
    Modify the implementation of retrieve_timesteps in CogView4-Control. (#11125) · 1d37f420
    Yuxuan Zhang authored
    
    
    * 1
    
    * change to channel 1
    
    * cogview4 control training
    
    * add CacheMixin
    
    * 1
    
    * remove initial_input_channels change for val
    
    * 1
    
    * update
    
    * use 3.5
    
    * new loss
    
    * 1
    
    * use imagetoken
    
    * for megatron convert
    
    * 1
    
    * train con and uc
    
    * 2
    
    * remove guidance_scale
    
    * Update pipeline_cogview4_control.py
    
    * fix
    
    * use cogview4 pipeline with timestep
    
    * update shift_factor
    
    * remove the uncond
    
    * add max length
    
    * change convert and use GLMModel instead of GLMForCasualLM
    
    * fix
    
    * [cogview4] Add attention mask support to transformer model
    
    * [fix] Add attention mask for padded token
    
    * update
    
    * remove padding type
    
    * Update train_control_cogview4.py
    
    * resolve conflicts with #10981
    
    * add control convert
    
    * use control format
    
    * fix
    
    * add missing import
    
    * update with cogview4 formate
    
    * make style
    
    * Update pipeline_cogview4_control.py
    
    * Update pipeline_cogview4_control.py
    
    * remove
    
    * Update pipeline_cogview4_control.py
    
    * put back
    
    * Apply style fixes
    
    ---------
    Co-authored-by: default avatarOleehyO <leehy0357@gmail.com>
    Co-authored-by: default avataryiyixuxu <yixu310@gmail.com>
    Co-authored-by: default avatargithub-actions[bot] <github-actions[bot]@users.noreply.github.com>
    1d37f420
pipeline_cogview4_control.py 34.5 KB