Optimization for gridwise group norm (#453)
* use another instance to check the efficiency
* optimize group layer norm
* 1. coalesce data loads/stores for gridwise layer norm Welford. 2. move a sqrt and division into an outer static loop
* add more instances to layernorm
* add 2 more test cases
* remove ignore in generating tuple of vector
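
For readers unfamiliar with the terms above, here is a minimal scalar sketch of a Welford mean/variance pass with the sqrt and division hoisted out of the per-element loop. It is an illustration only, not the kernel code from this commit: the names `welford` and `group_norm`, the flat-list layout, and the `eps` default are assumptions, and it does not model the coalesced memory access, which is specific to the GPU kernel.

```python
import math


def welford(values):
    """Return (mean, population variance) in one pass using Welford's update."""
    count, mean, m2 = 0, 0.0, 0.0
    for x in values:
        count += 1
        delta = x - mean
        mean += delta / count
        m2 += delta * (x - mean)  # uses the already-updated mean
    return mean, (m2 / count if count else 0.0)


def group_norm(x, num_groups, eps=1e-5):
    """Normalize a flat list group by group (hypothetical helper, no gamma/beta)."""
    if len(x) % num_groups != 0:
        raise ValueError("len(x) must be divisible by num_groups")
    size = len(x) // num_groups
    out = []
    for g in range(num_groups):
        group = x[g * size:(g + 1) * size]
        mean, var = welford(group)
        # Hoisted: one sqrt and one division per group, not per element.
        inv_std = 1.0 / math.sqrt(var + eps)
        out.extend((v - mean) * inv_std for v in group)
    return out
```

The inner loop then needs only a subtraction and a multiplication per element, which is the kind of saving the second item of that bullet describes.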
Co-authored-by: Chao Liu <chao.liu2@amd.com>