[Ck tile] layernorm2d fwd optimize (#1637)
* optimize small N case using vec io and rcp div
* [Ck_tile] layernorm, add param to control fastdiv; update generated code, tests pass
* [Ck_tile] fix blockSize compute in Generic2dBlockShape
* [Ck_tile] fix kfastfdiv template style
* [Ck_tile] layernorm, fix stype in review
---------
Co-authored-by: dummycoderfe <noplydummmycoder@163.com>