"platforms/opencl/vscode:/vscode.git/clone" did not exist on "787869c355c7fae243794f619beff3b730315429"
-
Shangyan Zhou authored
* Fix hidden_size % 128 != 0 * Add `align_down()` function * Use the full warp to wait TMA store * Support arbitrary hidden sizes in fp8 cast * lint
abba6add