"platforms/common/src/kernels/pme.cc" did not exist on "3b2579a5fd04d21e5f4aefd23922bbc2a19612c3"
-
Shangyan Zhou authored
* Fix hidden_size % 128 != 0 * Add `align_down()` function * Use the full warp to wait TMA store * Support arbitrary hidden sizes in fp8 cast * lint
abba6add