- 19 Dec, 2025 2 commits
-
-
silentCoder-dev authored
* remove triton dependence in testing & move triton baseline into example * use ceildiv and handles arbitrary M correctly for triton
-
silentCoder-dev authored
* rename test for curand & add triton baseline * add a comment for calling T.rng_rand() four times * refactor tilelang&triton kernel * Add boundary checks for M not divisible by 128
-