"git@developer.sourcefind.cn:OpenDAS/nni.git" did not exist on "7748a2d4232ec3a8d1104a1a9d9114a907d3b765"
[Language] Support tile operator `T.cumsum` (#423)
* [Feature] Implement CumSum operation in TileLang * Added CumSumOp class for cumulative sum operations, including argument validation and lowering logic. * Introduced CumSum2D template for CUDA, supporting both forward and reverse cumulative sums. * Created tests for CumSum functionality in shared memory and fragment contexts. * Updated language interface to include cumsum operation, enhancing the reduction capabilities of TileLang. * Refactored reduce.py to support cumsum functionality with appropriate memory allocation and copying mechanisms. * lint fix
Showing
Please register or sign in to comment