[Enhancement] Add buffer load copy functions and improve copy logic in tilelang (#946)
- Introduced new functions for buffer load copy with stride and parallel execution.
- Enhanced the copy logic in `copy.py` to simplify nested if statements for BufferLoad nodes.
- Added corresponding test cases for the new buffer load functionalities.