Add more GPU architctures support (#76)
* Add more GPU architctures support
* Merge fmha and mla runner
* add varlen & non varlen support, and add incontiguous tensor support
* update readme
* add varlen api
---------
Co-authored-by:
dianzhangc <dianzhangc@nvidia.com>
Showing
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
csrc/sm100/pybind.cu
0 → 100644
File moved
File moved
Please register or sign in to comment