[Enhancement] Add nvrtc execution backend (#461)
* [wip] feat: add nvrtc backend
* [wip] fix: handle out_idx
* [wip] refactor: move lib logic to libgen
* feat: cache for nvrtc backend
* fmt: run format
* fix: handle cuda bindings import error
* fix: handle cuda bindings import error
* fix: handle cuda bindings import error
* fix: handle cuda bindings import error
* fix: get kernel source
* refactor: speedup pyimport
* Improve error handling for missing cuda-python dependency in nvrtc backend. Raise ImportError with detailed installation instructions instead of logging a warning.
* Enhance nvrtc backend error handling by introducing a flag to check for cuda-python availability. Raise ImportError with detailed installation instructions during initialization if the nvrtc backend is unavailable, improving user experience and clarity.
* Update README.md to include recent NVRTC Backend addition, highlighting reduced compilation time for CUDA templates.
* fix tl_templates
* ensure CUDA context
---------
Co-authored-by:
LeiWang1999 <leiwang1999@outlook.com>
Showing
tilelang/contrib/nvrtc.py
0 → 100644
Please register or sign in to comment