    [Enhancement] Add nvrtc execution backend (#461) · 17f7394f
    Gabriel Wu authored
    
    
    * [wip] feat: add nvrtc backend
    
    * [wip] fix: handle out_idx
    
    * [wip] refactor: move lib logic to libgen
    
    * feat: cache for nvrtc backend
    
    * fmt: run format
    
    * fix: handle cuda bindings import error
    
    * fix: get kernel source
    
    * refactor: speedup pyimport
    
    * Improve error handling for missing cuda-python dependency in nvrtc backend. Raise ImportError with detailed installation instructions instead of logging a warning.
    
    * Enhance nvrtc backend error handling by introducing a flag to check for cuda-python availability. Raise ImportError with detailed installation instructions during initialization if the nvrtc backend is unavailable, improving user experience and clarity.
    
    * Update README.md to include recent NVRTC Backend addition, highlighting reduced compilation time for CUDA templates.
    
    * fix tl_templates
    
    * ensure CUDA context
    
    ---------
    Co-authored-by: LeiWang1999 <leiwang1999@outlook.com>
wrapper.py 43.5 KB