• Lei Wang's avatar
    [Enhancement] Enhance CUDA compilation by integrating pass context configuration (#1283) · 551ac60d
    Lei Wang authored
    - Updated the `tilelang_callback_cuda_compile` function to accept a `pass_config` parameter, allowing for more flexible compilation options.
    - Introduced handling for fast math and PTXAS options based on the provided pass configuration.
    - Modified the CUDA build process in `rt_mod_cuda.cc` to utilize the current pass context, improving the integration of compilation settings.
    - Refactored NVCC command construction to use a dedicated function for better clarity and maintainability.
    551ac60d
rt_mod_cuda.cc 3.81 KB