[Enhancement] Enhance CUDA compilation by integrating pass context configuration (#1283)
- Updated the `tilelang_callback_cuda_compile` function to accept a `pass_config` parameter, allowing for more flexible compilation options.
- Introduced handling for fast math and PTXAS options based on the provided pass configuration.
- Modified the CUDA build process in `rt_mod_cuda.cc` to utilize the current pass context, improving the integration of compilation settings.
- Refactored NVCC command construction to use a dedicated function for better clarity and maintainability.
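
To illustrate the idea of deriving NVCC flags from a pass configuration, here is a minimal sketch. It is not the code from this commit: the function name `build_nvcc_command` and the config keys `example.enable_fast_math` and `example.ptxas_options` are hypothetical, chosen only for illustration. The NVCC flags themselves (`--ptx`, `-arch`, `--use_fast_math`, `-Xptxas`) are standard.

```python
from typing import List, Mapping, Optional


def build_nvcc_command(
    source_path: str,
    output_path: str,
    arch: str = "sm_80",
    pass_config: Optional[Mapping[str, object]] = None,
) -> List[str]:
    """Build an NVCC argument list from an optional pass configuration.

    The config keys used here are hypothetical placeholders, not the
    keys used by the actual project.
    """
    cfg = dict(pass_config or {})
    cmd = ["nvcc", "--ptx", f"-arch={arch}"]

    # Hypothetical key: enable NVCC's fast math mode when requested.
    if cfg.get("example.enable_fast_math", False):
        cmd.append("--use_fast_math")

    # Hypothetical key: a list of options forwarded to ptxas.
    # -Xptxas accepts a comma-separated list of options.
    ptxas_options = list(cfg.get("example.ptxas_options", []))
    if ptxas_options:
        cmd += ["-Xptxas", ",".join(ptxas_options)]

    cmd += ["-o", output_path, source_path]
    return cmd


if __name__ == "__main__":
    example = build_nvcc_command(
        "kernel.cu",
        "kernel.ptx",
        pass_config={
            "example.enable_fast_math": True,
            "example.ptxas_options": ["-v"],
        },
    )
    print(" ".join(example))
```

Keeping command construction in one function means the compile callback only has to forward the configuration it receives and never assembles flags inline.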