• Lei Wang's avatar
    [Bugfix] Convey `compile_flags` to ffi compilation path with pass_configs (#1434) · fba12a5f
    Lei Wang authored
    * [Enhancement] Add device compile flags support in pass configuration
    
    * Introduced `kDeviceCompileFlags` option in the pass configuration to allow additional device compiler flags for CUDA compilation.
    * Updated the `tilelang_callback_cuda_compile` function to merge extra flags from the pass configuration, enhancing flexibility in compiler options.
    * Modified the `JITKernel` class to handle device compile flags appropriately, ensuring they are included during compilation.
    * Documented the new pass configuration key for clarity on usage and expected input formats.
    
    * lint fix
    
    * [Refactor] Simplify compile_flags handling in JIT functions
    
    * Removed redundant string check for compile_flags in the compile, jit, and lazy_jit functions, ensuring compile_flags is consistently treated as a list.
    * Updated the JITKernel class to handle compile_flags as a list when a string is provided, enhancing code clarity and maintainability.
    
    * lint fix
    
    * fix
    fba12a5f
pass_config.py 6.33 KB