• Lei Wang's avatar
    [Language] Support accumulative `T.reduce_sum` (#436) · 6c737768
    Lei Wang authored
    * [Enhancement] Update reduce operations to support clear option in sum and abs sum (#436)
    
    * Modified reduce_sum and reduce_absmax functions to include a clear parameter, allowing for accumulation on existing values.
    * Updated ReduceOp::Lower method to handle initialization and buffer duplication based on the clear flag for sum and abs sum operations.
    * Added new tests for reduce_sum and reduce_max with clear functionality to ensure correctness in various scenarios.
    * Enhanced documentation for reduce functions to clarify the behavior of the clear parameter.
    
    * lint fix
    
    * Update tensor type annotations in test_tilelang_transform_annotate_device_regions.py from Buffer to Tensor
    
    * Update tensor type in reduce sum tests from float16 to float32 for improved precision
    6c737768
test_tilelang_language_reduce_max.py 2.34 KB