"vscode:/vscode.git/clone" did not exist on "53be59dcc072c78730a83f848154357286c63ccd"
  • Lei Wang's avatar
    [Language] Support accumulative `T.reduce_sum` (#436) · 6c737768
    Lei Wang authored
    * [Enhancement] Update reduce operations to support clear option in sum and abs sum (#436)
    
    * Modified reduce_sum and reduce_absmax functions to include a clear parameter, allowing for accumulation on existing values.
    * Updated ReduceOp::Lower method to handle initialization and buffer duplication based on the clear flag for sum and abs sum operations.
    * Added new tests for reduce_sum and reduce_max with clear functionality to ensure correctness in various scenarios.
    * Enhanced documentation for reduce functions to clarify the behavior of the clear parameter.
    
    * lint fix
    
    * Update tensor type annotations in test_tilelang_transform_annotate_device_regions.py from Buffer to Tensor
    
    * Update tensor type in reduce sum tests from float16 to float32 for improved precision
    6c737768
reduce.cc 11.5 KB