    [Enhancement] Add thread count validation for ReduceOp fragment layout inference (#1225) · 67cc8611
    Lei Wang authored
    * [Enhancement] Add thread count validation for ReduceOp fragment layout inference
    
    * Introduced a check to ensure that the thread count is divisible by the replicate extent during layout inference in ReduceOpNode. This validation prevents layout inference failures and provides detailed error messages to guide users in resolving issues related to thread block sizes and fragment layouts.
    * Updated tests to remove unsupported configurations that could lead to layout inference errors, ensuring more robust testing scenarios.
    
    * lint fix
reduce.cc 20.1 KB
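
The divisibility check described in the commit message can be illustrated with a minimal, standalone sketch. This is not the code from `reduce.cc`: the function name `CheckThreadsDivisible`, its parameters, and the wording of the error message are all hypothetical, chosen only to show the shape of such a validation (thread count must be a multiple of the replicate extent, and a failure should tell the user how to fix it).

```cpp
#include <cstdint>
#include <sstream>
#include <string>

// Hypothetical helper (not the actual ReduceOpNode implementation).
// Returns an empty string when the configuration is valid, otherwise a
// descriptive error message that a caller could surface to the user.
std::string CheckThreadsDivisible(std::int64_t num_threads,
                                  std::int64_t replicate_extent) {
  std::ostringstream msg;
  if (replicate_extent <= 0) {
    msg << "replicate extent must be positive, got " << replicate_extent;
    return msg.str();
  }
  if (num_threads % replicate_extent != 0) {
    // Assumed message format; the real diagnostic may differ.
    msg << "layout inference failed: thread count (" << num_threads
        << ") is not divisible by the replicate extent ("
        << replicate_extent
        << "); adjust the thread block size or the fragment layout";
    return msg.str();
  }
  return std::string();  // empty string means the check passed
}
```

Under these assumptions, `CheckThreadsDivisible(128, 4)` returns an empty string, while `CheckThreadsDivisible(128, 3)` returns a message naming both values, which is the kind of early, explicit failure the commit aims for in place of an opaque layout inference error.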