-
Lei Wang authored
* [Refactor] Enhance TMA barrier validation and support for additional architectures (#463) * Updated the TMA barrier validation in `inject_tma_barrier.cc` to check for non-empty `barrier_id_to_range_` before raising an error for missing `create_list_of_mbarrier`. * Refactored architecture checks in `phase.py` to utilize a new constant `SUPPORTED_TMA_ARCHS`, allowing for easier updates and improved readability in the target architecture validation logic. * [Feature] Implement fast integer power operation and related API * Added a new math operation `tl.power_of_int` in `math.cc` for efficient integer exponentiation. * Introduced a corresponding Python API `pow_of_int` in `tir/op.py` to facilitate usage in TileLang. * Enhanced `common.h` with a template function for integer power calculations. * Updated documentation to reflect the new functionality and usage examples.
1f5eb492