• Lei Wang's avatar
    [Feature] Implement fast integer power operation and related API (#466) · 1f5eb492
    Lei Wang authored
    * [Refactor] Enhance TMA barrier validation and support for additional architectures (#463)
    
    * Updated the TMA barrier validation in `inject_tma_barrier.cc` to check for non-empty `barrier_id_to_range_` before raising an error for missing `create_list_of_mbarrier`.
    * Refactored architecture checks in `phase.py` to utilize a new constant `SUPPORTED_TMA_ARCHS`, allowing for easier updates and improved readability in the target architecture validation logic.
    
    * [Feature] Implement fast integer power operation and related API
    
    * Added a new math operation `tl.power_of_int` in `math.cc` for efficient integer exponentiation.
    * Introduced a corresponding Python API `pow_of_int` in `tir/op.py` to facilitate usage in TileLang.
    * Enhanced `common.h` with a template function for integer power calculations.
    * Updated documentation to reflect the new functionality and usage examples.
    1f5eb492
common.h 7.1 KB