    Support Ampere architecture (#204) · b8d90fb7
    Chenggang Zhao authored
    * Update README
    
    * Update `setup.py`
    
    * Fix headers
    
    * Add `DISABLE_NVSHMEM` for APIs
    
    * Fix launch
    
    * Fix TMA settings
    
    * Fix TMA usages
    
    * Fix dlink
    
    * Separate layout kernels
    
    * Update version
    
    * Add `is_sm90_compiled`
    
    * Fix tests
    
    * Add NVLink connection checks
    
    * Update README
    
    * Fix tests
    
    * Add some comments
    
    * Minor fix
    
    * Minor fix
    
    * Fix bugs
utils.py 3.32 KB