• Cunxiao Ni's avatar
    [Example] add w4a8 gemm kernel (#815) · 4bcb1593
    Cunxiao Ni authored
    * [Bugfix] fix autotune bug
    
    * [Example] add w4a8 gemm kernel
    
    * fix lint: pinned the version of `ml_dtypes`
    The version of ml_dtypes should be pinned in the dependency specification. If the version of ml_dtypes is too low, it may result in errors such as fp4 not being defined.
    
    * Renames example for dequantization GEMM
    
    * format
    
    * add w4a8 example to ci
    
    * fix lint
    4bcb1593
example_dequant_gemm_w4a8.py 7.4 KB