[Example] add w4a8 gemm kernel (#815)
* [Bugfix] fix autotune bug * [Example] add w4a8 gemm kernel * fix lint: pinned the version of `ml_dtypes` The version of ml_dtypes should be pinned in the dependency specification. If the version of ml_dtypes is too low, it may result in errors such as fp4 not being defined. * Renames example for dequantization GEMM * format * add w4a8 example to ci * fix lint
Showing
| ... | @@ -4,6 +4,6 @@ numpy>=1.23.5 | ... | @@ -4,6 +4,6 @@ numpy>=1.23.5 |
| tqdm>=4.62.3 | tqdm>=4.62.3 | ||
| typing_extensions>=4.10.0 | typing_extensions>=4.10.0 | ||
| cloudpickle | cloudpickle | ||
| ml_dtypes | ml_dtypes>=0.5.3 | ||
| psutil | psutil | ||
| torch | torch |
Please register or sign in to comment