[Feature] Blazing fast W4A16 inference (#202)
* add w4a16
* fix `deploy.py`
* add doc
* add w4a16 kernels
* fuse w1/w3 & bugfixes
* fix typo
* python
* guard sm75/80 features
* add missing header
* refactor
* qkvo bias
* update cost model
* fix lint
* update `deploy.py`