update the prerequisites

6c1aea93 · Muyang Li · 703fff75 · 6c1aea93
Commit 6c1aea93 authored Nov 08, 2024 by Muyang Li
Hide whitespace changes
Inline Side-by-side

Showing with 4 additions and 0 deletions

README.md README.md +4 -0

No files found.
--- a/README.md
+++ b/README.md
@@ -29,6 +29,10 @@ SVDQuant is a post-training quantization technique for 4-bit weights and activat
 ![efficiency](./assets/efficiency.jpg)SVDQuant reduces the model size of the 12B FLUX.1 by 3.6×. Additionally, Nunchaku, further cuts memory usage of the 16-bit model by 3.5× and delivers 3.0× speedups over the NF4 W4A16 baseline on both the desktop and laptop NVIDIA RTX 4090 GPUs. Remarkably, on laptop 4090, it achieves in total 10.1× speedup by eliminating CPU offloading.
 ## Installation
+**Note**: We currently support only NVIDIA GPUs with architectures sm_86 (Ampere: RTX 3090, A6000), sm_89 (Ada: RTX 4090), and sm_80 (A100). See [this issue](https://github.com/mit-han-lab/nunchaku/issues/1) for more details.
 1. Install dependencies:
 	```shell
 	conda create -n nunchaku python=3.11