Commit d499aa11 authored by Muyang Li

fix the links

parent e3597f7e
@@ -8,7 +8,7 @@
 SVDQuant is a post-training quantization technique for 4-bit weights and activations that well maintains visual fidelity. On 12B FLUX.1-dev, it achieves 3.6× memory reduction compared to the BF16 model. By eliminating CPU offloading, it offers 8.7× speedup over the 16-bit model when on a 16GB laptop 4090 GPU, 3× faster than the NF4 W4A16 baseline. On PixArt-∑, it demonstrates significantly superior visual quality over other W4A4 or even W4A8 baselines. "E2E" means the end-to-end latency including the text encoder and VAE decoder.
 **SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models**<br>
-Muyang Li, Yujun Lin, Zhekai Zhang, Tianle Cai, Xiuyu Li, Junxian Guo, Enze Xie, Chenlin Meng, Jun-Yan Zhu, and Song Han <br>
+[Muyang Li](https://lmxyy.me), [Yujun Lin](https://yujunlin.com), [Zhekai Zhang](https://hanlab.mit.edu/team/zhekai-zhang), [Tianle Cai](https://www.tianle.website/#/), [Xiuyu Li](https://xiuyuli.com), [Junxian Guo](https://github.com/JerryGJX), [Enze Xie](https://xieenze.github.io), [Chenlin Meng](https://cs.stanford.edu/~chenlin/), [Jun-Yan Zhu](https://www.cs.cmu.edu/~junyanz/), and [Song Han](https://hanlab.mit.edu/songhan) <br>
 *MIT, NVIDIA, CMU, Princeton, UC Berkeley, SJTU, and Pika Labs* <br>
 ![teaser](./assets/demo.gif)
......
@@ -18,12 +18,12 @@
 <br>
 <a href='https://github.com/JerryGJX'>Junxian Guo</a>,
 <a href='https://xieenze.github.io'>Enze Xie</a>,
-<a href='https://www.cs.cmu.edu/~srinivas/'>Chenlin Meng</a>,
-<a href='https://cs.stanford.edu/~chenlin/'>Jun-Yan Zhu</a>,
+<a href='https://cs.stanford.edu/~chenlin/'>Chenlin Meng</a>,
+<a href='https://www.cs.cmu.edu/~junyanz/'>Jun-Yan Zhu</a>,
 and <a href='https://hanlab.mit.edu/songhan'>Song Han</a>
 </h3>
 <div style="display: flex; justify-content: center; align-items: center; text-align: center;">
-[Paper]
+<a href="https://arxiv.org/abs/2411.05007">[Paper]</a>
 &nbsp;
 <a href='https://github.com/mit-han-lab/nunchaku'>
 [Code]
......