index.rst 1.19 KB
Newer Older
1
2
3
Nunchaku Documentation
======================
**Nunchaku** is a high-performance inference engine optimized for low-bit diffusion models and LLMs,
Muyang Li's avatar
Muyang Li committed
4
5
as introduced in our paper `SVDQuant <svdquant_paper_>`_.
Check out `DeepCompressor <deepcompressor_repo_>`_ for the quantization library.
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38

.. toctree::
   :maxdepth: 2
   :caption: Installation

   installation/installation.rst
   installation/setup_windows.rst

.. toctree::
    :maxdepth: 1
    :caption: Usage Tutorials

    usage/basic_usage.rst
    usage/lora.rst
    usage/kontext.rst
    usage/controlnet.rst
    usage/qencoder.rst
    usage/offload.rst
    usage/attention.rst
    usage/fbcache.rst
    usage/pulid.rst

.. toctree::
    :maxdepth: 1
    :caption: Python API Reference

    python_api/nunchaku.rst

.. toctree::
    :maxdepth: 1
    :caption: Useful Tools
    :titlesonly:

Muyang Li's avatar
Muyang Li committed
39
40
    ComfyUI Plugin: ComfyUI-nunchaku <https://github.com/mit-han-lab/ComfyUI-nunchaku>
    Custom Model Quantization: DeepCompressor <https://github.com/mit-han-lab/deepcompressor>
41
42
43
44
45
46
47
48
49
    Gradio Demos <https://github.com/mit-han-lab/nunchaku/tree/main/app>


.. toctree::
    :maxdepth: 1
    :caption: Other Resources

    faq/faq.rst
    developer/contribution_guide.rst