Add torchao quant (int4/int8/fp8) to llama models (#1341)
Co-authored-by:
Lianmin Zheng <lianminzheng@gmail.com>
Showing
test/srt/test_torchao.py
0 → 100644
Please register or sign in to comment
Co-authored-by:
Lianmin Zheng <lianminzheng@gmail.com>