Implement AWQ quantization support for LLaMA (#1032)
Co-authored-by:Robert Irvine <robert@seamlessml.com> Co-authored-by:
root <rirv938@gmail.com> Co-authored-by:
Casper <casperbh.96@gmail.com> Co-authored-by:
julian-q <julianhquevedo@gmail.com>
Showing
csrc/quantization.cpp
0 → 100644
This diff is collapsed.
Please register or sign in to comment