- 18 Feb, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 05 Feb, 2025 1 commit
-
-
Michael Goin authored
-
- 31 Jan, 2025 1 commit
-
-
Brian Dellabetta authored
Based on a request by @mgoin , with @kylesayrs we have added an example doc for int4 w4a16 quantization, following the pre-existing int8 w8a8 quantization example and the example available in [`llm-compressor`](https://github.com/vllm-project/llm-compressor/blob/main/examples/quantization_w4a16/llama3_example.py ) FIX #n/a (no issue created) @kylesayrs and I have discussed a couple additional improvements for the quantization docs. We will revisit at a later date, possibly including: - A section for "choosing the correct quantization scheme/ compression technique" - Additional vision or audio calibration datasets --------- Signed-off-by:
Brian Dellabetta <bdellabe@redhat.com> Co-authored-by:
Michael Goin <michael@neuralmagic.com>
-
- 29 Jan, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 23 Jan, 2025 2 commits
-
-
Gregory Shtrasberg authored
Signed-off-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Co-authored-by:
Micah Williamson <micah.williamson@amd.com>
-
Michael Goin authored
Signed-off-by:
mgoin <michael@neuralmagic.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 15 Jan, 2025 1 commit
-
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
- 14 Jan, 2025 1 commit
-
-
TJian authored
Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com>
-
- 12 Jan, 2025 1 commit
-
-
Rafael Vasquez authored
Signed-off-by:Rafael Vasquez <rafvasq21@gmail.com>
-
- 08 Jan, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 06 Jan, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-