"vscode:/vscode.git/clone" did not exist on "a1d3866dda6539a0e9e2cfc49e9cb1e887baaaec"
- 24 Apr, 2025 1 commit
-
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
- 23 Apr, 2025 1 commit
-
-
Michael Yao authored
Signed-off-by:windsonsea <haifeng.yao@daocloud.io>
-
- 22 Apr, 2025 1 commit
-
-
Lei Wang authored
Signed-off-by:
xinyuxiao <xinyuxiao2024@gmail.com> Co-authored-by:
xinyuxiao <xinyuxiao2024@gmail.com>
-
- 11 Apr, 2025 1 commit
-
-
Michael Goin authored
-
- 07 Apr, 2025 2 commits
-
-
Driss Guessous authored
Signed-off-by:drisspg <drisspguessous@gmail.com>
-
yihong authored
Signed-off-by:yihong0618 <zouzou0208@gmail.com>
-
- 05 Apr, 2025 1 commit
-
-
Tristan Leclercq authored
Signed-off-by:Tristan Leclercq <tristanleclercq@gmail.com>
-
- 01 Apr, 2025 1 commit
-
-
chaow-amd authored
Signed-off-by:chaow <chaow@amd.com>
-
- 24 Mar, 2025 1 commit
-
-
Jee Jee Li authored
-
- 21 Mar, 2025 1 commit
-
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 03 Mar, 2025 1 commit
-
-
Qubitium-ModelCloud authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 28 Feb, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 27 Feb, 2025 1 commit
-
-
Szymon Ożóg authored
-
- 18 Feb, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 05 Feb, 2025 1 commit
-
-
Michael Goin authored
-
- 31 Jan, 2025 1 commit
-
-
Brian Dellabetta authored
Based on a request by @mgoin , with @kylesayrs we have added an example doc for int4 w4a16 quantization, following the pre-existing int8 w8a8 quantization example and the example available in [`llm-compressor`](https://github.com/vllm-project/llm-compressor/blob/main/examples/quantization_w4a16/llama3_example.py ) FIX #n/a (no issue created) @kylesayrs and I have discussed a couple additional improvements for the quantization docs. We will revisit at a later date, possibly including: - A section for "choosing the correct quantization scheme/ compression technique" - Additional vision or audio calibration datasets --------- Signed-off-by:
Brian Dellabetta <bdellabe@redhat.com> Co-authored-by:
Michael Goin <michael@neuralmagic.com>
-
- 29 Jan, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 23 Jan, 2025 2 commits
-
-
Gregory Shtrasberg authored
Signed-off-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Co-authored-by:
Micah Williamson <micah.williamson@amd.com>
-
Michael Goin authored
Signed-off-by:
mgoin <michael@neuralmagic.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 15 Jan, 2025 1 commit
-
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
- 14 Jan, 2025 1 commit
-
-
TJian authored
Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com>
-
- 12 Jan, 2025 1 commit
-
-
Rafael Vasquez authored
Signed-off-by:Rafael Vasquez <rafvasq21@gmail.com>
-
- 08 Jan, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 06 Jan, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-