Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
a5115f4f
Unverified
Commit
a5115f4f
authored
Jun 11, 2025
by
Cyrus Leung
Committed by
GitHub
Jun 11, 2025
Browse files
[Doc] Fix quantization link titles (#19478)
Signed-off-by:
DarkLight1337
<
tlleungac@connect.ust.hk
>
parent
68b4a261
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
14 additions
and
14 deletions
+14
-14
docs/features/quantization/README.md
docs/features/quantization/README.md
+13
-13
docs/features/quantization/quark.md
docs/features/quantization/quark.md
+1
-1
No files found.
docs/features/quantization/README.md
View file @
a5115f4f
...
@@ -7,16 +7,16 @@ Quantization trades off model precision for smaller memory footprint, allowing l
...
@@ -7,16 +7,16 @@ Quantization trades off model precision for smaller memory footprint, allowing l
Contents:
Contents:
-
[
Supported
_
Hardware
](
supported_hardware.md
)
-
[
Supported
Hardware
](
supported_hardware.md
)
-
[
Auto
_Awq
](
auto_awq.md
)
-
[
Auto
AWQ
](
auto_awq.md
)
-
[
B
nb
](
bnb.md
)
-
[
B
itsAndBytes
](
bnb.md
)
-
[
Bit
blas
](
bitblas.md
)
-
[
Bit
BLAS
](
bitblas.md
)
-
[
G
guf
](
gguf.md
)
-
[
G
GUF
](
gguf.md
)
-
[
G
ptqm
odel
](
gptqmodel.md
)
-
[
G
PTQM
odel
](
gptqmodel.md
)
-
[
I
nt4
](
int4.md
)
-
[
I
NT4 W4A16
](
int4.md
)
-
[
I
nt
8
](
int8.md
)
-
[
I
NT8 W8A
8
](
int8.md
)
-
[
F
p
8
](
fp8.md
)
-
[
F
P8 W8A
8
](
fp8.md
)
-
[
Modelopt
](
modelopt.md
)
-
[
NVIDIA TensorRT Model Optimizer
](
modelopt.md
)
-
[
Quark
](
quark.md
)
-
[
AMD
Quark
](
quark.md
)
-
[
Quantized
_Kvc
ache
](
quantized_kvcache.md
)
-
[
Quantized
KV C
ache
](
quantized_kvcache.md
)
-
[
Torch
ao
](
torchao.md
)
-
[
Torch
AO
](
torchao.md
)
docs/features/quantization/quark.md
View file @
a5115f4f
---
---
title
:
AMD Q
UARK
title
:
AMD Q
uark
---
---
[](
){
#quark }
[](
){
#quark }
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment