@@ -170,4 +170,4 @@ KTransformers allows users to easily switch between different backends through s
...
@@ -170,4 +170,4 @@ KTransformers allows users to easily switch between different backends through s
**Note:** Currently, using AMXInt8 requires reading weights from a BF16 GGUF file and performing online quantization during model loading. This may cause slightly slower load times. Future versions will provide pre-quantized weights to eliminate this overhead.
**Note:** Currently, using AMXInt8 requires reading weights from a BF16 GGUF file and performing online quantization during model loading. This may cause slightly slower load times. Future versions will provide pre-quantized weights to eliminate this overhead.