Update AMX.md

073ce601 · Atream · GitHub · 2bcdf10f · 073ce601
Unverified Commit 073ce601 authored Apr 29, 2025 by Atream Committed by GitHub Apr 29, 2025
Hide whitespace changes
Inline Side-by-side

Showing with 2 additions and 2 deletions

doc/en/AMX.md doc/en/AMX.md +2 -2

No files found.
--- a/doc/en/AMX.md
+++ b/doc/en/AMX.md
@@ -5,7 +5,7 @@ What excites me most about Qwen3MoE is that, unlike the 671 B “giant” model,

 Server CPU (Xeon 4) + RTX 4090

-Consumer-grade CPU (Core i9-14900KF + dual-channel DDR4-4000 MT/s) + RTX 4090
+Consumer-grade CPU (Core i9-14900KF + dual-channel DDR5-4000 MT/s) + RTX 4090

 The results are as follows:

@@ -170,4 +170,4 @@ KTransformers allows users to easily switch between different backends through s

 **Note:** Currently, using AMXInt8 requires reading weights from a BF16 GGUF file and performing online quantization during model loading. This may cause slightly slower load times. Future versions will provide pre-quantized weights to eliminate this overhead.

-![Image](https://github.com/user-attachments/assets/7c33c410-3af9-456f-aa67-5b24e19ba680)
\ No newline at end of file
+![Image](https://github.com/user-attachments/assets/7c33c410-3af9-456f-aa67-5b24e19ba680)