Merge pull request #1167 from kvcache-ai/update-llama4-tutorial-patch-1

update llama4 tutorial

Merge pull request #1167 from kvcache-ai/update-llama4-tutorial-patch-1
update llama4 tutorial
22a30d70 · Jianwei Dong · GitHub · 8770b6d5 · dfaf2b20 · 22a30d70
Unverified Commit 22a30d70 authored Apr 18, 2025 by Jianwei Dong Committed by GitHub Apr 18, 2025
Hide whitespace changes
Inline Side-by-side

Showing with 5 additions and 1 deletion

doc/en/llama4.md doc/en/llama4.md +5 -1

No files found.
--- a/doc/en/llama4.md
+++ b/doc/en/llama4.md
@@ -74,7 +74,11 @@ USE_BALANCE_SERVE=1 USE_NUMA=1 bash ./install.sh
 ### 4. Use our custom config.json
-Currently, it's needed to use our custom config.json(https://github.com/kvcache-ai/ktransformers/blob/support-llama4/doc/en/config.json) to replace your config.json in your `--model_path`.
+Currently, you need to copy the content of our custom config file into the `config.json` under your `--model_path`.  
+- Use [scout_config.json](https://github.com/kvcache-ai/ktransformers/blob/support-llama4/doc/en/scout_config.json) for the Llama-4-Scout-17B-16E model  
+- Use [maverick_config.json](https://github.com/kvcache-ai/ktransformers/blob/support-llama4/doc/en/maverick_config.json) for the Llama-4-Maverick-17B-128E model  
+Please make sure to replace the content of `config.json` with the appropriate one accordingly.
 ### 5. Run LLaMA 4 Inference Server