Unverified Commit 346d2022 authored by Atream's avatar Atream Committed by GitHub
Browse files

Update llama4.md

parent a46c43b2
...@@ -66,7 +66,11 @@ USE_BALANCE_SERVE=1 bash ./install.sh ...@@ -66,7 +66,11 @@ USE_BALANCE_SERVE=1 bash ./install.sh
USE_BALANCE_SERVE=1 USE_NUMA=1 bash ./install.sh USE_BALANCE_SERVE=1 USE_NUMA=1 bash ./install.sh
``` ```
### 4. Run LLaMA 4 Inference Server ### 4. Use our custom config.json
Currently, it's needed to use our custom config.json(https://github.com/kvcache-ai/ktransformers/blob/support-llama4/doc/en/config.json) to replace your config.json in your `--model_path`.
### 5. Run LLaMA 4 Inference Server
Make sure you have: Make sure you have:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment