# Run Llama 3.1 405B

## Run 405B (fp8) on a Single Node

```bash
python -m sglang.launch_server --model-path meta-llama/Meta-Llama-3.1-405B-Instruct-FP8 --tp 8
```
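A back-of-the-envelope estimate suggests why the fp8 checkpoint fits on one 8-GPU node. The sketch below counts weight memory only and ignores KV cache, activations, and runtime overhead. The 80 GB per GPU figure and the helper names `weights_gb` and `gpus_needed` are assumptions for illustration, not something this document or SGLang specifies.

```bash
# Rough weight-memory estimate; real usage is higher (KV cache, activations, overhead).
# Assumption: 80 GB of memory per GPU.

weights_gb() {
  # $1 = parameters in billions, $2 = bytes per parameter
  echo $(( $1 * $2 ))
}

gpus_needed() {
  # $1 = weight size in GB, $2 = GB per GPU; integer division rounded up
  echo $(( ($1 + $2 - 1) / $2 ))
}

fp8_gb=$(weights_gb 405 1)    # 1 byte per parameter
fp16_gb=$(weights_gb 405 2)   # 2 bytes per parameter

echo "fp8:  ${fp8_gb} GB of weights, at least $(gpus_needed "$fp8_gb" 80) GPUs"
echo "fp16: ${fp16_gb} GB of weights, at least $(gpus_needed "$fp16_gb" 80) GPUs"
```

Under these assumptions the fp8 weights need at least 6 GPUs, which leaves headroom on an 8-GPU node. The fp16 weights need at least 11, more than a single 8-GPU node provides.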

## Run 405B (fp16) on Two Nodes

```bash
# On the first node: replace 172.16.4.52:20000 with your first node's IP address and port
python3 -m sglang.launch_server --model-path meta-llama/Meta-Llama-3.1-405B-Instruct --tp 16 --dist-init-addr 172.16.4.52:20000 --nnodes 2 --node-rank 0

# On the second node: pass the first node's IP address and port (172.16.4.52:20000 here), not the second node's own
python3 -m sglang.launch_server --model-path meta-llama/Meta-Llama-3.1-405B-Instruct --tp 16 --dist-init-addr 172.16.4.52:20000 --nnodes 2 --node-rank 1
```
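
To keep the two commands in sync, the shared settings can live in variables so that only the node rank differs per machine. This is a minimal sketch, assuming the flag is spelled `--dist-init-addr` and that every node passes the same address (the first node's). `MODEL`, `HEAD_ADDR`, `NNODES`, `TP`, and `launch_cmd` are illustrative names, not part of SGLang.

```bash
# Sketch: build the launch command for one node from shared settings.
# All variable and function names here are hypothetical helpers.
MODEL="meta-llama/Meta-Llama-3.1-405B-Instruct"
HEAD_ADDR="172.16.4.52:20000"   # first node's IP address and port; replace with your own
NNODES=2
TP=16

launch_cmd() {
  # $1 = rank of this node (0 for the first node, 1 for the second)
  echo "python3 -m sglang.launch_server --model-path ${MODEL} --tp ${TP} --dist-init-addr ${HEAD_ADDR} --nnodes ${NNODES} --node-rank $1"
}

launch_cmd 0   # prints the command for the first node
launch_cmd 1   # prints the command for the second node
```

The function only prints the command. Printing lets you inspect the line before running it on each node.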