Files

mlmz 977f785dad Docs: Rewrite docs for LLama 405B and ModelSpace (#2773 )

Co-authored-by: Chayenne <zhaochen20@outlook.com>

2025-01-08 00:02:59 -08:00

722 B

Raw Blame History

Example: Run Llama 3.1 405B

# Run 405B (fp8) on a single node
python -m sglang.launch_server --model-path meta-llama/Meta-Llama-3.1-405B-Instruct-FP8 --tp 8

# Run 405B (fp16) on two nodes
## on the first node, replace the `172.16.4.52:20000` with your own first node ip address and port
python3 -m sglang.launch_server --model-path meta-llama/Meta-Llama-3.1-405B-Instruct --tp 16 --nccl-init-addr 172.16.4.52:20000 --nnodes 2 --node-rank 0

## on the first node, replace the `172.16.4.52:20000` with your own first node ip address and port
python3 -m sglang.launch_server --model-path meta-llama/Meta-Llama-3.1-405B-Instruct --tp 16 --nccl-init-addr 172.16.4.52:20000 --nnodes 2 --node-rank 1

722 B Raw Blame History

Example: Run Llama 3.1 405B

722 B

Raw Blame History