Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
change
sglang
Commits
c8a9e791
Unverified
Commit
c8a9e791
authored
Aug 28, 2024
by
Dr. Artificial曾小健
Committed by
GitHub
Aug 27, 2024
Browse files
Fix readme (#1236)
parent
3602692c
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
0 deletions
+4
-0
README.md
README.md
+4
-0
No files found.
README.md
View file @
c8a9e791
...
@@ -83,6 +83,7 @@ docker run --gpus all \
...
@@ -83,6 +83,7 @@ docker run --gpus all \
### Method 4: Using docker compose
### Method 4: Using docker compose
<details>
<details>
<summary>
More
</summary>
> This method is recommended if you plan to serve it as a service.
> This method is recommended if you plan to serve it as a service.
> A better approach is to use the [k8s-sglang-service.yaml](./docker/k8s-sglang-service.yaml).
> A better approach is to use the [k8s-sglang-service.yaml](./docker/k8s-sglang-service.yaml).
...
@@ -94,6 +95,7 @@ docker run --gpus all \
...
@@ -94,6 +95,7 @@ docker run --gpus all \
### Method 5: Run on Kubernetes or Clouds with SkyPilot
### Method 5: Run on Kubernetes or Clouds with SkyPilot
<details>
<details>
<summary>
More
</summary>
To deploy on Kubernetes or 12+ clouds, you can use
[
SkyPilot
](
https://github.com/skypilot-org/skypilot
)
.
To deploy on Kubernetes or 12+ clouds, you can use
[
SkyPilot
](
https://github.com/skypilot-org/skypilot
)
.
...
@@ -262,6 +264,7 @@ Instructions for supporting a new model are [here](https://github.com/sgl-projec
...
@@ -262,6 +264,7 @@ Instructions for supporting a new model are [here](https://github.com/sgl-projec
#### Use Models From ModelScope
#### Use Models From ModelScope
<details>
<details>
<summary>
More
</summary>
To use a model from
[
ModelScope
](
https://www.modelscope.cn
)
, set the environment variable SGLANG_USE_MODELSCOPE.
To use a model from
[
ModelScope
](
https://www.modelscope.cn
)
, set the environment variable SGLANG_USE_MODELSCOPE.
```
```
...
@@ -276,6 +279,7 @@ SGLANG_USE_MODELSCOPE=true python -m sglang.launch_server --model-path qwen/Qwen
...
@@ -276,6 +279,7 @@ SGLANG_USE_MODELSCOPE=true python -m sglang.launch_server --model-path qwen/Qwen
#### Run Llama 3.1 405B
#### Run Llama 3.1 405B
<details>
<details>
<summary>
More
</summary>
```
bash
```
bash
# Run 405B (fp8) on a single node
# Run 405B (fp8) on a single node
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment