OpenDAS / Lmdeploy
Commit 6939e47c, authored Nov 16, 2023 by xiabo
Update README.md
Parent: 6df4a6ac
Showing 1 changed file with 26 additions and 0 deletions
README.md (+26 -0), viewed at 6939e47c
...
...
@@ -86,6 +86,32 @@ python3 -m lmdeploy.turbomind.chat ./workspace_intern
python3 -m lmdeploy.serve.gradio.app ./workspace_intern 10.6.10.67
# Then open 10.6.10.67:6006 in a web browser
```
### Deploying the [baichuan](https://huggingface.co/baichuan-inc) service
Download a baichuan model from [here](https://huggingface.co/baichuan-inc), then deploy the service with the commands below.
Using the 7B model as an example:
```
# 1. Convert the model
python3 -m lmdeploy.serve.turbomind.deploy baichuan2-7b-chat baichuan2-7b-chat hf baichuan2-7b-chat/tokenizer.model ./workspace_baichuan
# 2. Run
# - In the command-line interface:
python3 -m lmdeploy.turbomind.chat ./workspace_baichuan
# - As a web service (Gradio):
python3 -m lmdeploy.serve.gradio.app ./workspace_baichuan 10.6.10.67
# Then open 10.6.10.67:6006 in a web browser
```
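The conversion step above can also be scripted. The following is a minimal sketch and not part of the original README: `build_deploy_cmd` is a hypothetical helper, and its parameter names (`model_name`, `model_path`, `model_format`, `tokenizer_path`, `dst_path`) are assumed labels for the five positional arguments, in the order they appear in the command above. It only assembles the argument list; the actual run is left commented out.

```python
import subprocess
import sys


def build_deploy_cmd(model_name, model_path, model_format, tokenizer_path, dst_path):
    """Assemble the model-conversion command as an argument list.

    The parameter names are assumed labels for the positional arguments
    shown in the README command; they are not taken from lmdeploy itself.
    """
    return [
        sys.executable, "-m", "lmdeploy.serve.turbomind.deploy",
        model_name, model_path, model_format, tokenizer_path, dst_path,
    ]


# Values copied from the baichuan 7B example in the README.
cmd = build_deploy_cmd(
    "baichuan2-7b-chat",
    "baichuan2-7b-chat",
    "hf",
    "baichuan2-7b-chat/tokenizer.model",
    "./workspace_baichuan",
)
print(" ".join(cmd[1:]))
# subprocess.run(cmd, check=True)  # uncomment to actually run the conversion
```

Passing the command as a list avoids shell quoting problems if a model directory contains spaces.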
### Deploying the [qwen](https://huggingface.co/Qwen) service
Download a qwen model from [here](https://huggingface.co/Qwen), then deploy the service with the commands below.
Using the 7B model as an example:
```
# 1. Convert the model
python3 -m lmdeploy.serve.turbomind.deploy qwen-7b qwen-7b-chat qwen qwen-7b-chat/tokenizer.model ./workspace_qwen
# 2. Run
# - In the command-line interface:
python3 -m lmdeploy.turbomind.chat ./workspace_qwen
# - As a web service (Gradio):
python3 -m lmdeploy.serve.gradio.app ./workspace_qwen 10.6.10.67
# Then open 10.6.10.67:6006 in a web browser
```
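The two run modes above differ only in the module name and the extra server address. As a rough sketch (not from the original README), the hypothetical helpers `build_run_cmd` and `service_url` below pick between them. The `http://` scheme is an assumption; the README only gives the address and port `10.6.10.67:6006`.

```python
import sys


def build_run_cmd(workspace, server_ip=None):
    """Build the run command for a converted workspace.

    Without server_ip this is the command-line chat; with server_ip it is
    the Gradio web service, mirroring the two README commands.
    """
    if server_ip is None:
        return [sys.executable, "-m", "lmdeploy.turbomind.chat", workspace]
    return [sys.executable, "-m", "lmdeploy.serve.gradio.app", workspace, server_ip]


def service_url(server_ip, port=6006):
    """Address to open in a browser; the http:// scheme is assumed."""
    return f"http://{server_ip}:{port}"


chat_cmd = build_run_cmd("./workspace_qwen")
web_cmd = build_run_cmd("./workspace_qwen", "10.6.10.67")
print(" ".join(chat_cmd[1:]))
print(" ".join(web_cmd[1:]))
print(service_url("10.6.10.67"))
```

Either list can be passed to `subprocess.run` once the workspace has been produced by the conversion step.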
### For details, see the [docs](./docs/zh_cn/serving.md)
## Checking the version
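- `python -c "import lmdeploy; print(lmdeploy.__version__)"` prints this package's version number, which is kept in sync with the official release, for example 0.0.6. The `print` call is needed because `python -c` does not echo the value of a bare expression.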
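The same check can be done from a script. This is a minimal sketch under two assumptions: `get_lmdeploy_version` and `parse_version` are hypothetical helper names, and the version string is assumed to be purely numeric and dot-separated like the `0.0.6` example (a build suffix would need extra handling).

```python
import importlib.util


def get_lmdeploy_version():
    """Return lmdeploy.__version__, or None if lmdeploy is not installed."""
    if importlib.util.find_spec("lmdeploy") is None:
        return None
    import lmdeploy
    return lmdeploy.__version__


def parse_version(version):
    """Split a numeric dotted version such as '0.0.6' into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


version = get_lmdeploy_version()
if version is None:
    print("lmdeploy is not installed")
else:
    print("lmdeploy version:", version)
```

Tuples compare element by element, so `parse_version(a) >= parse_version(b)` gives a numeric comparison where plain string comparison would misorder versions such as `0.0.10` and `0.0.6`.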
...
...