Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
Lmdeploy
Commits
76ae8627
"tools/run_text_generation_server.py" did not exist on "64a83fb5882ff2a3d0e05bee5bed78281895c13b"
Unverified
Commit
76ae8627
authored
Jul 06, 2023
by
pppppM
Committed by
GitHub
Jul 06, 2023
Browse files
update zh readme (#74)
parent
74a4f3c9
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
3 deletions
+3
-3
README_zh-CN.md
README_zh-CN.md
+3
-3
No files found.
README_zh-CN.md
View file @
76ae8627
...
...
@@ -38,7 +38,7 @@ LMDeploy 由 [MMDeploy](https://github.com/open-mmlab/mmdeploy) 和 [MMRazor](ht
-
**persistent batch 推理**
:进一步优化模型执行效率。
!
[
PersistentBatchInference
](
https://github.com/
open-mmlab
/lmdeploy/assets/
25839884/8f8b57b8-42af-4b71-ad74-e75f39b10694
)
!
[
PersistentBatchInference
](
https://github.com/
InternLM
/lmdeploy/assets/
67539920/e3876167-0671-44fc-ac52-5a0f9382493e
)
## 性能
...
...
@@ -52,7 +52,7 @@ LMDeploy 由 [MMDeploy](https://github.com/open-mmlab/mmdeploy) 和 [MMRazor](ht
TurboMind 的吞吐量超过 2000 token/s, 整体比 DeepSpeed 提升约 5% - 15%,比 huggingface transformers 提升 2.3 倍


## 快速上手
...
...
@@ -117,7 +117,7 @@ python3 lmdeploy.serve.client {server_ip_addresss}:33337 internlm
python3 lmdeploy.app {server_ip_addresss}:33337 internlm
```


其他模型的部署方式,比如 LLaMA,vicuna,请参考
[
这里
](
docs/zh_cn/serving.md
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment