Unverified Commit 9d7cd629 authored by AllentDan's avatar AllentDan Committed by GitHub
Browse files

add demo gif (#63)

* add demo gif

* add demo gif
parent ec53d63f
...@@ -147,6 +147,8 @@ python3 lmdeploy/serve/client.py {server_ip_addresss}:33337 ...@@ -147,6 +147,8 @@ python3 lmdeploy/serve/client.py {server_ip_addresss}:33337
python3 lmdeploy/app.py {server_ip_addresss}:33337 {model_name} python3 lmdeploy/app.py {server_ip_addresss}:33337 {model_name}
``` ```
![](https://github.com/open-mmlab/lmdeploy/assets/41138331/f4352172-d8b1-49aa-b658-50ce72b896a5)
## User Guide ## User Guide
## Quantization ## Quantization
......
...@@ -146,6 +146,8 @@ python3 lmdeploy/serve/client.py {server_ip_addresss}:33337 ...@@ -146,6 +146,8 @@ python3 lmdeploy/serve/client.py {server_ip_addresss}:33337
python3 lmdeploy/app.py {server_ip_addresss}:33337 {model_name} python3 lmdeploy/app.py {server_ip_addresss}:33337 {model_name}
``` ```
![](https://github.com/open-mmlab/lmdeploy/assets/41138331/f4352172-d8b1-49aa-b658-50ce72b896a5)
## 量化部署 ## 量化部署
在 fp16 模式下,可以开启 kv_cache int8 量化,单卡可服务更多用户。 在 fp16 模式下,可以开启 kv_cache int8 量化,单卡可服务更多用户。
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment