update readme

4f8680ee · zhuwenwen · 460e2662 · 4f8680ee · 4f8680ee
Commit 4f8680ee authored Aug 30, 2024 by zhuwenwen
Hide whitespace changes
Inline Side-by-side

Showing with 5 additions and 7 deletions

README.md README.md +5 -7

icon.png icon.png +0 -0

No files found.
--- a/README.md
+++ b/README.md
@@ -2,7 +2,7 @@
 * @Author: zhuww
 * @email: zhuww@sugon.com
 * @Date: 2024-06-13 14:38:07
- * @LastEditTime: 2024-06-20 08:35:01
+ * @LastEditTime: 2024-08-30 09:35:01
 -->
 ## 论文
@@ -22,7 +22,6 @@ ChatGLM-6B 是清华大学开源的开源的、支持中英双语的对话语言
 | 模型名称    | 隐含层维度 | 层数 | 头数 | 词表大小 | 位置编码 | 最大序列长度 |
 | ----------- | ---------- | ---- | ---- | -------- | -------- | ------------ |
-| ChatGLM-6B | 4096       | 28   | 32   | 130528    | RoPE     | 2048         |
 | ChatGLM2-6B | 4096       | 28   | 32   | 65024    | RoPE     | 8192         |
 | ChatGLM3-6B | 4096       | 28   | 32   | 65024    | RoPE     | 8192         |
 | glm-4-9b | 4096       | 40   | 32   | 151552    | RoPE     | 131072         |
@@ -57,7 +56,7 @@ pip install aiohttp==3.9.1 outlines==0.0.37 openai==1.23.3 -i http://mirrors.ali
 # <Host Path>主机端路径
 # <Container Path>容器映射路径
 docker build -t chatglm:latest .
-docker run -it --name chatglm_vllm --privileged --shm-size=64G  --device=/dev/kfd --device=/dev/dri/ --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --ulimit memlock=-1:-1 --ipc=host --network host --group-add video -v /opt/hyhal:/opt/hyhal -v <Host Path>:<Container Path> llama:latest /bin/bash
+docker run -it --name chatglm_vllm --privileged --shm-size=64G  --device=/dev/kfd --device=/dev/dri/ --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --ulimit memlock=-1:-1 --ipc=host --network host --group-add video -v /opt/hyhal:/opt/hyhal:ro -v <Host Path>:<Container Path> llama:latest /bin/bash
 ```
 `Tips：若在K100/Z100L上使用，需要替换flash_attn，下载链接：https://forum.hpccube.com/thread/515`
@@ -67,7 +66,7 @@ conda create -n chatglm_vllm python=3.10
 pip install aiohttp==3.9.1 outlines==0.0.37 openai==1.23.3
 ```
 关于本项目DCU显卡所需的特殊深度学习库可从[光合](https://developer.hpccube.com/tool/)开发者社区下载安装。
-* DTK驱动：dtk24.04
+* DTK驱动：dtk24.04.1
 * Pytorch: 2.1.0
 * triton:2.1.0
 * vllm: 0.3.3
@@ -97,9 +96,8 @@ cd dist && pip install vllm*
 | chat模型                                                                        | 长文本模型                                                                                | 
 | ------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------- | 
-| [chatglm-6b](https://huggingface.co/THUDM/chatglm-6b) |  
+| [chatglm2-6b](http://113.200.138.88:18080/aimodels/chatglm2-6b) | [chatglm2-6b-32k](https://huggingface.co/THUDM/chatglm2-6b-32k) | 
-| [chatglm2-6b](https://huggingface.co/THUDM/chatglm2-6b) | [chatglm2-6b-32k](https://huggingface.co/THUDM/chatglm2-6b-32k) | 
+| [chatglm3-6b](http://113.200.138.88:18080/aimodels/chatglm3-6b)  | [chatglm3-6b-32k](http://113.200.138.88:18080/aimodels/chatglm3-6b-32k) | 
-| [chatglm3-6b](https://huggingface.co/THUDM/chatglm3-6b)  | [chatglm3-6b-32k](https://huggingface.co/THUDM/chatglm3-6b-32k) | 
 | [glm-4-9b-chat](https://huggingface.co/THUDM/glm-4-9b-chat) | 

--- a/icon.png
+++ b/icon.png