ModelZoo / llama_inference_pytorch
Commit de4884a1, authored Sep 18, 2023 by hepj987
Standardize formatting
Parent: 66940183
Showing 2 changed files with 13 additions and 8 deletions:
README.md (+3, -0)
model.properties (+10, -8)
README.md
...
@@ -53,11 +53,14 @@ SwiGLU activation function [PaLM]. Use the SwiGLU activation function in place of the ReLU nonlinearity to
 ```
 docker pull image.sourcefind.cn:5000/dcu/admin/base/vscode-pytorch:1.10.0-centos7.6-dtk-22.10-py37-latest
+docker run -dit --network=host --name=llama_inference_pytorch --privileged --device=/dev/kfd --device=/dev/dri --ipc=host --shm-size=16G --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -u root --ulimit stack=-1:-1 --ulimit memlock=-1:-1 image.sourcefind.cn:5000/dcu/admin/base/pytorch:1.10.0-centos7.6-dtk-23.04-py37-latest
+docker exec -it llama_inference_pytorch /bin/bash
 ```
 Install the dependencies that are not included in the Docker image
 ```
+pip install -r requirements.txt -i https://mirrors.aliyun.com/pypi/simple/ --trusted-host mirrors.aliyun.com
 pip install tensor_parallel==1.2.5 --no-dependencies
 pip install transformers==4.28.1 sentencepiece==0.1.99
 ```
...
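The README pins three package versions with `pip install`. As a rough sketch (not part of the commit), the check below compares installed versions against those pins from inside the container. The helper name `check_pins` and the `PINS` table are hypothetical; on Python 3.7 (the `py37` image tag) it assumes the `importlib_metadata` backport is installed, and it assumes each distribution name matches the name passed to `pip`.

```python
try:
    from importlib import metadata  # standard library on Python 3.8+
except ImportError:
    import importlib_metadata as metadata  # backport; assumed present on Python 3.7

# Versions pinned in the README hunk above.
PINS = {
    "tensor_parallel": "1.2.5",
    "transformers": "4.28.1",
    "sentencepiece": "0.1.99",
}


def check_pins(pins, lookup=None):
    """Return {name: (wanted, found)} for every package whose version differs.

    `lookup` maps a distribution name to its installed version string and
    defaults to metadata.version; `found` is None if the package is missing.
    """
    lookup = lookup or metadata.version
    mismatches = {}
    for name, wanted in pins.items():
        try:
            found = lookup(name)
        except metadata.PackageNotFoundError:
            found = None
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, found) in check_pins(PINS).items():
        print(f"{name}: expected {wanted}, found {found}")
```

An empty result means every pinned package is installed at the expected version.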
model.properties
-# Model name
-modelName=LLAMA_pytorch
-# Model description
-modelDescription=llama model inference in tencentpretrain format, based on the Pytorch framework
-# Application scenario
-apoScenar10=inference,nlp,text generation,intelligent chat assistant
-# Framework type
-frameType=Pytorch,Transformers,Tensor_parallel
\ No newline at end of file
+# Unique model identifier
+modelCode=405
+# Model name
+modelName=LLaMA_inference_pytorch
+# Model description
+modelDescription=llama model inference based on the tencentpretrain framework
+# Application scenario
+appScenario=inference,inference,nlp,intelligent chat assistant
+# Framework type
+frameType=Pytorch,Transformers,Deepspeed
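The new `model.properties` uses plain `key=value` lines with `#` comments. The commit does not show how the file is consumed, so the following is only a minimal sketch of reading it, assuming no escapes or line continuations; `parse_properties` and `SAMPLE` are hypothetical names used for illustration.

```python
def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and '#' comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if not sep:
            continue  # ignore lines that are not key=value pairs
        props[key.strip()] = value.strip()
    return props


# A few of the keys from the new side of the diff.
SAMPLE = """\
# Unique model identifier
modelCode=405
# Model name
modelName=LLaMA_inference_pytorch
# Framework type
frameType=Pytorch,Transformers,Deepspeed
"""

props = parse_properties(SAMPLE)
frameworks = props["frameType"].split(",")  # comma-separated tag list
```

Values stay strings here (`modelCode` is `"405"`, not an integer); any type conversion is left to the caller.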