Commit de4884a1 authored by hepj987's avatar hepj987
Browse files

规范格式

parent 66940183
......@@ -53,11 +53,14 @@ SwiGLU 激活函数 [PaLM]。使用 SwiGLU 激活函数替换 ReLU 非线性以
```
docker pull image.sourcefind.cn:5000/dcu/admin/base/vscode-pytorch:1.10.0-centos7.6-dtk-22.10-py37-latest
docker run -dit --network=host --name=llama_inference_pytorch --privileged --device=/dev/kfd --device=/dev/dri --ipc=host --shm-size=16G --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -u root --ulimit stack=-1:-1 --ulimit memlock=-1:-1 image.sourcefind.cn:5000/dcu/admin/base/pytorch:1.10.0-centos7.6-dtk-23.04-py37-latest
docker exec -it llama_inference_pytorch /bin/bash
```
安装docker中没有的依赖
```
pip install -r requirements.txt -i https://mirrors.aliyun.com/pypi/simple/ --trusted-host mirrors.aliyun.com
pip install tensor_parallel==1.2.5 --no-dependencies
pip install transformers==4.28.1 sentencepiece==0.1.99
```
......
#模型名称
modelName=LLAMA_pytorch
#模型描述
modelDescription=基于Pytorch框架的tencentpretrain格式llama模型推理
#应用场景
apoScenar10=推理,nlp,文本生成,智能聊天助手
#框架类型
frameType=Pytorch,Transformers,Tensor_parallel
\ No newline at end of file
# 模型唯一标识
modelCode=405
# 模型名称
modelName=LLaMA_inference_pytorch
# 模型描述
modelDescription=基于tencentpretrain框架的llama模型推理
# 应用场景
appScenario=推理,inference,nlp,智能聊天助手
# 框架类型
frameType=Pytorch,Transformers,Deepspeed
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment