chenzhuo / XVERSE-MoE-A4.2B
Commit 1a04978b, authored Apr 12, 2024 by chenzhuo
Upload New File
parent 256a0c5f
Pipeline #852 canceled
Showing 1 changed file with 13 additions and 0 deletions
sugon_readme.md (new file, 0 → 100644)
1. ***Container startup***
<br>
docker run -itd --name dock --privileged --network=host -v /soft:/soft -v /data:/test_data --device=/dev/kfd --device=/dev/dri --shm-size=16G --group-add video --ipc=host --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -u root --ulimit stack=-1:-1 --ulimit memlock=-1:-1 deepcoder:20240319 /usr/sbin/init
<br>
docker exec -it dock /bin/bash
<br>
cd /data/XVERSE-MoE-A4.2B
<br>
pip install -r requirements.txt
<br>
Link the model weights directory as weight (replace WEIGHT_DIR with its actual path):
<br>
ln -sf WEIGHT_DIR weight
<br>
CUDA_VISIBLE_DEVICES=0,1 python text_generation_demo.py --port 30137 --model_path weight --tokenizer_path weight
<br>
Open a browser and enter MACHINE_IP:30137 in the address bar (MACHINE_IP is the IP address of the machine) to use the demo.
<br>
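
Before starting the container or the demo, it can help to confirm that the paths the commands above rely on actually exist. The following is only a minimal sketch: `check_path` is a hypothetical helper and not part of this repository, the device nodes are the ones passed to `docker run` with `--device`, and `weight` is the link created by the `ln -sf` step. Run the device checks on the host and the `weight` check inside the container, from the repository directory.

```shell
#!/usr/bin/env bash
# Preflight sketch (hypothetical helper, not part of the repository).

# check_path PATH: print "ok: PATH" if PATH exists, otherwise print
# "missing: PATH" and return a non-zero status.
check_path() {
  if [ -e "$1" ]; then
    echo "ok: $1"
  else
    echo "missing: $1"
    return 1
  fi
}

# On the host, before docker run: the device nodes passed via --device.
check_path /dev/kfd || true
check_path /dev/dri || true

# Inside the container, from the repository directory: the weight link
# created with ln -sf. [ -e ] follows symlinks, so a dangling link is
# reported as missing.
check_path weight || true
```

Any line that starts with `missing:` points at a step to revisit: a device node that is absent on the host, or a `weight` link whose target does not exist.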
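2. ***vLLM is not supported yet***

Judging from the code, vLLM would need targeted modifications before it could support this model.

The extent of the model's changes looks substantial: not only the encoding but also the attention structure has been modified.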