Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
Qwen3-Reranker_pytorch
Commits
b3e14f53
Commit
b3e14f53
authored
Jun 11, 2025
by
chenych
Browse files
Fix bugs in README.
parent
580d5af7
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
8 additions
and
7 deletions
+8
-7
README.md
README.md
+8
-7
No files found.
README.md
View file @
b3e14f53
...
...
@@ -22,28 +22,29 @@ Qwen3嵌入模型系列是Qwen3家族最新的专有模型,专门为文本嵌
docker pull image.sourcefind.cn:5000/dcu/admin/base/custom:vllm0.8.5-ubuntu22.04-dtk25.04-rc7-das1.5-py3.10-20250521-fixpy-rocblas0521-beta2
docker run
-it
--shm-size
200g
--network
=
host
--name
{
docker_name
}
--privileged
--device
=
/dev/kfd
--device
=
/dev/dri
--device
=
/dev/mkfd
--group-add
video
--cap-add
=
SYS_PTRACE
--security-opt
seccomp
=
unconfined
-u
root
-v
/path/your_code_data/:/path/your_code_data/
-v
/opt/hyhal/:/opt/hyhal/:ro
{
imageID
}
bash
cd
/your_code_path/qwen3-
embedding
_pytorch
cd
/your_code_path/qwen3-
reranker
_pytorch
pip
install
transformers>
=
4.51.0
```
### Dockerfile(方法二)
```
bash
cd
docker
docker build
--no-cache
-t
qwen3-
embedding
:latest
.
docker build
--no-cache
-t
qwen3-
reranker
:latest
.
docker run
-it
--shm-size
200g
--network
=
host
--name
{
docker_name
}
--privileged
--device
=
/dev/kfd
--device
=
/dev/dri
--device
=
/dev/mkfd
--group-add
video
--cap-add
=
SYS_PTRACE
--security-opt
seccomp
=
unconfined
-u
root
-v
/path/your_code_data/:/path/your_code_data/
-v
/opt/hyhal/:/opt/hyhal/:ro
{
imageID
}
bash
cd
/your_code_path/qwen3-
embedding
_pytorch
cd
/your_code_path/qwen3-
reranker
_pytorch
pip
install
transformers>
=
4.51.0
```
### Anaconda(方法三)
关于本项目DCU显卡所需的特殊深度学习库可从
[
光合
](
https://developer.
hpccube.com
/tool/
)
开发者社区下载安装。
关于本项目DCU显卡所需的特殊深度学习库可从
[
光合
](
https://developer.
sourcefind.cn
/tool/
)
开发者社区下载安装。
```
bash
DTK: 25.04
python: 3.10
vllm: 0.8.5
torch: 2.4.1+das.opt2.dtk2504
deepspeed: 0.14.2+das.opt2.dtk2504
vllm: 0.8.5
```
`Tips:以上dtk驱动、python、torch等DCU相关工具版本需要严格一一对应`
...
...
@@ -73,7 +74,7 @@ python infer_vllm.py --model_name_or_path /path/your_model_path/
</div>
### 精度
暂无
DCU与GPU精度一致,推理框架:pytorch。
## 应用场景
### 算法类别
...
...
@@ -91,4 +92,4 @@ python infer_vllm.py --model_name_or_path /path/your_model_path/
-
https://developer.sourcefind.cn/codes/modelzoo/qwen3-reranker_pytorch
## 参考资料
-
https://github.com/QwenLM/
Q
wen3-
Embedding
-
https://github.com/QwenLM/
q
wen3-
reranker
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment