Commit 1cb90516 authored by chenych's avatar chenych
Browse files

update url

parent 282b9f04
......@@ -47,8 +47,8 @@ transformers: 4.56.1
```
`Tips:以上dtk驱动、pytorch等DCU相关工具版本需要严格一一对应`,其他包安装如下:
```bash
wget http://10.16.4.1:8000/customized/vllm/dtk25.04.1/v0.9.2-dsv32/v0.9.2-dsv32-cf360956/vllm-0.9.2%2Bdas.opt1.rc2.dtk25041-cp310-cp310-manylinux_2_28_x86_64.whl
pip install vllm-0.9.2+das.opt1.rc2.dtk25041-cp310-cp310-manylinux_2_28_x86_64.whl
wget http://112.11.119.99:18000/temp/vllm-0.9.2%2Bdas.opt1.rc2.51af08a.dtk25041-cp310-cp310-linux_x86_64.whl
pip install vllm-0.9.2+das.opt1.rc2.51af08a.dtk25041-cp310-cp310-linux_x86_64.whl
```
## 数据集
......@@ -139,7 +139,7 @@ ray start --address='x.x.x.x:6379' --num-gpus=8 --num-cpus=32
> intel cpu 需要加参数:`--enforce-eager`
```bash
vllm serve deepseek-ai/DeepSeek-V3.2-Exp \
vllm serve /path/to/DeepSeek-V3.2-Exp-bf16 \
--trust-remote-code \
--distributed-executor-backend ray \
--dtype bfloat16 \
......@@ -155,7 +155,7 @@ vllm serve deepseek-ai/DeepSeek-V3.2-Exp \
curl http://127.0.0.1:8001/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "deepseek-ai/DeepSeek-V3.2-Exp",
"model": /path/to/DeepSeek-V3.2-Exp-bf16",
"messages": [
{
"role": "user",
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment