## MATH-500 dataset
- https://www.modelscope.cn/datasets/AI-ModelScope/MATH-500

## vLLM script modifications

- Replace serve.py (the original is kept as serve.py.bak)
```
mv /usr/local/lib/python3.10/dist-packages/vllm/benchmarks/serve.py /usr/local/lib/python3.10/dist-packages/vllm/benchmarks/serve.py.bak

cp ./utils/vllm-benchmarks/serve.py /usr/local/lib/python3.10/dist-packages/vllm/benchmarks/
```

- Replace datasets.py (the original is kept as datasets.py.bak)
```
mv /usr/local/lib/python3.10/dist-packages/vllm/benchmarks/datasets.py /usr/local/lib/python3.10/dist-packages/vllm/benchmarks/datasets.py.bak

cp ./utils/vllm-benchmarks/datasets.py /usr/local/lib/python3.10/dist-packages/vllm/benchmarks/
```

## evalscope modifications
- Replace evaluator.py (the original is kept as evaluator.py.bak)
```
cp /usr/local/lib/python3.10/dist-packages/evalscope/evaluator/evaluator.py /usr/local/lib/python3.10/dist-packages/evalscope/evaluator/evaluator.py.bak

cp ./utils/evalscope/evaluator.py /usr/local/lib/python3.10/dist-packages/evalscope/evaluator/
```
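
The three backup-and-replace steps above (serve.py, datasets.py, evaluator.py) hardcode a Python 3.10 `dist-packages` path, and re-running the `mv`/`cp` backup commands would overwrite the `.bak` file with the already patched copy. The following is a minimal sketch of one way to make them repeatable. `backup_and_replace` and `pkg_dir` are hypothetical helpers, not part of this repository, and the sketch assumes `python3` is the interpreter that has vllm and evalscope installed.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of this repository): back up an installed
# file once, then overwrite it with the patched copy.
backup_and_replace() {
    local patched="$1"   # patched file shipped under ./utils/
    local target="$2"    # installed file to be replaced
    if [ ! -f "$patched" ]; then
        echo "patched file not found: $patched" >&2
        return 1
    fi
    # Keep only the first backup so that re-running never overwrites
    # the pristine original with an already patched file.
    if [ -f "$target" ] && [ ! -e "$target.bak" ]; then
        cp -p "$target" "$target.bak"
    fi
    cp "$patched" "$target"
}

# Hypothetical helper: print the install directory of a Python package
# instead of hardcoding /usr/local/lib/python3.10/dist-packages.
# Assumes `python3` is the interpreter that has the package installed.
pkg_dir() {
    python3 -c 'import importlib.util, os, sys
spec = importlib.util.find_spec(sys.argv[1])
print(os.path.dirname(spec.origin))' "$1"
}

# Example usage (commented out; file locations follow the commands above):
# VLLM_DIR="$(pkg_dir vllm)"
# EVALSCOPE_DIR="$(pkg_dir evalscope)"
# backup_and_replace ./utils/vllm-benchmarks/serve.py    "$VLLM_DIR/benchmarks/serve.py"
# backup_and_replace ./utils/vllm-benchmarks/datasets.py "$VLLM_DIR/benchmarks/datasets.py"
# backup_and_replace ./utils/evalscope/evaluator.py      "$EVALSCOPE_DIR/evaluator/evaluator.py"
```

To undo a replacement, copy the `.bak` file back over the installed file.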

## Start the vLLM service
- bash vllm_serve.sh
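
A client cannot reach the service until model loading has finished, so it can help to wait for the server before running anything against it. The sketch below is a generic retry helper. `wait_until` is a hypothetical function, not part of this repository, and the port `8000` and the `/health` route in the usage comment are assumptions that should be matched to the settings in `vllm_serve.sh`.

```shell
#!/usr/bin/env bash
# Hypothetical helper: retry a command once per second until it succeeds
# or the timeout (in seconds) has elapsed.
wait_until() {
    local timeout="$1"; shift
    local waited=0
    until "$@" >/dev/null 2>&1; do
        if [ "$waited" -ge "$timeout" ]; then
            echo "timed out after ${timeout}s: $*" >&2
            return 1
        fi
        sleep 1
        waited=$((waited + 1))
    done
}

# Example usage (commented out). Port 8000 and the /health route are
# assumptions; adjust them to whatever vllm_serve.sh configures.
# bash vllm_serve.sh &
# wait_until 600 curl -sf http://localhost:8000/health
```

The function returns a non-zero status on timeout, so it can be chained with `&&` in front of a later step.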

## Save performance and accuracy results

- bash run_benchmarks.sh

## Dataset conversion
```
cd tools
bash run_convert.sh
```

## Run the accuracy evaluation
```
cd tools
bash evalscope_test.sh
```