README.md 3.97 KB
Newer Older
jerrrrry's avatar
jerrrrry committed
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
# 🚀 一键启动工具集 | EasyStart v0.1

![Banner](https://placehold.co/1200x400/2a4365/white?text=EasyStart-AI-Testing-Toolkit) 

> 一个集成了环境测试、模型下载与推理验证的自动化工具包

## 🌟 核心功能

| 功能模块 | 适用场景 | 特色亮点 |
|---------|----------|----------|
| **1. 环境测试** | 基础硬件验证 | ROCm带宽测试/ACS监控/全栈状态检查 |
| **2. 环境测试+模型下载+推理** | 交付验证 | 自动下载Modelscope模型/批量测试 |
| **3. 环境测试+批量推理** | 压力测试 | 支持本地模型/Docker集成 |

## 🛠️ 快速开始

### 1️⃣ 环境测试

```bash
git clone https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1.git
cd easystart_v0.1/
bash start.sh

测试项目包含:

rocm_bandwidth_test

Rccl 4卡&8卡带宽测试

贵哥发版的dcu_env_check

ACS监控

CPU&DCU状态

存储&内存状态

网络状态

测试结果保存在 ./outputs/env\_check\_outputs
![图片描述](images/1.png)




























jerrrrry's avatar
jerrrrry committed
70
71
72
73
74
75
76
77
78
79
80
81
82
# 🚀 一键启动工具集 | EasyStart v0.1

![Banner](https://placehold.co/1200x400/2a4365/white?text=EasyStart-AI-Testing-Toolkit) <!-- 建议替换为实际项目横幅图 -->

## 🌟 核心功能

| 功能模块 | 适用场景 | 特色亮点 |
|---------|----------|----------|
| **1. 环境测试** | 基础硬件验证 | ROCm带宽测试/ACS监控/全栈状态检查 |
| **2. 环境测试+模型下载+推理** | 交付验证 | 自动下载Modelscope模型/批量测试 |
| **3. 环境测试+批量推理** | 压力测试 | 支持本地模型/Docker集成 |

## 🛠️ 快速开始
jerrrrry's avatar
jerrrrry committed
83
### 1️⃣一键启动环境测试
jerrrrry's avatar
jerrrrry committed
84
85
86
87

```bash
git clone https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1.git
cd easystart_v0.1/
jerrrrry's avatar
jerrrrry committed
88
bash start.sh
jerrrrry's avatar
jerrrrry committed
89

jerrrrry's avatar
jerrrrry committed
90
91
   测试项包含:

jerrrrry's avatar
jerrrrry committed
92
   1:rocm\_bandwidth\_test
jerrrrry's avatar
jerrrrry committed
93

jerrrrry's avatar
jerrrrry committed
94
   2:Rccl 4卡&8卡带宽测试
jerrrrry's avatar
jerrrrry committed
95

jerrrrry's avatar
jerrrrry committed
96
   3:贵哥发版的dcu\_env\_check
jerrrrry's avatar
jerrrrry committed
97

jerrrrry's avatar
jerrrrry committed
98
   4:ACS监控
jerrrrry's avatar
jerrrrry committed
99

jerrrrry's avatar
jerrrrry committed
100
101
102
103
104
105
106
107
108
   5:CPU&DCU状态

   6:存储&内存状态

   7:网络状态

   <a name="heading_1"></a>**测试结果**

   测试结果保存在 ./outputs/env\_check\_outputs
jerrrrry's avatar
jerrrrry committed
109
   ![图片描述](images/1.png)
jerrrrry's avatar
jerrrrry committed
110

jerrrrry's avatar
jerrrrry committed
111
112
113
114
115





jerrrrry's avatar
jerrrrry committed
116
117
118
119
   <a name="heading_2"></a>**2\_env\_check&model\_download&llm\_inference**

   一键启动环境测试+模型下载+llm推理:

jerrrrry's avatar
jerrrrry committed
120
   https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1/-/tree/main/2_env_check%26model_download%26llm_inference
jerrrrry's avatar
jerrrrry committed
121

jerrrrry's avatar
jerrrrry committed
122
   |git clone https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1.git<br>cd 2\_env\_check&model\_download&llm\_inference/<br>bash start.sh|
jerrrrry's avatar
jerrrrry committed
123
124
   | :- |

jerrrrry's avatar
jerrrrry committed
125
126
   ![图片描述](images/2.png)

jerrrrry's avatar
jerrrrry committed
127
128
129
130
   只需将要测试的模型ID**(对应modelscope的模型ID)**传入**download-list.cfg**

   <a name="heading_3"></a>**Tips**

jerrrrry's avatar
jerrrrry committed
131
   1:download-list.cfg里的格式为:**模型ID;本地保存路径**
jerrrrry's avatar
jerrrrry committed
132

jerrrrry's avatar
jerrrrry committed
133
134
   ![图片描述](images/3.png)

jerrrrry's avatar
jerrrrry committed
135
   2:可以写入多个模型下载且**会进行批量测试**
jerrrrry's avatar
jerrrrry committed
136

jerrrrry's avatar
jerrrrry committed
137
   3:模型测试参数通过**model\_to\_test.cfg**传入,**需注意model\_to\_test.cfg的传参格式**
jerrrrry's avatar
jerrrrry committed
138

jerrrrry's avatar
jerrrrry committed
139
140
   ![图片描述](images/4.png)

jerrrrry's avatar
jerrrrry committed
141
142
143
144
145
146
   <a name="heading_4"></a>**测试结果**

   测试结果保存在 ./outputs/env\_check\_outputs和 ./outputs/inference\_outputs

   下载的模型会保存在 ./outputs/models

jerrrrry's avatar
jerrrrry committed
147
148
   ![图片描述](images/5.png)

jerrrrry's avatar
jerrrrry committed
149
150
   <a name="heading_5"></a>**推理结果**

jerrrrry's avatar
jerrrrry committed
151
152
   ![图片描述](images/6.png)

jerrrrry's avatar
jerrrrry committed
153
154
   <a name="heading_6"></a>**测试日志**

jerrrrry's avatar
jerrrrry committed
155
156
   ![图片描述](images/7.png)

jerrrrry's avatar
jerrrrry committed
157
158
159
160
   <a name="heading_7"></a>**3\_env\_check&batches\_llm\_inference**

   一键启动环境测试+批量llm推理:

jerrrrry's avatar
jerrrrry committed
161
   https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1/-/tree/main/3_env_check%26batches_llm_inference
jerrrrry's avatar
jerrrrry committed
162

jerrrrry's avatar
jerrrrry committed
163
   |git clone https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1.git<br>cd 3\_env\_check&batches\_llm\_inference/<br>bash start.sh|
jerrrrry's avatar
jerrrrry committed
164
165
166
167
168
169
170
171
172
173
   | :- |

   **只需在start.sh中挂载本地大模型到docker里** 

   **修改model\_to\_test.cfg里的测试参数**

   <a name="heading_8"></a>**测试结果**

   测试结果保存在 ./outputs/inference\_outputs

jerrrrry's avatar
jerrrrry committed
174
175
   ![图片描述](images/8.png)