README.md 3.66 KB
Newer Older
jerrrrry's avatar
jerrrrry committed
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
# 🚀 一键启动工具集 | EasyStart v0.1

![Banner](https://placehold.co/1200x400/2a4365/white?text=EasyStart-AI-Testing-Toolkit) <!-- 建议替换为实际项目横幅图 -->

## 🌟 核心功能

| 功能模块 | 适用场景 | 特色亮点 |
|---------|----------|----------|
| **1. 环境测试** | 基础硬件验证 | ROCm带宽测试/ACS监控/全栈状态检查 |
| **2. 环境测试+模型下载+推理** | 交付验证 | 自动下载Modelscope模型/批量测试 |
| **3. 环境测试+批量推理** | 压力测试 | 支持本地模型/Docker集成 |

## 🛠️ 快速开始

```bash
git clone https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1.git
cd easystart_v0.1/


jerrrrry's avatar
jerrrrry committed
20
21
22
23
24
graph TD
    A[ROCm Bandwidth] --> B[RCCL 4/8-GPU]
    B --> C[DCU Status]
    C --> D[ACS Monitoring]
    D --> E[Full System Check]
jerrrrry's avatar
jerrrrry committed
25
26
27
28
29
30
31







jerrrrry's avatar
jerrrrry committed
32
33
34
35
36
37
38
39
40
41
42
43
44
45
**一键启动使用指南**

**一键启动当前版本支持:**

1. 一键启动环境测试(适用于一切场景)
2. 一键启动环境测试+模型下载+大模型推理(更适用于交付场景)
3. 一键启动环境测试+批量大模型推理(适用于大批量测试需求的场景)

   可根据需求进行相应测试。

   <a name="heading_0"></a>**1\_env\_check**

   一键启动环境测试:

jerrrrry's avatar
jerrrrry committed
46
   https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1/-/tree/main/1_env_check
jerrrrry's avatar
jerrrrry committed
47

jerrrrry's avatar
jerrrrry committed
48
   |git clone https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1.git<br>cd 1\_env\_check/<br>bash start.sh|
jerrrrry's avatar
jerrrrry committed
49
50
51
52
   | :- |

   测试项包含:

jerrrrry's avatar
jerrrrry committed
53
   1:rocm\_bandwidth\_test
jerrrrry's avatar
jerrrrry committed
54

jerrrrry's avatar
jerrrrry committed
55
   2:Rccl 4卡&8卡带宽测试
jerrrrry's avatar
jerrrrry committed
56

jerrrrry's avatar
jerrrrry committed
57
   3:贵哥发版的dcu\_env\_check
jerrrrry's avatar
jerrrrry committed
58

jerrrrry's avatar
jerrrrry committed
59
   4:ACS监控
jerrrrry's avatar
jerrrrry committed
60

jerrrrry's avatar
jerrrrry committed
61
62
63
64
65
66
67
68
69
   5:CPU&DCU状态

   6:存储&内存状态

   7:网络状态

   <a name="heading_1"></a>**测试结果**

   测试结果保存在 ./outputs/env\_check\_outputs
jerrrrry's avatar
jerrrrry committed
70
   ![图片描述](images/1.png)
jerrrrry's avatar
jerrrrry committed
71
72
73
74
75

   <a name="heading_2"></a>**2\_env\_check&model\_download&llm\_inference**

   一键启动环境测试+模型下载+llm推理:

jerrrrry's avatar
jerrrrry committed
76
   https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1/-/tree/main/2_env_check%26model_download%26llm_inference
jerrrrry's avatar
jerrrrry committed
77

jerrrrry's avatar
jerrrrry committed
78
   |git clone https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1.git<br>cd 2\_env\_check&model\_download&llm\_inference/<br>bash start.sh|
jerrrrry's avatar
jerrrrry committed
79
80
   | :- |

jerrrrry's avatar
jerrrrry committed
81
82
   ![图片描述](images/2.png)

jerrrrry's avatar
jerrrrry committed
83
84
85
86
   只需将要测试的模型ID**(对应modelscope的模型ID)**传入**download-list.cfg**

   <a name="heading_3"></a>**Tips**

jerrrrry's avatar
jerrrrry committed
87
   1:download-list.cfg里的格式为:**模型ID;本地保存路径**
jerrrrry's avatar
jerrrrry committed
88

jerrrrry's avatar
jerrrrry committed
89
90
   ![图片描述](images/3.png)

jerrrrry's avatar
jerrrrry committed
91
   2:可以写入多个模型下载且**会进行批量测试**
jerrrrry's avatar
jerrrrry committed
92

jerrrrry's avatar
jerrrrry committed
93
   3:模型测试参数通过**model\_to\_test.cfg**传入,**需注意model\_to\_test.cfg的传参格式**
jerrrrry's avatar
jerrrrry committed
94

jerrrrry's avatar
jerrrrry committed
95
96
   ![图片描述](images/4.png)

jerrrrry's avatar
jerrrrry committed
97
98
99
100
101
102
   <a name="heading_4"></a>**测试结果**

   测试结果保存在 ./outputs/env\_check\_outputs和 ./outputs/inference\_outputs

   下载的模型会保存在 ./outputs/models

jerrrrry's avatar
jerrrrry committed
103
104
   ![图片描述](images/5.png)

jerrrrry's avatar
jerrrrry committed
105
106
   <a name="heading_5"></a>**推理结果**

jerrrrry's avatar
jerrrrry committed
107
108
   ![图片描述](images/6.png)

jerrrrry's avatar
jerrrrry committed
109
110
   <a name="heading_6"></a>**测试日志**

jerrrrry's avatar
jerrrrry committed
111
112
   ![图片描述](images/7.png)

jerrrrry's avatar
jerrrrry committed
113
114
115
116
   <a name="heading_7"></a>**3\_env\_check&batches\_llm\_inference**

   一键启动环境测试+批量llm推理:

jerrrrry's avatar
jerrrrry committed
117
   https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1/-/tree/main/3_env_check%26batches_llm_inference
jerrrrry's avatar
jerrrrry committed
118

jerrrrry's avatar
jerrrrry committed
119
   |git clone https://developer.sourcefind.cn/codes/jerrrrry/easystart_v0.1.git<br>cd 3\_env\_check&batches\_llm\_inference/<br>bash start.sh|
jerrrrry's avatar
jerrrrry committed
120
121
122
123
124
125
126
127
128
129
   | :- |

   **只需在start.sh中挂载本地大模型到docker里** 

   **修改model\_to\_test.cfg里的测试参数**

   <a name="heading_8"></a>**测试结果**

   测试结果保存在 ./outputs/inference\_outputs

jerrrrry's avatar
jerrrrry committed
130
131
   ![图片描述](images/8.png)