zk / GroundingDINO-DCU-Optimized
Commit bade2f57, authored Jun 02, 2026 by zk
Update README
parent 7bc21d37
Showing 2 changed files with 18 additions and 19 deletions
README.md +16 -17
migraphx_infer/migraphx_export.bash +2 -2
````diff
--- a/README.md
+++ b/README.md
@@ -268,6 +268,12 @@ bash migraphx_perf.bash
 ```
 4. Test with the Python script
+Set the environment variable first:
+```bash
+export PYTHONPATH=/opt/dtk/lib:$PYTHONPATH
+```
+Run the script:
 ```bash
 python migraphx_infer.py
 # offload=False inference: GPU memory is allocated up front and the data stays on the device during inference
````
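The `PYTHONPATH` export added in this hunk puts `/opt/dtk/lib` on the module search path before `migraphx_infer.py` runs. A minimal pre-flight sketch, assuming the MIGraphX Python bindings are exposed under the module name `migraphx` from that directory; the helper name `check_migraphx_path` is hypothetical and not part of the repository:

```python
import importlib.util
import os
import sys

DTK_LIB = "/opt/dtk/lib"  # path taken from the README's export line


def check_migraphx_path(pythonpath, module="migraphx"):
    """Return (DTK_LIB listed in the PYTHONPATH string, module can be located)."""
    entries = [p for p in pythonpath.split(os.pathsep) if p]
    on_path = DTK_LIB in entries
    # find_spec only locates the module; it does not import or execute it.
    importable = importlib.util.find_spec(module) is not None
    return on_path, importable


if __name__ == "__main__":
    on_path, importable = check_migraphx_path(os.environ.get("PYTHONPATH", ""))
    print(f"{DTK_LIB} in PYTHONPATH: {on_path}")
    print(f"migraphx importable: {importable}")
    if not (on_path and importable):
        sys.exit("Set PYTHONPATH as described in the README before running migraphx_infer.py")
```

Running this in the same shell as the `export` line should show both checks passing before the inference script is started.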
````diff
@@ -313,25 +319,18 @@ python migraphx_infer1.py
 | **ORT + Plugin** | +custom operators<br>+FP16 pure quantization, scheme B | `ground_deform_fp16_all.onnx` | `ort_plugin_fp16_B` | 105.35 | 9.49 |
 | **ORT + Plugin** | +custom operators<br>+FP16 extreme optimization, scheme C | `ground_deform_fp16_all.onnx` | `ort_plugin_fp16_C` | 100.91 | 9.90 |
-### 8.3 migraphx BW150 and BW100 test results
-BW100 sample result:
-```
-Batch size: 1
-Rate: 6.05197 inferences/sec
-Total time: 165.235ms (Min: 165.115ms, Max: 165.535ms, Mean: 165.258ms, Median: 165.225ms)
-Percentiles (90%, 95%, 99%): (165.358ms, 165.358ms, 165.358ms)
-Total instructions time: 205.275ms
-Overhead time: 2.32812ms, -40.0399ms
-Overhead: 1%, -24%
-```
+### 8.3 migraphx BW150, BW100, and K100_AI test results
 Summary
-| Device | Inference method | FPS | Mean inference time (ms) |
+| Device | Inference method | FPS | |
 | :--- | :--- | :--- | :--- |
-| BW150 | migraphx-driver | 14.93 | 66.97 |
-| BW150 | Python + MIGraphX (device) | 13.65 | 73.20 (including pre- and post-processing) |
-| BW100 | migraphx-driver | 13.54 | 73.87 |
-| BW100 | Python + MIGraphX (device) | 12.12 | 82.44 (including pre- and post-processing) |
+| BW150 | migraphx-driver | 26.51 |
+| BW150 | Python + MIGraphX (device) | 23.02 |
+| BW100 | migraphx-driver | 22.93 |
+| BW100 | Python + MIGraphX (device) | 19.39 |
+| K100_AI | migraphx-driver | 19.26 |
+| K100_AI | Python + MIGraphX (device) | 15.90 |
 -----
````
...
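The updated table reports FPS only, while the removed table also listed mean inference time. At batch size 1 the two are reciprocals (the removed row 14.93 FPS / 66.97 ms agrees to within rounding), so latency can be recovered from FPS. A small sketch, assuming the `Rate:` and `Mean:` fields keep the layout of the removed BW100 sample; `parse_perf` and `fps_to_ms` are hypothetical helper names, not part of the repository:

```python
import re

# Sample copied from the BW100 output that this commit removes from the README.
SAMPLE = """\
Batch size: 1
Rate: 6.05197 inferences/sec
Total time: 165.235ms (Min: 165.115ms, Max: 165.535ms, Mean: 165.258ms, Median: 165.225ms)
"""


def parse_perf(text):
    """Return (inferences per second, mean latency in ms) from perf output text."""
    rate = re.search(r"Rate:\s*([\d.]+)\s*inferences/sec", text)
    mean = re.search(r"Mean:\s*([\d.]+)ms", text)
    if rate is None or mean is None:
        raise ValueError("perf output is missing the Rate or Mean field")
    return float(rate.group(1)), float(mean.group(1))


def fps_to_ms(fps):
    """Convert throughput to per-inference latency, assuming batch size 1."""
    return 1000.0 / fps


if __name__ == "__main__":
    rate, mean_ms = parse_perf(SAMPLE)
    print(f"sample: {rate} inferences/sec, mean {mean_ms} ms")
    # FPS values from the new summary table (migraphx-driver rows).
    for device, fps in [("BW150", 26.51), ("BW100", 22.93), ("K100_AI", 19.26)]:
        print(f"{device}: {fps} FPS is about {fps_to_ms(fps):.2f} ms per inference")
```

For the Python + MIGraphX rows, the removed table noted that the timing included pre- and post-processing, so a converted figure there may not be pure model latency.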
--- a/migraphx_infer/migraphx_export.bash
+++ b/migraphx_infer/migraphx_export.bash
 # export MIGRAPHX_TRACE_COMPILE=1
-MIGRAPHX_ENABLE_GRAPHAPI_REDUCTION=1
-MIGRAPHX_ENABLE_LAYERNORM_FUSION=1
+export MIGRAPHX_ENABLE_GRAPHAPI_REDUCTION=1
+export MIGRAPHX_ENABLE_LAYERNORM_FUSION=1
 migraphx-driver perf --onnx \
 ../weights/ground_opt_0601.onnx \
...
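The change to `migraphx_export.bash` adds `export` to the two variable assignments. In a POSIX shell, a plain `NAME=1` on its own line sets a shell variable that child processes do not inherit, so `migraphx-driver` would presumably not have seen either setting before this commit. The sketch below demonstrates that shell behavior with a second `sh` standing in for `migraphx-driver`; it assumes `sh` is available, and whether MIGraphX on DTK actually reads these variables is not shown in the diff. The helper `child_sees` is a hypothetical name.

```python
import os
import subprocess

VAR = "MIGRAPHX_ENABLE_LAYERNORM_FUSION"  # variable name taken from the diff

# The child process is a second `sh`, standing in for migraphx-driver.
CHILD = f"sh -c 'echo ${{{VAR}:-unset}}'"


def child_sees(script):
    """Run `script` under sh with VAR stripped from the environment; return the child's output."""
    env = {k: v for k, v in os.environ.items() if k != VAR}
    result = subprocess.run(
        ["sh", "-c", script], env=env, capture_output=True, text=True, check=True
    )
    return result.stdout.strip()


# Before the commit: plain assignment, so the variable is not passed to the child.
without_export = child_sees(f"{VAR}=1\n{CHILD}")
# After the commit: exported, so the child inherits it.
with_export = child_sees(f"export {VAR}=1\n{CHILD}")

if __name__ == "__main__":
    print("plain assignment ->", without_export)
    print("export           ->", with_export)
```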