Commit 599bbc78 authored by chenych's avatar chenych
Browse files

Add paddleocr-vl-1.5

parents
Apache License
Version 2.0, January 2004
http://www.apache.org/licenses/
TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
1. Definitions.
"License" shall mean the terms and conditions for use, reproduction,
and distribution as defined by Sections 1 through 9 of this document.
"Licensor" shall mean the copyright owner or entity authorized by
the copyright owner that is granting the License.
"Legal Entity" shall mean the union of the acting entity and all
other entities that control, are controlled by, or are under common
control with that entity. For the purposes of this definition,
"control" means (i) the power, direct or indirect, to cause the
direction or management of such entity, whether by contract or
otherwise, or (ii) ownership of fifty percent (50%) or more of the
outstanding shares, or (iii) beneficial ownership of such entity.
"You" (or "Your") shall mean an individual or Legal Entity
exercising permissions granted by this License.
"Source" form shall mean the preferred form for making modifications,
including but not limited to software source code, documentation
source, and configuration files.
"Object" form shall mean any form resulting from mechanical
transformation or translation of a Source form, including but
not limited to compiled object code, generated documentation,
and conversions to other media types.
"Work" shall mean the work of authorship, whether in Source or
Object form, made available under the License, as indicated by a
copyright notice that is included in or attached to the work
(an example is provided in the Appendix below).
"Derivative Works" shall mean any work, whether in Source or Object
form, that is based on (or derived from) the Work and for which the
editorial revisions, annotations, elaborations, or other modifications
represent, as a whole, an original work of authorship. For the purposes
of this License, Derivative Works shall not include works that remain
separable from, or merely link (or bind by name) to the interfaces of,
the Work and Derivative Works thereof.
"Contribution" shall mean any work of authorship, including
the original version of the Work and any modifications or additions
to that Work or Derivative Works thereof, that is intentionally
submitted to Licensor for inclusion in the Work by the copyright owner
or by an individual or Legal Entity authorized to submit on behalf of
the copyright owner. For the purposes of this definition, "submitted"
means any form of electronic, verbal, or written communication sent
to the Licensor or its representatives, including but not limited to
communication on electronic mailing lists, source code control systems,
and issue tracking systems that are managed by, or on behalf of, the
Licensor for the purpose of discussing and improving the Work, but
excluding communication that is conspicuously marked or otherwise
designated in writing by the copyright owner as "Not a Contribution."
"Contributor" shall mean Licensor and any individual or Legal Entity
on behalf of whom a Contribution has been received by Licensor and
subsequently incorporated within the Work.
2. Grant of Copyright License. Subject to the terms and conditions of
this License, each Contributor hereby grants to You a perpetual,
worldwide, non-exclusive, no-charge, royalty-free, irrevocable
copyright license to reproduce, prepare Derivative Works of,
publicly display, publicly perform, sublicense, and distribute the
Work and such Derivative Works in Source or Object form.
3. Grant of Patent License. Subject to the terms and conditions of
this License, each Contributor hereby grants to You a perpetual,
worldwide, non-exclusive, no-charge, royalty-free, irrevocable
(except as stated in this section) patent license to make, have made,
use, offer to sell, sell, import, and otherwise transfer the Work,
where such license applies only to those patent claims licensable
by such Contributor that are necessarily infringed by their
Contribution(s) alone or by combination of their Contribution(s)
with the Work to which such Contribution(s) was submitted. If You
institute patent litigation against any entity (including a
cross-claim or counterclaim in a lawsuit) alleging that the Work
or a Contribution incorporated within the Work constitutes direct
or contributory patent infringement, then any patent licenses
granted to You under this License for that Work shall terminate
as of the date such litigation is filed.
4. Redistribution. You may reproduce and distribute copies of the
Work or Derivative Works thereof in any medium, with or without
modifications, and in Source or Object form, provided that You
meet the following conditions:
(a) You must give any other recipients of the Work or
Derivative Works a copy of this License; and
(b) You must cause any modified files to carry prominent notices
stating that You changed the files; and
(c) You must retain, in the Source form of any Derivative Works
that You distribute, all copyright, patent, trademark, and
attribution notices from the Source form of the Work,
excluding those notices that do not pertain to any part of
the Derivative Works; and
(d) If the Work includes a "NOTICE" text file as part of its
distribution, then any Derivative Works that You distribute must
include a readable copy of the attribution notices contained
within such NOTICE file, excluding those notices that do not
pertain to any part of the Derivative Works, in at least one
of the following places: within a NOTICE text file distributed
as part of the Derivative Works; within the Source form or
documentation, if provided along with the Derivative Works; or,
within a display generated by the Derivative Works, if and
wherever such third-party notices normally appear. The contents
of the NOTICE file are for informational purposes only and
do not modify the License. You may add Your own attribution
notices within Derivative Works that You distribute, alongside
or as an addendum to the NOTICE text from the Work, provided
that such additional attribution notices cannot be construed
as modifying the License.
You may add Your own copyright statement to Your modifications and
may provide additional or different license terms and conditions
for use, reproduction, or distribution of Your modifications, or
for any such Derivative Works as a whole, provided Your use,
reproduction, and distribution of the Work otherwise complies with
the conditions stated in this License.
5. Submission of Contributions. Unless You explicitly state otherwise,
any Contribution intentionally submitted for inclusion in the Work
by You to the Licensor shall be under the terms and conditions of
this License, without any additional terms or conditions.
Notwithstanding the above, nothing herein shall supersede or modify
the terms of any separate license agreement you may have executed
with Licensor regarding such Contributions.
6. Trademarks. This License does not grant permission to use the trade
names, trademarks, service marks, or product names of the Licensor,
except as required for reasonable and customary use in describing the
origin of the Work and reproducing the content of the NOTICE file.
7. Disclaimer of Warranty. Unless required by applicable law or
agreed to in writing, Licensor provides the Work (and each
Contributor provides its Contributions) on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
implied, including, without limitation, any warranties or conditions
of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
PARTICULAR PURPOSE. You are solely responsible for determining the
appropriateness of using or redistributing the Work and assume any
risks associated with Your exercise of permissions under this License.
8. Limitation of Liability. In no event and under no legal theory,
whether in tort (including negligence), contract, or otherwise,
unless required by applicable law (such as deliberate and grossly
negligent acts) or agreed to in writing, shall any Contributor be
liable to You for damages, including any direct, indirect, special,
incidental, or consequential damages of any character arising as a
result of this License or out of the use or inability to use the
Work (including but not limited to damages for loss of goodwill,
work stoppage, computer failure or malfunction, or any and all
other commercial damages or losses), even if such Contributor
has been advised of the possibility of such damages.
9. Accepting Warranty or Additional Liability. While redistributing
the Work or Derivative Works thereof, You may choose to offer,
and charge a fee for, acceptance of support, warranty, indemnity,
or other liability obligations and/or rights consistent with this
License. However, in accepting such obligations, You may act only
on Your own behalf and on Your sole responsibility, not on behalf
of any other Contributor, and only if You agree to indemnify,
defend, and hold each Contributor harmless for any liability
incurred by, or claims asserted against, such Contributor by reason
of your accepting any such warranty or additional liability.
END OF TERMS AND CONDITIONS
APPENDIX: How to apply the Apache License to your work.
To apply the Apache License to your work, attach the following
boilerplate notice, with the fields enclosed by brackets "[]"
replaced with your own identifying information. (Don't include
the brackets!) The text should be enclosed in the appropriate
comment syntax for the file format. We also recommend that a
file or class name and description of purpose be included on the
same "printed page" as the copyright notice for easier
identification within third-party archives.
Copyright (c) 2025 PaddlePaddle Authors. All Rights Reserved.
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
# PaddleOCR-VL-1.5
## 论文
暂无
## 模型简介
PaddleOCR-VL-1.5 是 PaddleOCR-VL 的下一代先进模型,在 OmniDocBench v1.5 基准上实现了 94.5% 的全新 SOTA(当前最优)准确率。 为严格评估模型在真实世界物理失真(包括扫描伪影、倾斜、扭曲、屏幕拍摄和光照变化)下的鲁棒性,我们提出了 Real5-OmniDocBench 基准。实验结果表明,该增强模型在新构建的基准上达到了 SOTA 性能。此外,我们在保持模型为 0.9B 超紧凑视觉语言模型(VLM)并具备高效率的同时,进一步扩展了其能力,新增了印章识别和文本检测任务。
PaddleOCR-VL-1.5 的核心能力
1. 参数量仅为 0.9B,PaddleOCR-VL-1.5 在 OmniDocBench v1.5 上达到 94.5% 的准确率,超越了先前的 SOTA 模型 PaddleOCR-VL。在表格、公式和文本识别方面均取得显著提升。
2. 通过支持不规则形状定位,引入了一种创新的文档解析方法,可在倾斜和扭曲的文档条件下实现精确的多边形检测。在五类真实场景(扫描、倾斜、扭曲、屏幕拍摄和光照变化)下的评测中,其性能均优于主流开源及闭源模型。
3. 模型新增了文本检测(文本行定位与识别)以及印章识别功能,所有相关指标在其各自任务中均创下新的 SOTA 成绩。
4. PaddleOCR-VL-1.5 进一步强化了在专业场景和多语言识别方面的能力。针对生僻字、古籍文本、多语言表格、下划线和复选框的识别性能得到提升,并将语言支持范围扩展至中国藏文和孟加拉语。
5. 模型支持自动跨页表格合并和跨页段落标题识别,有效缓解了长文档解析中的内容碎片化问题。
<div align=center>
<img src="./doc/PaddleOCR-VL-1.5.png"/>
</div>
## 环境依赖
- 列举基础环境需求,根据实际情况填写
| 软件 | 版本 |
| :------: | :------: |
| DTK | 25.04.2 |
| python | 3.10.12 |
| transformers | 4.57.1 |
| vllm | 0.9.2+das.opt1.dtk25042 |
| paddlepaddle-dcu | 3.2.2 |
| paddlex | 3.4 |
| paddleocr | 3.3.3 |
推荐使用镜像: image.sourcefind.cn:5000/dcu/admin/base/vllm:0.9.2-ubuntu22.04-dtk25.04.2-py3.10
- 挂载地址`-v`根据实际模型情况修改
```bash
docker run -it \
--shm-size 60g \
--network=host \
--name paddleocr-vl-1.5 \
--privileged \
--device=/dev/kfd \
--device=/dev/dri \
--device=/dev/mkfd \
--group-add video \
--cap-add=SYS_PTRACE \
--security-opt seccomp=unconfined \
-u root \
-v /opt/hyhal/:/opt/hyhal/:ro \
-v /path/your_code_data/:/path/your_code_data/ \
image.sourcefind.cn:5000/dcu/admin/base/vllm:0.9.2-ubuntu22.04-dtk25.04.2-py3.10 bash
```
更多镜像可前往[光源](https://sourcefind.cn/#/service-list)下载使用。
关于本项目DCU显卡所需的特殊深度学习库可从[光合](https://developer.sourcefind.cn/tool/)开发者社区下载安装,其它包安装如下:
```
python -m pip install paddlepaddle-dcu==3.2.2 -i https://www.paddlepaddle.org.cn/packages/stable/dcu/
python -m pip install -U "paddleocr[doc-parser]"
pip install paddlex==3.4
```
## 数据集
暂无
## 训练
暂无
## 推理
### vllm
#### 单机推理
- 命令行
```bash
export PADDLE_PDX_DISABLE_DEV_MODEL_WL=1
export DISABLE_MODEL_SOURCE_CHECK=1
paddleocr doc_parser -i doc/paddleocr_vl_demo.png --device DCU --precision fp32 --save_path ./output
```
- 服务端方式
```bash
## server
paddlex_genai_server --model_name PaddleOCR-VL-1.5-0.9B --backend vllm --host 127.0.0.1 --port 8118 --model_dir PaddlePaddle/PaddleOCR-VL-1.5
## client
export PADDLE_PDX_DISABLE_DEV_MODEL_WL=1
export DISABLE_MODEL_SOURCE_CHECK=1
paddleocr doc_parser -i doc/paddleocr_vl_demo.png --vl_rec_backend vllm-server --vl_rec_server_url http://127.0.0.1:8118/v1 --device DCU --save_path ./output-vllm
```
## 效果展示
<div align=center>
<img src="doc/paddleocr_vl_demo_layout_det_res.png"/>
</div>
<div align=center>
<img src="doc/result-dcu.png"/>
</div>
### 精度
DCU与GPU精度一致,推理框架:paddle。
## 预训练权重
| 模型名称 | 权重大小 | DCU型号 | 最低卡数需求 |下载地址|
|:-----:|:----------:|:----------:|:---------------------:|:----------:|
| PaddleOCR-VL-1.5 | 1B | K100AI | 1 | [Modelscope](https://modelscope.cn/models/PaddlePaddle/PaddleOCR-VL-1.5) |
## 源码仓库及问题反馈
- https://developer.sourcefind.cn/codes/modelzoo/paddleocr-vl-1.5_paddle
## 参考资料
- https://github.com/PaddlePaddle/PaddleOCR
\ No newline at end of file
# 助力双方交往 搭建友谊桥梁
本报记者 沈小晓 任彦 黄培昭
<div style="text-align: center;"><img src="imgs/img_in_image_box_777_201_1502_685.jpg" alt="Image" width="47%" /></div>
在厄立特里亚不久前举办的第六届中国风筝文化节上,当地小学生体验风筝制作。中国驻厄立特里亚大使馆供图
身着中国传统民族服装的厄立特里亚青年依次登台表演中国民族舞、现代舞、扇子舞等,曼妙的舞姿赢得现场观众阵阵掌声。这是日前厄立特里亚高等教育与研究院孔子学院(以下简称“厄特孔院”)举办“喜迎新年”中国歌舞比赛的场景。
中国和厄立特里亚传统友谊深厚。近年来,在高质量共建“一带一路”框架下,中厄两国人文交流不断深化,互利合作的民意基础日益深厚。
## “学好中文,我们的未来不是梦”
“鲜花曾告诉我你怎样走过,大地知道你心中的每一个角落……”厄立特里亚阿斯马拉大学综合楼二层,一阵优美的歌声在走廊里回响。循着熟悉的旋律轻轻推开一间教室的门,学生们正跟着老师学唱中文歌曲《同一首歌》。
这是厄特孔院阿斯马拉大学教学点的一节中文歌曲课。为了让学生们更好地理解歌词大意,老师尤斯拉·穆罕默德萨尔·侯赛因逐字翻译和解释歌词。随着伴奏声响起,学生们边唱边随着节拍摇动身体,现场气氛热烈。
“这是中文歌曲初级班,共有32人。学生大部分来自首都阿斯马拉的中小学,年龄最小的仅有6岁。”尤斯拉告诉记者。
尤斯拉今年23岁,是厄立特里亚一所公立学校的艺术老师。她12岁开始在厄特孔院学习中文,在2017年第十届“汉语桥”世界中学生中文比赛中获得厄立特里亚赛区第一名,并和同伴代表厄立特里亚前往中国参加决赛,获得团体优胜奖。2022年起,尤斯拉开始在厄特孔院兼职教授中文歌曲,每周末两个课时。“中国文化博大精深,我希望我的学生们能够通过中文歌曲更好地理解中国文化。”她说。
“姐姐,你想去中国吗?”“非常想!我想去看故宫、爬长城。”尤斯拉的学生中有一对能歌善舞的姐妹,姐姐露娅今年15岁,妹妹莉娅14岁,两人都已在厄特孔院学习多年,中文说得格外流利。
露娅对记者说:“这些年来,怀着对中文和中国文化的热爱,我们姐妹俩始终相互鼓励,一起学习。我们的中文一天比一天好,还学会了中文歌和中国舞。我们一定要到中国去。学好中文,我们的未来不是梦!”
据厄特孔院中方院长黄鸣飞介绍,这所孔院成立于2013年3月,由贵州财经大学和厄立特里亚高等教育与研究院合作建立,开设了中国语言课程和中国文化课程,注册学生2万余人次。10余年来,厄特孔院已成为当地民众了解中国的一扇窗口。
黄鸣飞表示,随着来学习中文的人日益增多,阿斯马拉大学教学点已难以满足教学需要。2024年4月,由中企蜀道集团所属四川路桥承建的孔院教学楼项目在阿斯马拉开工建设,预计今年上半年竣工,建成后将为厄特孔院提供全新的办学场地。
## “在中国学习的经历让我看到更广阔的世界”
多年来,厄立特里亚广大赴华留学生和培训人员积极投身国家建设,成为助力该国发展的人才和厄中友好的见证者和推动者。
在厄立特里亚全国妇女联盟工作的约翰娜·特韦尔德·凯莱塔就是其中一位。她曾在中华女子学院攻读硕士学位,研究方向是女性领导力与社会发展。其间,她实地走访中国多个地区,获得了观察中国社会发展的第一手资料。
谈起在中国求学的经历,约翰娜记忆犹新:“中国的发展在当今世界是独一无二的。沿着中国特色社会主义道路坚定前行,中国创造了发展奇迹,这一切都离不开中国共产党的领导。中国的发展经验值得许多国家学习借鉴。”
正在西南大学学习的厄立特里亚博士生穆卢盖塔·泽穆伊对中国怀有深厚感情。8年前,在北京师范大学获得硕士学位后,穆卢盖塔在社交媒体上写下这样一段话:“这是我人生的重要一步,自此我拥有了一双坚固的鞋子,赋予我穿越荆棘的力量。”
穆卢盖塔密切关注中国在经济、科技、教育等领域的发展,“中国在科研等方面的实力与日俱增。在中国学习的经历让我看到更广阔的世界,从中受益匪浅。”
23岁的莉迪亚·埃斯蒂法诺斯已在厄特孔院学习3年,在中国书法、中国画等方面表现十分优秀,在2024年厄立特里亚赛区的“汉语桥”比赛中获得一等奖。莉迪亚说:“学习中国书法让我的内心变得安宁和纯粹。我也喜欢中国的服饰,希望未来能去中国学习,把中国不同民族元素融入服装设计中,创作出更多精美作品,也把厄特文化分享给更多的中国朋友。”
“不管远近都是客人,请不用客气;相约好了在一起,我们欢迎你……”在一场中厄青年联谊活动上,四川路桥中方员工同当地大学生合唱《北京欢迎你》。厄立特里亚技术学院计算机科学与工程专业学生鲁夫塔·谢拉是其中一名演唱者,她很早便在孔院学习中文,一直在为去中国留学作准备。“这句歌词是我们两国人民友谊的生动写照。无论是投身于厄立特里亚基础设施建设的中企员工,还是在中国留学的厄立特里亚学子,两国人民携手努力,必将推动两国关系不断向前发展。”鲁夫塔说。
厄立特里亚高等教育委员会主任助理萨马瑞表示:“每年我们都会组织学生到中国访问学习,目前有超过5000名厄立特里亚学生在中国留学。学习中国的教育经验,有助于提升厄立特里亚的教育水平。”
## “共同向世界展示非洲和亚洲的灿烂文明”
从阿斯马拉出发,沿着蜿蜒曲折的盘山公路一路向东寻找丝路印迹。驱车两个小时,记者来到位于厄立特里亚港口城市马萨瓦的北红海省博物馆。
博物馆二层陈列着一个发掘自阿杜利斯古城的中国古代陶制酒器,罐身上写着“万”“和”“禅”“山”等汉字。“这件文物证明,很早以前我们就通过海上丝绸之路进行贸易往来与文化交流。这也是厄立特里亚与中国友好交往历史的有力证明。”北红海省博物馆研究与文献部负责人伊萨亚斯·特斯法兹吉说。
厄立特里亚国家博物馆考古学和人类学研究员菲尔蒙·特韦尔德十分喜爱中国文化。他表示:“学习彼此的语言和文化,将帮助厄中两国人民更好地理解彼此,助力双方交往,搭建友谊桥梁。”
厄立特里亚国家博物馆馆长塔吉丁·努里达姆·优素福曾多次访问中国,对中华文明的传承与创新、现代化博物馆的建设与发展印象深刻。“中国博物馆不仅有许多保存完好的文物,还充分运用先进科技手段进行展示,帮助人们更好理解中华文明。”塔吉丁说,“厄立特里亚与中国都拥有悠久的文明,始终相互理解、相互尊重。我希望未来与中国同行加强合作,共同向世界展示非洲和亚洲的灿烂文明。”
\ No newline at end of file
{
"input_path": "paddleocr_vl_demo.png",
"page_index": null,
"page_count": null,
"width": 1524,
"height": 1368,
"model_settings": {
"use_doc_preprocessor": false,
"use_layout_detection": true,
"use_chart_recognition": false,
"use_seal_recognition": false,
"use_ocr_for_image_block": false,
"format_block_content": false,
"merge_layout_blocks": true,
"markdown_ignore_labels": [
"number",
"footnote",
"header",
"header_image",
"footer",
"footer_image",
"aside_text"
],
"return_layout_polygon_points": true
},
"parsing_res_list": [
{
"block_label": "doc_title",
"block_content": "助力双方交往 搭建友谊桥梁",
"block_bbox": [
130,
35,
1384,
127
],
"block_id": 0,
"block_order": 1,
"group_id": 0,
"block_polygon_points": [
[
130.0,
35.0
],
[
1384.0,
35.0
],
[
1384.0,
127.0
],
[
130.0,
127.0
]
]
},
{
"block_label": "text",
"block_content": "本报记者 沈小晓 任彦 黄培昭",
"block_bbox": [
582,
157,
930,
183
],
"block_id": 1,
"block_order": 2,
"group_id": 1,
"block_polygon_points": [
[
582.0,
157.0
],
[
930.0,
157.0
],
[
930.0,
183.0
],
[
582.0,
183.0
]
]
},
{
"block_label": "image",
"block_content": "",
"block_bbox": [
777,
201,
1502,
685
],
"block_id": 2,
"block_order": null,
"group_id": 2,
"block_polygon_points": [
[
777.0,
201.0
],
[
1502.0,
201.0
],
[
1502.0,
685.0
],
[
777.0,
685.0
]
]
},
{
"block_label": "vision_footnote",
"block_content": "在厄立特里亚不久前举办的第六届中国风筝文化节上,当地小学生体验风筝制作。中国驻厄立特里亚大使馆供图",
"block_bbox": [
809,
702,
1486,
750
],
"block_id": 3,
"block_order": null,
"group_id": 3,
"block_polygon_points": [
[
809,
702
],
[
809,
736
],
[
817,
743
],
[
1485,
749
],
[
1485,
723
],
[
1478,
716
],
[
1455,
702
]
]
},
{
"block_label": "text",
"block_content": "身着中国传统民族服装的厄立特里亚青年依次登台表演中国民族舞、现代舞、扇子舞等,曼妙的舞姿赢得现场观众阵阵掌声。这是日前厄立特里亚高等教育与研究院孔子学院(以下简称“厄特孔院”)举办“喜迎新年”中国歌舞比赛的场景。",
"block_bbox": [
9,
199,
361,
342
],
"block_id": 4,
"block_order": 3,
"group_id": 4,
"block_polygon_points": [
[
9.0,
199.0
],
[
361.0,
199.0
],
[
361.0,
342.0
],
[
9.0,
342.0
]
]
},
{
"block_label": "text",
"block_content": "中国和厄立特里亚传统友谊深厚。近年来,在高质量共建“一带一路”框架下,中厄两国人文交流不断深化,互利合作的民意基础日益深厚。",
"block_bbox": [
8,
344,
360,
440
],
"block_id": 5,
"block_order": 4,
"group_id": 5,
"block_polygon_points": [
[
8.0,
344.0
],
[
360.0,
344.0
],
[
360.0,
440.0
],
[
8.0,
440.0
]
]
},
{
"block_label": "paragraph_title",
"block_content": "“学好中文,我们的未来不是梦”",
"block_bbox": [
27,
455,
341,
520
],
"block_id": 6,
"block_order": 5,
"group_id": 6,
"block_polygon_points": [
[
27.0,
455.0
],
[
341.0,
455.0
],
[
341.0,
520.0
],
[
27.0,
520.0
]
]
},
{
"block_label": "text",
"block_content": "“鲜花曾告诉我你怎样走过,大地知道你心中的每一个角落……”厄立特里亚阿斯马拉大学综合楼二层,一阵优美的歌声在走廊里回响。循着熟悉的旋律轻轻推开一间教室的门,学生们正跟着老师学唱中文歌曲《同一首歌》。",
"block_bbox": [
8,
535,
359,
655
],
"block_id": 7,
"block_order": 6,
"group_id": 7,
"block_polygon_points": [
[
8.0,
535.0
],
[
359.0,
535.0
],
[
359.0,
655.0
],
[
8.0,
655.0
]
]
},
{
"block_label": "text",
"block_content": "这是厄特孔院阿斯马拉大学教学点的一节中文歌曲课。为了让学生们更好地理解歌词大意,老师尤斯拉·穆罕默德萨尔·侯赛因逐字翻译和解释歌词。随着伴奏声响起,学生们边唱边随着节拍摇动身体,现场气氛热烈。",
"block_bbox": [
8,
656,
361,
773
],
"block_id": 8,
"block_order": 7,
"group_id": 8,
"block_polygon_points": [
[
8.0,
656.0
],
[
361.0,
656.0
],
[
361.0,
773.0
],
[
8.0,
773.0
]
]
},
{
"block_label": "text",
"block_content": "“这是中文歌曲初级班,共有32人。学生大部分来自首都阿斯马拉的中小学,年龄最小的仅有6岁。”尤斯拉告诉记者。",
"block_bbox": [
8,
776,
360,
846
],
"block_id": 9,
"block_order": 8,
"group_id": 9,
"block_polygon_points": [
[
8.0,
776.0
],
[
360.0,
776.0
],
[
360.0,
846.0
],
[
8.0,
846.0
]
]
},
{
"block_label": "text",
"block_content": "尤斯拉今年23岁,是厄立特里亚一所公立学校的艺术老师。她12岁开始在厄特孔院学习中文,在2017年第十届“汉语桥”世界中学生中文比赛中获得厄立特里亚赛区第一名,并和同伴代表厄立特里亚前往中国参加决赛,获得团体优胜奖。2022年起,尤斯拉开始在厄特孔院兼职教授中文歌曲,每周末两个课时。“中国文化博大精深,我希望我的学生们能够通过中文歌曲更好地理解中国文化。”她说。",
"block_bbox": [
8,
847,
361,
1061
],
"block_id": 10,
"block_order": 9,
"group_id": 10,
"block_polygon_points": [
[
8.0,
847.0
],
[
361.0,
847.0
],
[
361.0,
1061.0
],
[
8.0,
1061.0
]
]
},
{
"block_label": "text",
"block_content": "“姐姐,你想去中国吗?”“非常想!我想去看故宫、爬长城。”尤斯拉的学生中有一对能歌善舞的姐妹,姐姐露娅今年15岁,妹妹莉娅14岁,两人都已在厄特孔院学习多年,中文说得格外流利。",
"block_bbox": [
8,
1063,
360,
1181
],
"block_id": 11,
"block_order": 10,
"group_id": 11,
"block_polygon_points": [
[
8.0,
1063.0
],
[
360.0,
1063.0
],
[
360.0,
1181.0
],
[
8.0,
1181.0
]
]
},
{
"block_label": "text",
"block_content": "露娅对记者说:“这些年来,怀着对中文和中国文化的热爱,我们姐妹俩始终相互鼓励,一起学习。我们的中文一天比一天好,还学会了中文歌和中国舞。我们一定要到中国去。学好中文,我们的未来不是梦!”",
"block_bbox": [
8,
1183,
361,
1301
],
"block_id": 12,
"block_order": 11,
"group_id": 12,
"block_polygon_points": [
[
8.0,
1183.0
],
[
361.0,
1183.0
],
[
361.0,
1301.0
],
[
8.0,
1301.0
]
]
},
{
"block_label": "text",
"block_content": "据厄特孔院中方院长黄鸣飞介绍,这所孔院成立于2013年3月,由贵州财经大学和厄立特里亚高等教育与研究院合作建立,开设了中国语言课程和中国文化课程,注册学生2万余人次。10余年来,厄特孔院已成为当地民众了解中国的一扇窗口。",
"block_bbox": [
9,
1303,
361,
1351
],
"block_id": 13,
"block_order": 12,
"group_id": 13,
"block_polygon_points": [
[
9.0,
1303.0
],
[
361.0,
1303.0
],
[
361.0,
1351.0
],
[
9.0,
1351.0
]
]
},
{
"block_label": "text",
"block_content": "",
"block_bbox": [
389,
199,
742,
294
],
"block_id": 14,
"block_order": 13,
"group_id": 13,
"block_polygon_points": [
[
389.0,
199.0
],
[
742.0,
199.0
],
[
742.0,
294.0
],
[
389.0,
294.0
]
]
},
{
"block_label": "text",
"block_content": "黄鸣飞表示,随着来学习中文的人日益增多,阿斯马拉大学教学点已难以满足教学需要。2024年4月,由中企蜀道集团所属四川路桥承建的孔院教学楼项目在阿斯马拉开工建设,预计今年上半年竣工,建成后将为厄特孔院提供全新的办学场地。",
"block_bbox": [
389,
296,
743,
440
],
"block_id": 15,
"block_order": 14,
"group_id": 15,
"block_polygon_points": [
[
389.0,
296.0
],
[
743.0,
296.0
],
[
743.0,
440.0
],
[
389.0,
440.0
]
]
},
{
"block_label": "paragraph_title",
"block_content": "“在中国学习的经历让我看到更广阔的世界”",
"block_bbox": [
407,
454,
721,
520
],
"block_id": 16,
"block_order": 15,
"group_id": 16,
"block_polygon_points": [
[
407.0,
454.0
],
[
721.0,
454.0
],
[
721.0,
520.0
],
[
407.0,
520.0
]
]
},
{
"block_label": "text",
"block_content": "多年来,厄立特里亚广大赴华留学生和培训人员积极投身国家建设,成为助力该国发展的人才和厄中友好的见证者和推动者。",
"block_bbox": [
390,
535,
742,
607
],
"block_id": 17,
"block_order": 16,
"group_id": 17,
"block_polygon_points": [
[
390.0,
535.0
],
[
742.0,
535.0
],
[
742.0,
607.0
],
[
390.0,
607.0
]
]
},
{
"block_label": "text",
"block_content": "在厄立特里亚全国妇女联盟工作的约翰娜·特韦尔德·凯莱塔就是其中一位。她曾在中华女子学院攻读硕士学位,研究方向是女性领导力与社会发展。其间,她实地走访中国多个地区,获得了观察中国社会发展的第一手资料。",
"block_bbox": [
389,
609,
742,
749
],
"block_id": 18,
"block_order": 17,
"group_id": 18,
"block_polygon_points": [
[
389.0,
609.0
],
[
742.0,
609.0
],
[
742.0,
749.0
],
[
389.0,
749.0
]
]
},
{
"block_label": "text",
"block_content": "谈起在中国求学的经历,约翰娜记忆犹新:“中国的发展在当今世界是独一无二的。沿着中国特色社会主义道路坚定前行,中国创造了发展奇迹,这一切都离不开中国共产党的领导。中国的发展经验值得许多国家学习借鉴。”",
"block_bbox": [
389,
751,
741,
893
],
"block_id": 19,
"block_order": 18,
"group_id": 19,
"block_polygon_points": [
[
389.0,
751.0
],
[
741.0,
751.0
],
[
741.0,
893.0
],
[
389.0,
893.0
]
]
},
{
"block_label": "text",
"block_content": "正在西南大学学习的厄立特里亚博士生穆卢盖塔·泽穆伊对中国怀有深厚感情。8年前,在北京师范大学获得硕士学位后,穆卢盖塔在社交媒体上写下这样一段话:“这是我人生的重要一步,自此我拥有了一双坚固的鞋子,赋予我穿越荆棘的力量。”",
"block_bbox": [
389,
895,
742,
1037
],
"block_id": 20,
"block_order": 19,
"group_id": 20,
"block_polygon_points": [
[
389.0,
895.0
],
[
742.0,
895.0
],
[
742.0,
1037.0
],
[
389.0,
1037.0
]
]
},
{
"block_label": "text",
"block_content": "穆卢盖塔密切关注中国在经济、科技、教育等领域的发展,“中国在科研等方面的实力与日俱增。在中国学习的经历让我看到更广阔的世界,从中受益匪浅。”",
"block_bbox": [
389,
1039,
742,
1133
],
"block_id": 21,
"block_order": 20,
"group_id": 21,
"block_polygon_points": [
[
389.0,
1039.0
],
[
742.0,
1039.0
],
[
742.0,
1133.0
],
[
389.0,
1133.0
]
]
},
{
"block_label": "text",
"block_content": "23岁的莉迪亚·埃斯蒂法诺斯已在厄特孔院学习3年,在中国书法、中国画等方面表现十分优秀,在2024年厄立特里亚赛区的“汉语桥”比赛中获得一等奖。莉迪亚说:“学习中国书法让我的内心变得安宁和纯粹。我也喜欢中国的服饰,希望未来能去中国学习,把中国不同民族元素融入服装设计中,创作出更多精美作品,也把厄特文化分享给更多的中国朋友。”\n“不管远近都是客人,请不用客气;相约好了在一起,我们欢迎你……”在一场中厄青年联谊活动上,四川路桥中方员工同当地大学生合唱《北京欢迎你》。厄立特里亚技术学院计算机科学与工程专业学生鲁夫塔·谢拉是其中一名演唱者,她很早便在孔院学习中文,一直在为去中国留学作准备。“这句歌词是我们两国人民友谊的生动写照。无论是投身于厄立特里亚基础设施建设的中企员工,还是在中国留学的厄立特里亚学子,两国人民携手努力,必将推动两国关系不断向前发展。”鲁夫塔说。",
"block_bbox": [
388,
1135,
742,
1351
],
"block_id": 22,
"block_order": 21,
"group_id": 22,
"block_polygon_points": [
[
388.0,
1135.0
],
[
742.0,
1135.0
],
[
742.0,
1351.0
],
[
388.0,
1351.0
]
]
},
{
"block_label": "text",
"block_content": "",
"block_bbox": [
770,
773,
1124,
1062
],
"block_id": 23,
"block_order": 22,
"group_id": 22,
"block_polygon_points": [
[
770.0,
773.0
],
[
1124.0,
773.0
],
[
1124.0,
1062.0
],
[
770.0,
1062.0
]
]
},
{
"block_label": "text",
"block_content": "厄立特里亚高等教育委员会主任助理萨马瑞表示:“每年我们都会组织学生到中国访问学习,目前有超过5000名厄立特里亚学生在中国留学。学习中国的教育经验,有助于提升厄立特里亚的教育水平。”",
"block_bbox": [
770,
1062,
1124,
1183
],
"block_id": 24,
"block_order": 23,
"group_id": 24,
"block_polygon_points": [
[
770.0,
1062.0
],
[
1124.0,
1062.0
],
[
1124.0,
1183.0
],
[
770.0,
1183.0
]
]
},
{
"block_label": "paragraph_title",
"block_content": "“共同向世界展示非洲和亚洲的灿烂文明”",
"block_bbox": [
790,
1198,
1103,
1263
],
"block_id": 25,
"block_order": 24,
"group_id": 25,
"block_polygon_points": [
[
790.0,
1198.0
],
[
1103.0,
1198.0
],
[
1103.0,
1263.0
],
[
790.0,
1263.0
]
]
},
{
"block_label": "text",
"block_content": "从阿斯马拉出发,沿着蜿蜒曲折的盘山公路一路向东寻找丝路印迹。驱车两个小时,记者来到位于厄立特里亚港口城市马萨瓦的北红海省博物馆。",
"block_bbox": [
770,
1278,
1124,
1352
],
"block_id": 26,
"block_order": 25,
"group_id": 26,
"block_polygon_points": [
[
770.0,
1278.0
],
[
1124.0,
1278.0
],
[
1124.0,
1352.0
],
[
770.0,
1352.0
]
]
},
{
"block_label": "text",
"block_content": "",
"block_bbox": [
1154,
774,
1333,
797
],
"block_id": 27,
"block_order": 26,
"group_id": 26,
"block_polygon_points": [
[
1154.0,
774.0
],
[
1333.0,
774.0
],
[
1333.0,
797.0
],
[
1154.0,
797.0
]
]
},
{
"block_label": "text",
"block_content": "博物馆二层陈列着一个发掘自阿杜利斯古城的中国古代陶制酒器,罐身上写着“万”“和”“禅”“山”等汉字。“这件文物证明,很早以前我们就通过海上丝绸之路进行贸易往来与文化交流。这也是厄立特里亚与中国友好交往历史的有力证明。”北红海省博物馆研究与文献部负责人伊萨亚斯·特斯法兹吉说。",
"block_bbox": [
1151,
798,
1506,
989
],
"block_id": 28,
"block_order": 27,
"group_id": 28,
"block_polygon_points": [
[
1151.0,
798.0
],
[
1506.0,
798.0
],
[
1506.0,
989.0
],
[
1151.0,
989.0
]
]
},
{
"block_label": "text",
"block_content": "厄立特里亚国家博物馆考古学和人类学研究员菲尔蒙·特韦尔德十分喜爱中国文化。他表示:“学习彼此的语言和文化,将帮助厄中两国人民更好地理解彼此,助力双方交往,搭建友谊桥梁。”",
"block_bbox": [
1152,
991,
1506,
1109
],
"block_id": 29,
"block_order": 28,
"group_id": 29,
"block_polygon_points": [
[
1152.0,
991.0
],
[
1506.0,
991.0
],
[
1506.0,
1109.0
],
[
1152.0,
1109.0
]
]
},
{
"block_label": "text",
"block_content": "厄立特里亚国家博物馆馆长塔吉丁·努里达姆·优素福曾多次访问中国,对中华文明的传承与创新、现代化博物馆的建设与发展印象深刻。“中国博物馆不仅有许多保存完好的文物,还充分运用先进科技手段进行展示,帮助人们更好理解中华文明。”塔吉丁说,“厄立特里亚与中国都拥有悠久的文明,始终相互理解、相互尊重。我希望未来与中国同行加强合作,共同向世界展示非洲和亚洲的灿烂文明。”",
"block_bbox": [
1152,
1111,
1507,
1352
],
"block_id": 30,
"block_order": 29,
"group_id": 30,
"block_polygon_points": [
[
1152.0,
1111.0
],
[
1507.0,
1111.0
],
[
1507.0,
1352.0
],
[
1152.0,
1352.0
]
]
}
],
"layout_det_res": {
"input_path": null,
"page_index": null,
"boxes": [
{
"cls_id": 6,
"label": "doc_title",
"score": 0.9300571084022522,
"coordinate": [
130,
35,
1384,
127
],
"order": 1,
"polygon_points": [
[
130.0,
35.0
],
[
1384.0,
35.0
],
[
1384.0,
127.0
],
[
130.0,
127.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.8483848571777344,
"coordinate": [
582,
157,
930,
183
],
"order": 2,
"polygon_points": [
[
582.0,
157.0
],
[
930.0,
157.0
],
[
930.0,
183.0
],
[
582.0,
183.0
]
]
},
{
"cls_id": 14,
"label": "image",
"score": 0.9810925126075745,
"coordinate": [
777,
201,
1502,
685
],
"order": null,
"polygon_points": [
[
777.0,
201.0
],
[
1502.0,
201.0
],
[
1502.0,
685.0
],
[
777.0,
685.0
]
]
},
{
"cls_id": 24,
"label": "vision_footnote",
"score": 0.4310949742794037,
"coordinate": [
810,
702,
1452,
724
],
"order": null,
"polygon_points": [
[
810.0,
702.0
],
[
1452.0,
702.0
],
[
1452.0,
724.0
],
[
810.0,
724.0
]
]
},
{
"cls_id": 24,
"label": "vision_footnote",
"score": 0.6346683502197266,
"coordinate": [
809,
702,
1486,
750
],
"order": null,
"polygon_points": [
[
809,
702
],
[
809,
736
],
[
817,
743
],
[
1485,
749
],
[
1485,
723
],
[
1478,
716
],
[
1455,
702
]
]
},
{
"cls_id": 24,
"label": "vision_footnote",
"score": 0.3505808413028717,
"coordinate": [
1246,
729,
1487,
750
],
"order": null,
"polygon_points": [
[
1246,
729
],
[
1246,
749
],
[
1486,
749
],
[
1486,
729
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9565537571907043,
"coordinate": [
9,
199,
361,
342
],
"order": 3,
"polygon_points": [
[
9.0,
199.0
],
[
361.0,
199.0
],
[
361.0,
342.0
],
[
9.0,
342.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9485597610473633,
"coordinate": [
8,
344,
360,
440
],
"order": 4,
"polygon_points": [
[
8.0,
344.0
],
[
360.0,
344.0
],
[
360.0,
440.0
],
[
8.0,
440.0
]
]
},
{
"cls_id": 17,
"label": "paragraph_title",
"score": 0.9114850163459778,
"coordinate": [
27,
455,
341,
520
],
"order": 5,
"polygon_points": [
[
27.0,
455.0
],
[
341.0,
455.0
],
[
341.0,
520.0
],
[
27.0,
520.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9484403729438782,
"coordinate": [
8,
535,
359,
655
],
"order": 6,
"polygon_points": [
[
8.0,
535.0
],
[
359.0,
535.0
],
[
359.0,
655.0
],
[
8.0,
655.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9521226286888123,
"coordinate": [
8,
656,
361,
773
],
"order": 7,
"polygon_points": [
[
8.0,
656.0
],
[
361.0,
656.0
],
[
361.0,
773.0
],
[
8.0,
773.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9330599308013916,
"coordinate": [
8,
776,
360,
846
],
"order": 8,
"polygon_points": [
[
8.0,
776.0
],
[
360.0,
776.0
],
[
360.0,
846.0
],
[
8.0,
846.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9604430794715881,
"coordinate": [
8,
847,
361,
1061
],
"order": 9,
"polygon_points": [
[
8.0,
847.0
],
[
361.0,
847.0
],
[
361.0,
1061.0
],
[
8.0,
1061.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9469948410987854,
"coordinate": [
8,
1063,
360,
1181
],
"order": 10,
"polygon_points": [
[
8.0,
1063.0
],
[
360.0,
1063.0
],
[
360.0,
1181.0
],
[
8.0,
1181.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9434539079666138,
"coordinate": [
8,
1183,
361,
1301
],
"order": 11,
"polygon_points": [
[
8.0,
1183.0
],
[
361.0,
1183.0
],
[
361.0,
1301.0
],
[
8.0,
1301.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9266975522041321,
"coordinate": [
9,
1303,
361,
1351
],
"order": 12,
"polygon_points": [
[
9.0,
1303.0
],
[
361.0,
1303.0
],
[
361.0,
1351.0
],
[
9.0,
1351.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9517078399658203,
"coordinate": [
389,
199,
742,
294
],
"order": 13,
"polygon_points": [
[
389.0,
199.0
],
[
742.0,
199.0
],
[
742.0,
294.0
],
[
389.0,
294.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9540071487426758,
"coordinate": [
389,
296,
743,
440
],
"order": 14,
"polygon_points": [
[
389.0,
296.0
],
[
743.0,
296.0
],
[
743.0,
440.0
],
[
389.0,
440.0
]
]
},
{
"cls_id": 17,
"label": "paragraph_title",
"score": 0.8874742388725281,
"coordinate": [
407,
454,
721,
520
],
"order": 15,
"polygon_points": [
[
407.0,
454.0
],
[
721.0,
454.0
],
[
721.0,
520.0
],
[
407.0,
520.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9290581345558167,
"coordinate": [
390,
535,
742,
607
],
"order": 16,
"polygon_points": [
[
390.0,
535.0
],
[
742.0,
535.0
],
[
742.0,
607.0
],
[
390.0,
607.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9516089558601379,
"coordinate": [
389,
609,
742,
749
],
"order": 17,
"polygon_points": [
[
389.0,
609.0
],
[
742.0,
609.0
],
[
742.0,
749.0
],
[
389.0,
749.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9524989724159241,
"coordinate": [
389,
751,
741,
893
],
"order": 18,
"polygon_points": [
[
389.0,
751.0
],
[
741.0,
751.0
],
[
741.0,
893.0
],
[
389.0,
893.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9566522240638733,
"coordinate": [
389,
895,
742,
1037
],
"order": 19,
"polygon_points": [
[
389.0,
895.0
],
[
742.0,
895.0
],
[
742.0,
1037.0
],
[
389.0,
1037.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9500834345817566,
"coordinate": [
389,
1039,
742,
1133
],
"order": 20,
"polygon_points": [
[
389.0,
1039.0
],
[
742.0,
1039.0
],
[
742.0,
1133.0
],
[
389.0,
1133.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9630913138389587,
"coordinate": [
388,
1135,
742,
1351
],
"order": 21,
"polygon_points": [
[
388.0,
1135.0
],
[
742.0,
1135.0
],
[
742.0,
1351.0
],
[
388.0,
1351.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9659959077835083,
"coordinate": [
770,
773,
1124,
1062
],
"order": 22,
"polygon_points": [
[
770.0,
773.0
],
[
1124.0,
773.0
],
[
1124.0,
1062.0
],
[
770.0,
1062.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9498738050460815,
"coordinate": [
770,
1062,
1124,
1183
],
"order": 23,
"polygon_points": [
[
770.0,
1062.0
],
[
1124.0,
1062.0
],
[
1124.0,
1183.0
],
[
770.0,
1183.0
]
]
},
{
"cls_id": 17,
"label": "paragraph_title",
"score": 0.8923302292823792,
"coordinate": [
790,
1198,
1103,
1263
],
"order": 24,
"polygon_points": [
[
790.0,
1198.0
],
[
1103.0,
1198.0
],
[
1103.0,
1263.0
],
[
790.0,
1263.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9308297634124756,
"coordinate": [
770,
1278,
1124,
1352
],
"order": 25,
"polygon_points": [
[
770.0,
1278.0
],
[
1124.0,
1278.0
],
[
1124.0,
1352.0
],
[
770.0,
1352.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.8169817328453064,
"coordinate": [
1154,
774,
1333,
797
],
"order": 26,
"polygon_points": [
[
1154.0,
774.0
],
[
1333.0,
774.0
],
[
1333.0,
797.0
],
[
1154.0,
797.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9588348269462585,
"coordinate": [
1151,
798,
1506,
989
],
"order": 27,
"polygon_points": [
[
1151.0,
798.0
],
[
1506.0,
798.0
],
[
1506.0,
989.0
],
[
1151.0,
989.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9523345232009888,
"coordinate": [
1152,
991,
1506,
1109
],
"order": 28,
"polygon_points": [
[
1152.0,
991.0
],
[
1506.0,
991.0
],
[
1506.0,
1109.0
],
[
1152.0,
1109.0
]
]
},
{
"cls_id": 22,
"label": "text",
"score": 0.9643592238426208,
"coordinate": [
1152,
1111,
1507,
1352
],
"order": 29,
"polygon_points": [
[
1152.0,
1111.0
],
[
1507.0,
1111.0
],
[
1507.0,
1352.0
],
[
1152.0,
1352.0
]
]
}
]
}
}
\ No newline at end of file
icon.png

61 KB

# 模型唯一标识
modelCode=2009
# 模型名称
modelName=paddleocr-vl-1.5_paddle
# 模型描述
modelDescription=PaddleOCR-VL-1.5 是 PaddleOCR-VL 的下一代先进模型,在 OmniDocBench v1.5 基准上实现了 94.5% 的全新 SOTA(当前最优)准确率。
# 运行过程
processType=推理
# 算法类别
appCategory=OCR
# 框架类型
frameType=paddle
# 加速卡类型
accelerateType=K100AI
from paddleocr import PaddleOCRVL
pipeline = PaddleOCRVL(device='DCU')
# pipeline = PaddleOCRVL(use_doc_orientation_classify=True) # 通过 use_doc_orientation_classify 指定是否使用文档方向分类模型
# pipeline = PaddleOCRVL(use_doc_unwarping=True) # 通过 use_doc_unwarping 指定是否使用文本图像矫正模块
# pipeline = PaddleOCRVL(use_layout_detection=False) # 通过 use_layout_detection 指定是否使用版面区域检测排序模块
output = pipeline.predict("doc/paddleocr_vl_demo.png")
for res in output:
res.print() ## 打印预测的结构化输出
res.save_to_json(save_path="output-jpg") ## 保存当前图像的结构化json结果
res.save_to_markdown(save_path="output-jpg") ## 保存当前图像的markdown格式的结果
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment