Skip to content

Commit 9178a2c

Browse files
authored
Merge pull request #77 from RapidAI/add_link_for_doc_crection
chore: add link for doc distortion correction
2 parents 46ad9a1 + 364fa9f commit 9178a2c

File tree

2 files changed

+12
-9
lines changed

2 files changed

+12
-9
lines changed

README.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -17,14 +17,14 @@
1717
### 最近更新
1818
- **2024.10.22**
1919
- 补充复杂背景多表格检测提取方案[RapidTableDet](https://github.com/RapidAI/RapidTableDetection)
20-
- **2024.10.29**
21-
- 使用yolo11重新训练表格分类器,修正wired_table_rec v2逻辑坐标还原错误,并更新测评
2220
- **2024.11.12**
2321
- 抽离模型识别和处理过程核心阈值,方便大家进行微调适配自己的场景[微调入参参考](#核心参数)
22+
- **2024.11.16**
23+
- 补充文档扭曲矫正方案,可作为前置处理 [文档扭曲变形修正](https://github.com/Joker1212/RapidUnWrap)
2424

2525
### 简介
2626
💖该仓库是用来对文档中表格做结构化识别的推理库,包括来自阿里读光有线和无线表格识别模型,llaipython(微信)贡献的有线表格模型,网易Qanything内置表格分类模型等。\
27-
[快速开始](#安装) [模型评测](#指标结果) [使用建议](#使用建议) [表格旋转及透视修正](#表格旋转及透视修正) [微调入参参考](#核心参数) [常见问题](#FAQ) [更新计划](#更新计划)
27+
[快速开始](#安装) [模型评测](#指标结果) [使用建议](#使用建议) [文档扭曲变形修正](https://github.com/Joker1212/RapidUnWrap) [表格旋转及透视修正](#表格旋转及透视修正) [微调入参参考](#核心参数) [常见问题](#FAQ) [更新计划](#更新计划)
2828
#### 特点
2929

3030
**** 采用ONNXRuntime作为推理引擎,cpu下单图推理1-7s
@@ -190,6 +190,8 @@ html, elasp, polygons, logic_points, ocr_res = lineless_table_rec(
190190
下载更高精度的ocr模型,在执行时传入ocr_result即可,
191191
- 或者尝试调节rapid_ocr的参数, 根据在线demo调节参数, [modelscope](https://www.modelscope.cn/studios/liekkas/RapidOCRDemo/summary) [huggingface](https://huggingface.co/spaces/SWHL/RapidOCRDemo)
192192
然后在推理时传入即可
193+
2. **问:文档扭曲变形怎么处理?**
194+
- 答:使用 [RapidUnwrap](https://github.com/Joker1212/RapidUnWrap)
193195
3. **问:模型支持 gpu 加速吗?**
194196
- 答:目前表格模型的推理非常快,有线表格在100ms级别,无线表格在500ms级别,
195197
主要耗时在ocr阶段,可以参考 [rapidocr_paddle](https://rapidai.github.io/RapidOCRDocs/install_usage/rapidocr_paddle/usage/#_3)

README_en.md

Lines changed: 7 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -16,16 +16,14 @@
1616
- **2024.10.22**
1717
- Added the complex background multi-table detection and extraction solution [RapidTableDet](https://github.com/RapidAI/RapidTableDetection).
1818

19-
- **2024.10.29**
20-
- Retrained the table classifier using YOLO11 to fix the logic coordinate restoration error in wired_table_rec v2 and updated evaluations.
21-
2219
- **2024.11.12**
2320
- Extracted model recognition and processing core thresholds for easier fine-tuning according to specific scenarios. See [Core Parameters](#core-parameters).
24-
21+
- **2024.11.16**
22+
- Added document distortion correction solution, which can be used as a pre-processing step [Document Distortion Correction](https://github.com/Joker1212/RapidUnWrap)
2523
### Introduction
2624
💖 This repository serves as an inference library for structured recognition of tables within documents, including models for wired and wireless table recognition from Alibaba DulaLight, a wired table model from llaipython (WeChat), and a built-in table classification model from NetEase Qanything.
2725

28-
[Quick Start](#installation) [Model Evaluation](#evaluation-results) [Usage Recommendations](#usage-recommendations) [Table Rotation & Perspective Correction](#table-rotation-and-perspective-correction) [Fine-tuning Input Parameters Reference](#core-parameters) [Frequently Asked Questions](#faqs) [Update Plan](#update-plan)
26+
[Quick Start](#installation) [Model Evaluation](#evaluation-results) [Usage Recommendations](#usage-recommendations) [Document Distortion Correction](https://github.com/Joker1212/RapidUnWrap) [Table Rotation & Perspective Correction](#table-rotation-and-perspective-correction) [Fine-tuning Input Parameters Reference](#core-parameters) [Frequently Asked Questions](#faqs) [Update Plan](#update-plan)
2927
#### Features
3028

3129
**Fast:** Uses ONNXRuntime as the inference engine, achieving 1-7 seconds per image on CPU.
@@ -203,7 +201,10 @@ html, elasp, polygons, logic_points, ocr_res = lineless_table_rec(
203201
### Processing Workflow
204202

205203
```mermaid
206-
A[/Table Image/] --> B([Table Classification table_cls]) B --> C([Wired Table Recognition wired_table_rec]) & D([Wireless Table Recognition lineless_table_rec]) --> E([Text Recognition rapidocr_onnxruntime]) E --> F[/HTML Structured Output/]
204+
flowchart TD
205+
A[/table image/] --> B([table cls table_cls])
206+
B --> C([wired_table_rec]) & D([lineless_table_rec]) --> E([rapidocr_onnxruntime])
207+
E --> F[/html output/]
207208
```
208209

209210
### Acknowledgments

0 commit comments

Comments
 (0)