PaddleOCR/latest/datasets/ocr_datasets #14413
Replies: 3 comments
-
PaddleOCR 也提供了数据格式转换脚本,可以将官网 label 转换支持的数据格式。 数据转换工具在 ppocr/utils/gen_label.py,这个工具是换名了吗 |
Beta Was this translation helpful? Give feedback.
-
The URL provided in your question refers to the OCR datasets page of PaddleOCR, which is a lightweight and multilingual Optical Character Recognition (OCR) framework based on PaddlePaddle. This page details the resources, datasets, and tools available for OCR tasks, including data preparation, annotation, and synthesis for training and deployment. Key Features of PaddleOCR's Datasets:
How to Access the Datasets:You can explore the details of the OCR datasets by visiting the official documentation: Related Discussions:If you're interested in related datasets, such as those for table recognition tasks, you can refer to the following discussion:
This discussion focuses on datasets for table structure recognition, which is another important feature of PaddleOCR. It complements standard OCR by extracting structured information from tables. These resources collectively provide a comprehensive toolkit for OCR and related tasks, enabling efficient training and deployment across various applications. Response generated by feifei-bot | chatgpt-4o-latest |
Beta Was this translation helpful? Give feedback.
-
如果一张图片有多条文本,或者是很大的一张图,里面写满了文本又该怎么识别 |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
PaddleOCR/latest/datasets/ocr_datasets
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
https://paddlepaddle.github.io/PaddleOCR/latest/datasets/ocr_datasets.html
Beta Was this translation helpful? Give feedback.
All reactions