You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+14-3Lines changed: 14 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -79,7 +79,7 @@ In addition to providing an outstanding model library, PaddleOCR 3.0 also offers
79
79
- The high-stability service-oriented deployment solution also supports invocation via manually constructed HTTP requests, enabling client-side code development in any programming language.
80
80
81
81
-**Benchmark Support:**
82
-
-**All production lines now support fine-grained benchmarking, enabling measurement of end-to-end inference time as well as per-layer and per-module latency data to assist with performance analysis.**
82
+
-**All production lines now support fine-grained benchmarking, enabling measurement of end-to-end inference time as well as per-layer and per-module latency data to assist with performance analysis.[Here's](docs/version3.x/pipeline_usage/instructions/benchmark.en.md) how to set up and use the benchmark feature.**
83
83
-**Documentation has been updated to include key metrics for commonly used configurations on mainstream hardware, such as inference latency and memory usage, providing deployment references for users.**
84
84
85
85
-**Bug Fixes:**
@@ -213,10 +213,21 @@ In addition to providing an outstanding model library, PaddleOCR 3.0 also offers
213
213
Install PaddlePaddle refer to [Installation Guide](https://www.paddlepaddle.org.cn/en/install/quick?docurl=/documentation/docs/en/develop/install/pip/linux-pip_en.html), after then, install the PaddleOCR toolkit.
214
214
215
215
```bash
216
-
# Install paddleocr
217
-
pip install paddleocr
216
+
# If you only want to use the basic text recognition feature (returns text position coordinates and content), including the PP-OCR series
217
+
python -m pip install paddleocr
218
+
# If you want to use all features such as document parsing, document understanding, document translation, key information extraction, etc.
219
+
# python -m pip install "paddleocr[all]"
218
220
```
219
221
222
+
Starting from version 3.2.0, in addition to the `all` dependency group demonstrated above, PaddleOCR also supports installing partial optional features by specifying other dependency groups. All dependency groups provided by PaddleOCR are as follows:
223
+
224
+
| Dependency Group Name | Corresponding Functionality |
225
+
| - | - |
226
+
|`doc-parser`| Document parsing: can be used to extract layout elements such as tables, formulas, stamps, images, etc. from documents; includes models like PP-StructureV3 |
227
+
|`ie`| Information extraction: can be used to extract key information from documents, such as names, dates, addresses, amounts, etc.; includes models like PP-ChatOCRv4 |
228
+
|`trans`| Document translation: can be used to translate documents from one language to another; includes models like PP-DocTranslation |
In addition to the `all` dependency group demonstrated above, PaddleOCR also supports installing specific optional features by specifying other dependency groups. The available dependency groups provided by PaddleOCR are as follows:
122
122
123
-
| Dependency Group | Functionality |
124
-
| ----------------|------------------------ |
125
-
|`doc-parser`| Document parsing, which can be used to extract layout elements in a document such as tables, formulas, stamps, and images.|
126
-
|`ie`| Information extraction, which can be used to extract key information from documents, such as names, dates, addresses, amounts, and more.|
127
-
|`trans`| Document translation, which can be used to translate a document from one language to another.|
128
-
|`all`| Full functionality.|
123
+
| Dependency Group Name | Corresponding Functionality|
124
+
| - | - |
125
+
|`doc-parser`| Document parsing: can be used to extract layout elements such as tables, formulas, stamps, images, etc. from documents; includes models like PP-StructureV3|
126
+
|`ie`| Information extraction: can be used to extract key information from documents, such as names, dates, addresses, amounts, etc.; includes models like PP-ChatOCRv4|
127
+
|`trans`| Document translation: can be used to translate documents from one language to another; includes models like PP-DocTranslation|
128
+
|`all`| Complete functionality |
129
129
130
130
The general OCR pipeline (e.g., PP-OCRv3/v4/v5) and the document image preprocessing pipeline can be used without installing any additional dependency groups. Apart from these two pipelines, each remaining pipeline belongs to one and only one dependency group. You can refer to the usage documentation of each pipeline to determine which group it belongs to. For individual functional modules, installing any dependency group that includes the module will enable access to its core functionality.
0 commit comments