|
| 1 | +English | [中文](README_CN.md) |
| 2 | + |
1 | 3 | # pdf2docx |
2 | 4 |
|
3 | 5 |  |
|
12 | 14 |
|
13 | 15 | ## Features |
14 | 16 |
|
15 | | -- [x] Parse and re-create page layout |
16 | | - - [x] page margin |
17 | | - - [x] section and column (1 or 2 columns only) |
18 | | - - [ ] page header and footer |
19 | | - |
20 | | -- [x] Parse and re-create paragraph |
21 | | - - [ ] OCR text |
22 | | - - [x] text in horizontal/vertical direction: from left to right, from bottom to top |
23 | | - - [x] font style, e.g. font name, size, weight, italic and color |
24 | | - - [x] text format, e.g. highlight, underline, strike-through |
25 | | - - [ ] list style |
26 | | - - [x] external hyper link |
27 | | - - [x] paragraph horizontal alignment (left/right/center/justify) and vertical spacing |
| 17 | +- Parse and re-create page layout |
| 18 | + - page margin |
| 19 | + - section and column (1 or 2 columns only) |
| 20 | + - page header and footer [TODO] |
| 21 | + |
| 22 | +- Parse and re-create paragraph |
| 23 | + - OCR text [TODO] |
| 24 | + - text in horizontal/vertical direction: from left to right, from bottom to top |
| 25 | + - font style, e.g. font name, size, weight, italic and color |
| 26 | + - text format, e.g. highlight, underline, strike-through |
| 27 | + - list style [TODO] |
| 28 | + - external hyper link |
| 29 | + - paragraph horizontal alignment (left/right/center/justify) and vertical spacing |
28 | 30 |
|
29 | | -- [x] Parse and re-create image |
30 | | - - [x] in-line image |
31 | | - - [x] image in Gray/RGB/CMYK mode |
32 | | - - [x] transparent image |
33 | | - - [x] floating image, i.e. picture behind text |
34 | | - |
35 | | -- [x] Parse and re-create table |
36 | | - - [x] border style, e.g. width, color |
37 | | - - [x] shading style, i.e. background color |
38 | | - - [x] merged cells |
39 | | - - [x] vertical direction cell |
40 | | - - [x] table with partly hidden borders |
41 | | - - [x] nested tables |
42 | | - |
43 | | -- [x] Parsing pages with multi-processing |
| 31 | +- Parse and re-create image |
| 32 | + - in-line image |
| 33 | + - image in Gray/RGB/CMYK mode |
| 34 | + - transparent image |
| 35 | + - floating image, i.e. picture behind text |
| 36 | + |
| 37 | +- Parse and re-create table |
| 38 | + - border style, e.g. width, color |
| 39 | + - shading style, i.e. background color |
| 40 | + - merged cells |
| 41 | + - vertical direction cell |
| 42 | + - table with partly hidden borders |
| 43 | + - nested tables |
| 44 | + |
| 45 | +- Parsing pages with multi-processing |
44 | 46 |
|
45 | 47 | *It can also be used as a tool to extract table contents, since both table content and format/style are parsed.* |
46 | 48 |
|
|