Commit c6dc466

MthwRobinson and qued authored
docs: update capabilities table; fix mistake in para grouping docs (#683)
* docs: update capabilities table with rtf/md/epub tables
* fix regex in docs
* revert bricks update

---------

Co-authored-by: qued <[email protected]>
1 parent 5336891 commit c6dc466

1 file changed, +3 -3 lines changed

README.md

Lines changed: 3 additions & 3 deletions
@@ -90,17 +90,17 @@ about the library.
 | CSV Files (`.csv`) | `partition_csv` | N/A | Yes | None |
 | E-mails (`.eml`) | `partition_eml` | N/A | No | Encoding |
 | E-mails (`.msg`) | `partition_msg` | N/A | No | Encoding |
-| EPubs (`.epub`) | `partition_epub` | N/A | No | Include Page Breaks |
+| EPubs (`.epub`) | `partition_epub` | N/A | Yes | Include Page Breaks |
 | Excel Documents (`.xlsx`/`.xls`) | `partition_xlsx` | N/A | Yes | None |
 | HTML Pages (`.html`) | `partition_html` | N/A | No | Encoding; Include Page Breaks |
 | Images (`.png`/`.jpg`) | `partition_image` | `"auto"`, `"hi_res"`, `"ocr_only"` | Yes | Encoding; Include Page Breaks; Infer Table Structure; OCR Languages, Strategy |
-| Markdown (`.md`) | `partitin_md` | N/A | No | Include Page Breaks |
+| Markdown (`.md`) | `partitin_md` | N/A | Yes | Include Page Breaks |
 | Open Office Documents (`.odt`) | `partition_odt` | N/A | Yes | None |
 | PDFs (`.pdf`) | `partition_pdf` | `"auto"`, `"fast"`, `"hi_res"`, `"ocr_only"` | Yes | Encoding; Include Page Breaks; Infer Table Structure; OCR Languages, Strategy |
 | Plain Text (`.txt`) | `partition_text` | N/A | No | Encoding, Paragraph Grouper |
 | Power Points (`.ppt`) | `partition_ppt` | N/A | Yes | Include Page Breaks |
 | Power Points (`.pptx`) | `partition_pptx` | N/A | Yes | Include Page Breaks |
-| Rich Text Files (`.rtf`) | `partition_rtf` | N/A | No | Include Page Breaks |
+| Rich Text Files (`.rtf`) | `partition_rtf` | N/A | Yes | Include Page Breaks |
 | Word Documents (`.doc`) | `partition_doc` | N/A | Yes | None |
 | Word Documents (`.docx`) | `partition_docx` | N/A | Yes | None |
 | XML Documents (`.xml`) | `partition_xml` | N/A | No | Encoding; XML Keep Tags |
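
For reviewers who want to confirm the scope of this change, here is a minimal sketch that compares each removed row with its replacement and reports which cells differ. It uses only the Python standard library. The helper names `split_row` and `changed_columns` are hypothetical, not part of any library. The table header is outside this hunk, so reading the fourth column as table support is an assumption based on the commit message.

```python
# Removed/added row pairs copied verbatim from the diff above.
CHANGED_ROWS = [
    (
        "| EPubs (`.epub`) | `partition_epub` | N/A | No | Include Page Breaks |",
        "| EPubs (`.epub`) | `partition_epub` | N/A | Yes | Include Page Breaks |",
    ),
    (
        "| Markdown (`.md`) | `partitin_md` | N/A | No | Include Page Breaks |",
        "| Markdown (`.md`) | `partitin_md` | N/A | Yes | Include Page Breaks |",
    ),
    (
        "| Rich Text Files (`.rtf`) | `partition_rtf` | N/A | No | Include Page Breaks |",
        "| Rich Text Files (`.rtf`) | `partition_rtf` | N/A | Yes | Include Page Breaks |",
    ),
]


def split_row(row):
    """Split a Markdown table row into stripped cell strings.

    Assumes no cell contains a literal pipe, which holds for these rows.
    """
    return [cell.strip() for cell in row.strip().strip("|").split("|")]


def changed_columns(old_row, new_row):
    """Return (column_index, old_cell, new_cell) for every cell that differs."""
    old_cells = split_row(old_row)
    new_cells = split_row(new_row)
    if len(old_cells) != len(new_cells):
        raise ValueError("rows have different column counts")
    return [
        (index, old, new)
        for index, (old, new) in enumerate(zip(old_cells, new_cells))
        if old != new
    ]


if __name__ == "__main__":
    for old_row, new_row in CHANGED_ROWS:
        document_type = split_row(new_row)[0]
        print(document_type, changed_columns(old_row, new_row))
```

Each pair differs only in the fourth cell (index 3), which changes from `No` to `Yes`. The other cells are identical in both versions of each row.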
