Releases: huridocs/pdf-document-layout-analysis

v0.0.32

16 Oct 14:18

v0.0.31

10 Oct 11:41

Full Changelog: v0.0.30...v0.0.31

v0.0.30

01 Oct 10:08

Upgrade the pdf-features version to fix the common text height issue when there is no text, and the list type issue

v0.0.29

25 Sep 10:01

Upgrade the pdf-features version to fix the Markdown output for hyperlink-styled content

v0.0.28

18 Aug 14:35

Full Changelog: v0.0.27...v0.0.28

v0.0.27

18 Aug 13:37

Full Changelog: v0.0.26...v0.0.27

v0.0.26

18 Aug 09:32

Full Changelog: v0.0.25...v0.0.26

v0.0.25

18 Aug 09:26

Full Changelog: v0.0.24...v0.0.25

v0.0.24

11 Aug 11:13

What's Changed

Support for PDF-to-Markdown and PDF-to-HTML conversion:

  • Different sizes of titles

  • Superscripts/Subscripts

  • Bold/Italic text

  • Tables in HTML format

  • Formulas in LaTeX format

  • List items with different indentations

  • Hyperlinks

  • In-document references

  • Pictures

  • Table of contents information (optional, via the extract_toc parameter)

Other changes:

  • Restructured and refactored the whole project to follow a clean architecture

  • Updated the formula extraction model to a better one

  • Updated the table extraction model to a better and much faster one

Full Changelog: v0.0.23...v0.0.24

v0.0.23

14 Mar 14:00

Remove PDFs after processing

Full Changelog: v0.0.22...v0.0.23