Releases: Unstructured-IO/unstructured-inference

0.4.0

14 Apr 21:30
4e7464f

  • Added logic to partition granular elements (words, characters) by proximity
  • Text extraction is now delegated to text regions rather than being handled centrally
  • Fixed embedded image coordinates being interpreted differently than embedded text coordinates
  • Updated how dependencies are handled
  • Updated the detectron2 version

0.3.2

31 Mar 22:48
148b268

  • Allow extracting tables from higher-level functions

0.3.1

31 Mar 16:12
dd32ab1

  • Pin protobuf version to avoid errors
  • Make paddleocr an extra again

0.3.0

29 Mar 23:57
db173d0

  • Fix for text block detection
  • Add paddleocr dependency to setup for x86_64 machines

0.2.14

29 Mar 19:28
43887e6

  • Suppressed processing progress bars

0.2.13

29 Mar 17:57
a501616

  • Add table processing
  • Change OCR logic to be aware of PDF image elements

0.2.11

10 Mar 17:16
4814a72

  • Fixed some cases where image elements were not being OCR'd

0.2.10

07 Mar 21:58
237d69d

  • Removed control characters from tesseract output

0.2.8

25 Feb 01:46
f2ad4ae

  • Refactored YoloX inference code to integrate better with the framework
  • Reduced testing time

0.2.7

10 Feb 03:30
7169127

  • Fixed duplicated load_pdf call