Skip to content

0.5.31

Choose a tag to compare

@cragwolfe cragwolfe released this 21 Sep 05:10
· 130 commits to main since this release
b9f032c

0.5.31

  • Add functionality to extract and save images from the page
  • Add functionality to get only "true" embedded images when extracting elements from PDF pages
  • Update the layout visualization script to be able to show only image elements if need
  • add an evaluation metric for table comparison based on token similarity
  • fix paddle unit tests where make test fails since paddle doesn't work on M1/M2 chip locally