I want to extract drawings as images #3508
-
I am building a solution where when provided with PDF it can parse it and make it editable I have extract text and images along with there bounding box's What I want to achieve is, to actually extract drawings / shapes / graphics which is left on pdf and save them as png images One of the approaches I tried is to use Help required |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
If your vector graphics are made of drawing paths that are close enough to each other, the Its parameters allow you some flexibility:
|
Beta Was this translation helpful? Give feedback.
If your vector graphics are made of drawing paths that are close enough to each other, the
Page
method cluster_drawings should deliver what you want.Its parameters allow you some flexibility:
get_drawings()
output to build some desired subset, which you can hand into the method. Otherwise, this parameter is also useful for avoiding multiple vector graphic extractions.get_drawings()
are assumed to belong to the same cluster if their minimum distance is below 3.