How to get only cropped pages data #1501
Unanswered
kathimohan
Discussed in #1508 (comment).
I cropped a page using pypdf, but when I try to read its text, I get all of the page's data instead of only the cropped region. How can I get only the cropped page's data?
from PyPDF2 import PdfWriter, PdfReader
from PyPDF2.generic import RectangleObject
import pdfplumber

# Crop the second page of abc.pdf and save it as new.pdf
reader = PdfReader('abc.pdf')
writer = PdfWriter()
page1 = reader.pages[1]
mb = page1.mediabox
page1.cropbox = RectangleObject((41.82533333333333, 771.0659999999999, 552.965, 741.742))
writer.add_page(page1)
#print(page1.extractText())
with open('new.pdf', 'wb') as f:
    writer.write(f)

# Read the cropped page back with pdfplumber
pdf_name = "new.pdf"
with pdfplumber.open(pdf_name) as pdf:
    page = pdf.pages[0]
    tb = page.extract_text()
    print(tb)

# Read the cropped page back with PyPDF2
reader = PdfReader("new.pdf")
page = reader.pages[0]
print(page.extract_text())
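Setting the cropbox only changes which part of the page a viewer displays. The page's content stream still holds all of the text, so text extraction returns everything. One possible workaround, sketched below, is to filter the extracted text by position using the `visitor_text` callback of `extract_text()`. The helper names (`normalize_box`, `make_visitor`, `extract_text_in_cropbox`) are made up for this illustration. The sketch assumes the newer `pypdf` package is installed and that `tm[4]` and `tm[5]` are close enough to page coordinates, which may not hold for rotated or scaled pages.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[float, float, float, float]


def normalize_box(box: Sequence[float]) -> Box:
    """Return (left, bottom, right, top), whatever order the corners came in."""
    x0, y0, x1, y1 = (float(v) for v in box)
    return (min(x0, x1), min(y0, y1), max(x0, x1), max(y0, y1))


def make_visitor(box: Sequence[float], parts: List[str]) -> Callable:
    """Build a visitor_text callback that keeps only text positioned inside box."""
    left, bottom, right, top = normalize_box(box)

    def visitor(text, cm, tm, font_dict, font_size):
        # Assumption: tm[4], tm[5] approximate the x, y position of the text
        # in page coordinates (the current transformation matrix is ignored).
        x, y = tm[4], tm[5]
        if left <= x <= right and bottom <= y <= top:
            parts.append(text)

    return visitor


def extract_text_in_cropbox(path: str, page_index: int = 0) -> str:
    """Hypothetical helper: text of one page, limited to that page's cropbox."""
    from pypdf import PdfReader  # imported here so the helpers above need no pypdf

    page = PdfReader(path).pages[page_index]
    parts: List[str] = []
    page.extract_text(visitor_text=make_visitor(page.cropbox, parts))
    return "".join(parts)
```

With the file from the question this would be called as `print(extract_text_in_cropbox("new.pdf", 0))`. With pdfplumber, a similar result may be possible with `page.crop(bbox).extract_text()`, keeping in mind that pdfplumber measures `top` and `bottom` from the top of the page rather than from the bottom.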