Migrate from PyPDF2 to pypdf and remove obsolete mobi_to_json test by PRAZPC · Pull Request #88 · Py-Contributors/AudioBook

PRAZPC · 2025-12-08T17:29:03Z

Pull Request Template

What have you Changed

Migrated the project from PyPDF2 to pypdf, replacing deprecated imports and updating PDF-handling code accordingly.
Updated the code to use the modern PdfReader API.
Removed/cleaned up the obsolete mobi_to_json test case, which remained in the test suite even though the function was removed from the codebase.
Verified that all tests pass successfully after the migration and cleanup.

Issue no.(must) - #87

Self Check(Tick After Making pull Request)

One Change in one Pull Request
I am following clean code and Documentation and my code is well linted with flake8.

Join Us on Discord:- https://discord.gg/JfbK3bS

…st_pdf_to_json_pypdf

codeperfectplus

Thanks for improving the audiobook.

Copilot

Pull request overview

This PR modernizes the PDF handling library by migrating from the deprecated PyPDF2 to its actively maintained successor pypdf. The migration updates the dependency, refactors all PDF-related code to use the new API, and cleans up an obsolete test case for removed mobi functionality.

Key changes:

Updated dependency from PyPDF2 3.0.1 to pypdf 4.0.1 with corresponding API migrations (PdfFileReader → PdfReader, method name updates)
Renamed PyPDF2DocParser class to PyPDFDocParser to reflect the new library name
Removed obsolete mobi_to_json test case that referenced a function no longer in the codebase

Reviewed changes

Copilot reviewed 5 out of 6 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
requirements.txt	Updated PDF library dependency from PyPDF2 3.0.1 to pypdf 4.0.1
audiobook/doc_parser/pdf_parser.py	Migrated to pypdf API: updated imports, class name, and all method calls (PdfFileReader→PdfReader, numPages→len(pages), getPage→pages[], extractText→extract_text, getOutlines→outline)
audiobook/utils.py	Updated import statement to use PyPDFDocParser instead of PyPDF2DocParser
audiobook/main.py	Updated logger name from "PyPDF2" to "pypdf" to align with new library
tests/test_create_json_book.py	Renamed test from test_pdf_to_json_pypdf2 to test_pdf_to_json_pypdf; commented out obsolete mobi_to_json test
docs/command_line_usage.rst	Updated documentation to reference pypdf instead of pypdf2 in extraction engine table

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

docs/command_line_usage.rst

Copilot · 2025-12-09T04:12:40Z

tests/test_create_json_book.py

    # def test_docs_to_json(self):
    #     self.assertEqual(ab.create_json_book("assets/sample.doc"), (output['docs'], {'book_name': 'sample', 'pages': 1}))


This comment appears to contain commented-out code.

Suggested change

# def test_docs_to_json(self):

# self.assertEqual(ab.create_json_book("assets/sample.doc"), (output['docs'], {'book_name': 'sample', 'pages': 1}))

@unittest.skip("DOC to JSON test is currently disabled (e.g., due to missing support or failing test).")

def test_docs_to_json(self):

self.assertEqual(ab.create_json_book("assets/sample.doc"), (output['docs'], {'book_name': 'sample', 'pages': 1}))

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Deepak Raj <54245038+codeperfectplus@users.noreply.github.com>

PRAZPC added 2 commits December 8, 2025 22:37

migrate from deprecated pypdf2 to pypdf

aeeac40

commented test_mobi_to_json and renamed test_pdf_to_json_pypdf2 to te…

e1cc4cb

…st_pdf_to_json_pypdf

codeperfectplus linked an issue Dec 9, 2025 that may be closed by this pull request

Migrate from PyPDF2 to pypdf and clean up obsolete mobi_to_json test #87

Closed

codeperfectplus approved these changes Dec 9, 2025

View reviewed changes

codeperfectplus requested a review from Copilot December 9, 2025 04:08

Copilot started reviewing on behalf of codeperfectplus December 9, 2025 04:08 View session

codeperfectplus added enhancement New feature or request python Pull requests that update python code labels Dec 9, 2025

Copilot AI reviewed Dec 9, 2025

View reviewed changes

Update docs/command_line_usage.rst

e938d54

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Deepak Raj <54245038+codeperfectplus@users.noreply.github.com>

codeperfectplus merged commit 6a955ed into Py-Contributors:dev Dec 9, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Migrate from PyPDF2 to pypdf and remove obsolete mobi_to_json test#88

Migrate from PyPDF2 to pypdf and remove obsolete mobi_to_json test#88
codeperfectplus merged 3 commits intoPy-Contributors:devfrom
PRAZPC:dev

PRAZPC commented Dec 8, 2025

Uh oh!

codeperfectplus left a comment

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI Dec 9, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		# def test_docs_to_json(self):
		# self.assertEqual(ab.create_json_book("assets/sample.doc"), (output['docs'], {'book_name': 'sample', 'pages': 1}))

Uh oh!

Conversation

PRAZPC commented Dec 8, 2025

Pull Request Template

Issue no.(must) - #87

Self Check(Tick After Making pull Request)

Uh oh!

codeperfectplus left a comment

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants