@pprados
Contributor

@pprados pprados commented Mar 26, 2025

Image parsing triggers a bug because the object extracted for `/Filter` is sometimes an array and sometimes a string.

Fix Issue 30098
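To illustrate the kind of handling this requires, here is a minimal sketch (not the code merged in this PR) that normalizes a `/Filter` value to a list of names, whether it arrives as a single name or as an array. The helper name `normalize_filters` is hypothetical, and plain `str` and `list` values stand in for the PDF library's name and array objects, on the assumption that those behave like `str` and `list`.

```python
from typing import List, Optional, Sequence, Union


def normalize_filters(raw: Optional[Union[str, Sequence[str]]]) -> List[str]:
    """Return the filter names as a list, without the leading '/'.

    Hypothetical helper: `raw` is the value stored under "/Filter",
    which may be absent, a single name, or an array of names.
    """
    if raw is None:
        return []
    if isinstance(raw, str):
        # A single name such as "/DCTDecode": wrap it so both cases
        # can be handled by the same loop below.
        raw = [raw]
    return [str(name).lstrip("/") for name in raw]


# Both shapes produce a list that can be checked uniformly.
single = normalize_filters("/DCTDecode")
multiple = normalize_filters(["/FlateDecode", "/DCTDecode"])
```

Normalizing once at the point of extraction means the code that decides how to decode the image only has to deal with one shape.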

@vercel

vercel bot commented Mar 26, 2025

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Skipped Deployment

| Name | Status | Preview | Comments | Updated (UTC) |
| --- | --- | --- | --- | --- |
| langchain | ⬜️ Ignored (Inspect) | Visit Preview | | Mar 26, 2025 7:45am |

@pprados pprados marked this pull request as ready for review March 26, 2025 08:03
@dosubot dosubot bot added the size:S and bug labels Mar 26, 2025
@pprados
Contributor Author

pprados commented Mar 26, 2025

@eyurtsev
This PR is ready for review. Can you also validate this one?

@dosubot dosubot bot added the lgtm label Mar 26, 2025
@eyurtsev eyurtsev changed the title from "[community] Fix Issue-30098" to "community[patch]: update PyPDFParser to take into account filters returned as arrays" Mar 26, 2025
@eyurtsev
Collaborator

Is there any unit test that could be included?

@eyurtsev eyurtsev merged commit 8e5d2a4 into langchain-ai:master Mar 26, 2025
19 checks passed
@pprados
Contributor Author

pprados commented Mar 27, 2025

No. I couldn't find an example file that caused the problem.

Labels

bug Related to a bug, vulnerability, unexpected error with an existing feature

Development

Successfully merging this pull request may close these issues.

PyPDFParser does not take into account filters returned as arrays.

2 participants