Releases: PrinsFrank/pdfparser
Releases · PrinsFrank/pdfparser
v2.7.0 Image (LUTS) & text extraction improvements
What's Changed
- Integrate samples from pdf-samples repository into this repo by @PrinsFrank in #266
- Update CONTRIBUTING.md with instructions on adding samples by @PrinsFrank in #267
- Add sample from #255 by @PrinsFrank in #268
- Add samples from previous issues by @PrinsFrank in #269
- Fix colorspace parsing issues by @PrinsFrank in #270
- Add support for LUTs in rasterized images by @PrinsFrank in #274
- Add sample from #235 by @PrinsFrank in #271
- Only fall back to identity decoding when toUnicodeCMap is not set, fixes #254 by @PrinsFrank in #276
Full Changelog: v2.6.3...v2.7.0
v2.6.3 Support for hex chars in name objects, method to retrieve subtype for embedded file & deprecation fixes
What's Changed
- Remove deprecated method usages and add PHPStan package to prevent future use by @PrinsFrank in #263
- Add support for hex characters in name objects by @PrinsFrank in #264
- Add method to retrieve subtype for embedded file by @PrinsFrank in #265
Full Changelog: v2.6.2...v2.6.3
v2.6.2 Associated file can reference object with reference array, support for octal escape sequences in dates
What's Changed
- AF key can contain reference to object with reference array by @PrinsFrank in #261
- Support octal escape sequences in date values by @PrinsFrank in #262
Full Changelog: v2.6.1...v2.6.2
v2.6.1 Dictionary array parsing fix
What's Changed
- Fix bug where dictionary arrays are parsed as reference arrays when its number of components is divisible by 3 by @PrinsFrank in #260
Full Changelog: v2.6.0...v2.6.1
v2.6.0 Support for file specifications, several performance improvements
What's Changed
- Upgrade samples library to new version and move sample tests to their own folder by @PrinsFrank in #248
- Cache font widths on Font property as it will be called for every text element by @PrinsFrank in #250
- Decrease memory footprint contentStreamParser by not appending string in memory and reading from stream directly by @PrinsFrank in #251
- Fix font widths caching by setting property to false when no widths are available by @PrinsFrank in #252
- Search backwards from known start of next object positions for endobj/endstream markers instead of start current object by @PrinsFrank in #253
- AF dictionary key can be referenceValueArray by @PrinsFrank in #257
- Clean up old development documentation that is now outdated by @PrinsFrank in #258
- Add support for file specifications by @PrinsFrank in #259
Full Changelog: v2.5.1...v2.6.0
v2.5.1 Fix Backwards incompatible signature change of getContent introduced in previous release
What's Changed
- Revert Backwards incompatible signature change on getContent by @PrinsFrank in #247
Full Changelog: v2.5.0...v2.5.1
v2.5.0 Decreased memory footprint by up to 95%, Filter reference support
What's Changed
- Support references for filters, fixes #237 by @PrinsFrank in #241
- Bump actions/checkout from 4 to 5 by @dependabot[bot] in #242
- Fix type issues for sprintf that were detected with new PHPStan version by @PrinsFrank in #244
- GetContents returns stream to decrease memory footprint by @PrinsFrank in #246
- Decrease memory footprint of content stream parsing by passing objects instead of the entire content as string to contentStreamParser by @PrinsFrank in #245
Full Changelog: v2.4.2...v2.5.0
v2.4.2 Fix regression in encoding/toUnicodeCMap ordering resulting in encoding misses
What's Changed
- Fix regression in ordering of decoding logic by @PrinsFrank in #236
Full Changelog: v2.4.1...v2.4.2
v2.4.1 Several bugfixes related to dictionary entries
What's Changed
- Empty arrays are currently properly supported, but empty reference value arrays result in errors by @PrinsFrank in #231
- Parse invalid TabsNameValues as TextString as fallback option, fixes #228 by @PrinsFrank in #232
- Remove test case for float conversion above PHP_INT_MAX as it's already covered by two other float tests by @PrinsFrank in #233
- Normalize name values to add support for hex encoded characters in name objects (7.3.5) by @PrinsFrank in #234
Full Changelog: v2.4.0...v2.4.1
v2.4.0 More text output support and several bugfixes
What's Changed
- Unlink temp decoding file by @PrinsFrank in #207
- Use printf in exception message by @szepeviktor in #206
- ci.yml: added PHP 8.5 as a test target by @k00ni in #209
- Validate passwords for encrypted documents by @PrinsFrank in #208
- Implement decryption when file encryption key is valid by @PrinsFrank in #210
- Handle new valueErrors from dateTime::createFromFormat by @PrinsFrank in #218
- Allow font operand to contain plus symbol by @PrinsFrank in #217
- Remove default assignee for bugs to allow for other people to pick up new issues by @PrinsFrank in #219
- Add support for font /Differences array by @PrinsFrank in #222
- Handle glyphs in non octal notation for Differences array by @PrinsFrank in #223
- Handle newlines and duplicate whitespace in text positioning operands by @PrinsFrank in #224
- Handle extra whitespace in compressedObjectByteOffsets by @PrinsFrank in #225
- Temporarily disable encryption support as it is not feature complete yet by @PrinsFrank in #226
- Support datetime values without timezone by @whiskeyfur in #227
New Contributors
- @k00ni made their first contribution in #209
- @whiskeyfur made their first contribution in #227
Full Changelog: v2.3.1...v2.4.0