fix: convert PIL images to RGB before picture description by aatrey56 · Pull Request #3014 · docling-project/docling

aatrey56 · 2026-02-19T07:47:05Z

Documents frequently contain images in non-RGB modes — PNGs with transparency (RGBA), grayscale scans (L), or palette/indexed color (P). These were being passed directly to _annotate_images without any mode check. Transformers processors and VLM engines require 3-channel RGB input, so any non-RGB image would either crash the pipeline or produce incorrect output silently.

The fix is a single '.convert("RGB")' call in 'PictureDescriptionBaseModel.call', at the point where images are batched before being forwarded to '_annotate_images'. Placing it in the base class means all three subclasses benefit automatically:
'PictureDescriptionVlmModel' (transformers), 'PictureDescriptionVlmEngineModel' (engine abstraction), and 'PictureDescriptionApiModel'.

'Image.convert("RGB")' is safe to call unconditionally — if the image is already RGB it returns a copy unchanged.

Issue resolved by this Pull Request:
Resolves #3000

Checklist

Documentation has been updated, if necessary.
Examples have been added, if necessary.
Tests have been added, if necessary.

Non-RGB image modes (RGBA, L, P) cause failures or incorrect output when passed to transformers processors or VLM engines, which expect 3-channel RGB input. Convert in the base model's __call__ so all subclasses (transformers, engine, API) benefit from a single fix. Closes docling-project#3000 Signed-off-by: aatrey56 <aatrey.sahay@gmail.com>

…rsion fix: convert PIL images to RGB before picture description

github-actions · 2026-02-19T07:47:16Z

✅ DCO Check Passed

Thanks @aatrey56, all your commits are properly signed off. 🎉

mergify · 2026-02-19T07:47:40Z

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

dosubot · 2026-02-19T07:49:06Z

Related Documentation

Checked 15 published document(s) in 1 knowledge base(s). No updates required.

^{How did I do? Any feedback?}

codecov · 2026-02-19T16:30:11Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

aatrey56 added 2 commits February 19, 2026 07:10

Merge pull request #1 from aatrey56/fix/picture-description-rgb-conve…

ccd80d6

…rsion fix: convert PIL images to RGB before picture description

PeterStaar-IBM requested review from cau-git and dolfim-ibm February 19, 2026 16:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: convert PIL images to RGB before picture description#3014

fix: convert PIL images to RGB before picture description#3014
aatrey56 wants to merge 2 commits intodocling-project:mainfrom
aatrey56:main

aatrey56 commented Feb 19, 2026

Uh oh!

github-actions bot commented Feb 19, 2026

Uh oh!

mergify bot commented Feb 19, 2026

Uh oh!

dosubot bot commented Feb 19, 2026

Uh oh!

codecov bot commented Feb 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

aatrey56 commented Feb 19, 2026

Checklist

Uh oh!

github-actions bot commented Feb 19, 2026

Uh oh!

mergify bot commented Feb 19, 2026

Merge Protections

🟢 Enforce conventional commit

Uh oh!

dosubot bot commented Feb 19, 2026

Uh oh!

codecov bot commented Feb 19, 2026

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments