Skip to content

feat: Visualizer tool and command for datasets#186

Merged
nikos-livathinos merged 26 commits intomainfrom
cau/add-external-vis-tool
Dec 9, 2025
Merged

feat: Visualizer tool and command for datasets#186
nikos-livathinos merged 26 commits intomainfrom
cau/add-external-vis-tool

Conversation

@cau-git
Copy link
Member

@cau-git cau-git commented Dec 8, 2025

No description provided.

nikos-livathinos and others added 22 commits December 4, 2025 16:34
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
…mmy entries in all evaluators.

Extend the CLI to support the --external-predictions-path

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
…various formats

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
…th. Add unit test

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
…d unit test.

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
…it test.

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
…unit test

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
…dd unit test.

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
…. Add unit test

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
…dd unit test

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
…ngOrderEvaluator. Fix main

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
…oclingDocument from doctags and

the GT image.
- Introduce the staticmethod load_doctags() which covers all cases on page image loading.
- Refactor the FilePredictionProvider to use the load_doctags() from ExternalDoclingDocumentLoader.
- Refactor all evaluators to use the new ExternalDoclingDocumentLoader.

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
…sing the API and the CLI.

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
@github-actions
Copy link
Contributor

github-actions bot commented Dec 8, 2025

DCO Check Passed

Thanks @cau-git, all your commits are properly signed off. 🎉

@cau-git cau-git changed the base branch from main to nli/external_predictions December 8, 2025 14:43
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
@cau-git cau-git force-pushed the cau/add-external-vis-tool branch from 3794359 to 71f5e17 Compare December 8, 2025 14:45
@cau-git cau-git marked this pull request as ready for review December 8, 2025 14:46
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR adds a new standalone visualization tool for dataset predictions, allowing users to generate HTML visualizations without creating full evaluation datasets. The tool supports both embedded predictions (in dataset parquet files) and external predictions (from separate DoclingDocument files).

Key Changes:

  • Added PredictionsVisualizer utility class for generating GT vs. prediction HTML visualizations
  • Added CLI command create_viz for invoking the visualizer from the command line
  • Added tests for both embedded and external prediction visualization modes

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 8 comments.

File Description
docling_eval/utils/external_predictions_visualizer.py New visualizer class that renders paired ground-truth vs. prediction HTML outputs from datasets
docling_eval/cli/main.py Added create_viz CLI command and unrelated OCR configuration changes
tests/test_predictions_visualizer.py Integration tests for both embedded and external prediction visualization scenarios

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Base automatically changed from nli/external_predictions to main December 8, 2025 15:51
@mergify
Copy link

mergify bot commented Dec 8, 2025

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

🟢 Require two reviewer for test updates

Wonderful, this rule succeeded.

When test data is updated, we require two reviewers

  • #approved-reviews-by >= 2

cau-git and others added 3 commits December 9, 2025 08:20
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Signed-off-by: Christoph Auer <60343111+cau-git@users.noreply.github.com>
…ternal predictions

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
@nikos-livathinos nikos-livathinos merged commit 373f959 into main Dec 9, 2025
10 checks passed
@nikos-livathinos nikos-livathinos deleted the cau/add-external-vis-tool branch December 9, 2025 13:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants