Skip to content

Commit a25317d

Browse files
committed
Lots of added fixtures
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
1 parent 1f6502a commit a25317d

File tree

80 files changed

+39001
-225
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

80 files changed

+39001
-225
lines changed

docling_cvat_tools/cvat_tools/cvat_to_docling.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2987,7 +2987,7 @@ def load_document_pages(
29872987
)
29882988
from docling.datamodel.base_models import InputFormat
29892989
from docling.datamodel.document import InputDocument
2990-
from docling.models.page_preprocessing_model import (
2990+
from docling.models.stages.page_preprocessing.page_preprocessing_model import (
29912991
PagePreprocessingModel,
29922992
PagePreprocessingOptions,
29932993
)

tests/cvat_to_docling/fixtures/002a_table_order/metadata.json

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,8 +2,8 @@
22
"test_id": "002",
33
"name": "002a_table_order",
44
"description": "Tables should be included in reading order correctly. Footnotes should appear once.",
5-
"observation": "Footnotes are replicated on top and after the table as captions, because the XML annotation uses a to_caption link for the footnote. We should tolerate this error.",
6-
"observation_status": "broken",
5+
"observation": "Footnotes are correctly placed. They are not visible in the HTML serialization.",
6+
"observation_status": "correct",
77
"source": {
88
"image_identifier": "doc_fade275d3b925d67abc61f56f5bb414c1cab6c3e94c5953cb9cfa8a6f7a9853d_page_000001.png"
99
},

tests/cvat_to_docling/fixtures/008_kvp_in_picture_legend/expected.json

Lines changed: 1 addition & 1 deletion
Large diffs are not rendered by default.

tests/cvat_to_docling/fixtures/009_rotation_on_table/expected.json

Lines changed: 1 addition & 1 deletion
Large diffs are not rendered by default.

tests/cvat_to_docling/fixtures/009_rotation_on_table/metadata.json

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,8 +2,8 @@
22
"test_id": "009",
33
"name": "009_rotation_on_table",
44
"description": "Rotated table should be correctly rebuilt.",
5-
"observation": "The rotated table is garbage.",
6-
"observation_status": "broken",
5+
"observation": "The rotated table is correctly constructed.",
6+
"observation_status": "correct.",
77
"source": {
88
"image_identifier": "doc_a02617dc490954fe59c8382c7e8dc6203bf230c56b156c7778e09b31f1d43f2b_page_000001.png"
99
},

tests/cvat_to_docling/fixtures/011a_paragraph_continuation_with_picture_between/expected.json

Lines changed: 3 additions & 3 deletions
Large diffs are not rendered by default.

tests/cvat_to_docling/fixtures/011a_paragraph_continuation_with_picture_between/metadata.json

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,8 +2,8 @@
22
"test_id": "011",
33
"name": "011a_paragraph_continuation_with_picture_between",
44
"description": "Paragraph must merge across column wrap, with picture and caption placed afterwards.",
5-
"observation": "The paragraph is cut up, and also all paragraphs before the first figure are concatenated to the list that starts at the top.",
6-
"observation_status": "broken",
5+
"observation": "Correct ordering. Formula transcription is bad from the PDF.",
6+
"observation_status": "correct",
77
"source": {
88
"image_identifier": "doc_f0ac80935cb386d734bee68e317b47050819ab112dadb25fbf78266d3918fbd2_page_000001.png"
99
},

tests/cvat_to_docling/fixtures/011b_paragraph_continuation_with_picture_between/expected.json

Lines changed: 3 additions & 3 deletions
Large diffs are not rendered by default.

tests/cvat_to_docling/fixtures/013_footnote_list_with_children_reading_order/expected.json

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -267,8 +267,8 @@
267267
"coord_origin": "BOTTOMLEFT"
268268
},
269269
"charspan": [
270-
228,
271-
437
270+
0,
271+
209
272272
]
273273
}
274274
],

tests/cvat_to_docling/fixtures/013_footnote_list_with_children_reading_order/metadata.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22
"test_id": "013",
33
"name": "bad_handling_of_footnote_vs_list_and_merges_inside",
44
"description": "The footnote list with sandwiched text must be handled in correct reading order.",
5-
"observation": "Observing out-of-order elements in the footnotes.",
5+
"observation": "Observing out-of-order elements in the footnotes, some elements lost.",
66
"observation_status": "broken",
77
"source": {
88
"image_identifier": "doc_a4b6343889cd5469d7957e1fcfd97be1299a75c865e12779001180c790620801_page_000001.png"

0 commit comments

Comments
 (0)