added markdown document for ocr engine comparison #577

Kaan0029 · 2025-07-17T17:32:54Z

This is related to gsoc ocr project by Kaan Erdem.
JabRef/jabref#13313

koppor · 2025-07-17T20:50:02Z

en/advanced/ocr-engine-selection.md

+* **Bad**, because increases support complexity with multiple engines
+
+### Confirmation
+


Elaborate on how this is done. I would assume that you have the 100+ PDFs at hand and wrote a test suite?

No, i wrote this in advance assuming that I will have that many tested later on, but I deleted that section now. Looking at the level of detail and sophistication of the other markdowns (very little) I decided it's not needed.

An ADR can also have TODOs and links to existing drafts of the test suite.

koppor · 2025-07-17T20:51:40Z

en/advanced/ocr-engine-selection.md

+
+* Current implementation uses Tesseract 4.x with LSTM engine
+* In benchmarks, Google Cloud Vision shows the highest overall accuracy
+* Handwriting (categories 2 & 3) is the main differentiator among engines


Where are these catorgies mentioned?

Same as above, deleted this section

koppor · 2025-07-17T20:51:49Z

en/advanced/ocr-engine-selection.md

+
+The web resources that informed this ADR:
+
+1. <https://www.mdpi.com/2073-8994/12/5/715>


Link that to each pro/con agrument

Did that.
see JabRef/jabref#13573

koppor · 2025-07-17T20:52:57Z

en/advanced/ocr-engine-selection.md

@@ -0,0 +1,153 @@
+# ADR-002: OCR Engine Selection for JabRef


Try to follow the format given at JabRef's repo - and place it in the JabRef folder. https://github.com/JabRef/jabref/tree/main/docs/decisions

I think, this is AI generated, because I cannot explain otherwise why A) this takes number 0002 - and in the heading.

(And does not follow the MADR format)

I adjusted the format a little bit, but it was already very similar to the other md files in the folder. I restructured the heading a little bit to make it even more similar.

See the new PR here: JabRef/jabref#13573

calixtus · 2025-07-21T19:49:16Z

should go to devdocs: jabref/docs/decisions

…abRef/user-documentation#577

koppor · 2025-07-23T10:32:44Z

Follow-up PR is JabRef/jabref#13573

Therefore, I close this one.

added markdown document for ocr engine comparison

5831dfc

koppor reviewed Jul 17, 2025

View reviewed changes

koppor requested changes Jul 17, 2025

View reviewed changes

Kaan0029 added a commit to Kaan0029/jabref that referenced this pull request Jul 22, 2025

Add ADR 0047 for OCR engine selection and fix requested changes from J…

ea5a243

…abRef/user-documentation#577

Kaan0029 added a commit to Kaan0029/jabref that referenced this pull request Jul 22, 2025

Add ADR 0047 for OCR engine selection and fix requested changes from J…

122d522

…abRef/user-documentation#577

koppor closed this Jul 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

added markdown document for ocr engine comparison #577

added markdown document for ocr engine comparison #577

Uh oh!

Kaan0029 commented Jul 17, 2025

Uh oh!

koppor Jul 17, 2025

Uh oh!

Kaan0029 Jul 22, 2025

Uh oh!

koppor Jul 23, 2025

Uh oh!

koppor Jul 17, 2025

Uh oh!

Kaan0029 Jul 22, 2025

Uh oh!

koppor Jul 17, 2025

Uh oh!

Kaan0029 Jul 22, 2025

Uh oh!

koppor Jul 17, 2025

Uh oh!

Kaan0029 Jul 22, 2025

Uh oh!

calixtus commented Jul 21, 2025

Uh oh!

koppor commented Jul 23, 2025

Uh oh!

Uh oh!

		* Bad, because increases support complexity with multiple engines

		### Confirmation


		The web resources that informed this ADR:

		1. <https://www.mdpi.com/2073-8994/12/5/715>

added markdown document for ocr engine comparison #577

added markdown document for ocr engine comparison #577

Uh oh!

Conversation

Kaan0029 commented Jul 17, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

calixtus commented Jul 21, 2025

Uh oh!

koppor commented Jul 23, 2025

Uh oh!

Uh oh!