You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+5-1Lines changed: 5 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -107,14 +107,18 @@ The output does not contain the whole TEI schema. Instead a simple outputTemplat
107
107
108
108
Afterwards you have to adapt the classes under de.sub.goettingen.arendt.ocrmapping for the changed structure.
109
109
110
-
# New for v1.1
110
+
##New for v1.1
111
111
112
112
* Support for XPath expression before processing
113
113
* extracting of main document language
114
114
* each TEI document gets an ID
115
115
* support for foreign element if specified via command line option
116
116
* improved whitespace handling. The OCR output is missing some whitespaces, this version tries the best to fix the whitespaces
117
117
118
+
## New for v1.1.1
119
+
120
+
* changed XML id for Arendt specific directory processing
121
+
118
122
# Further information
119
123
120
124
The documentation of the Finereader XML File can be found [here](https://abbyy.technology/en:features:ocr:xml), a examplanation of the schema is located [here](https://ocrsdk.com/documentation/specifications/xml-scheme-recognized-document/).
0 commit comments