Skip to content
This repository was archived by the owner on Jun 14, 2018. It is now read-only.

Commit 22e501a

Browse files
committed
0.5
Signed-off-by: Jerome Flesch <[email protected]>
1 parent 30182d1 commit 22e501a

File tree

4 files changed

+15
-5
lines changed

4 files changed

+15
-5
lines changed

ChangeLog

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,7 @@
1+
14/12/2017 - 0.5:
2+
- Tesseract/Libtesseract + LineBoxBuilder: Add confidence scores to
3+
every word boxes and to hOCR files
4+
15
13/05/2017 - 0.4.7:
26
- Tesseract 4.00.00alpha:
37
- Version parsing: Ignore suffix (so '4.00.00alpha' == (4, 0, 0))

README.markdown

Lines changed: 7 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -107,8 +107,13 @@ line_and_word_boxes = tool.image_to_string(
107107
# line.content is the whole text of the line
108108
# line.position is the position of the whole line on the page (in pixels)
109109
#
110-
# Beware that some OCR tools (Tesseract for instance)
111-
# may return empty boxes
110+
# Each word box object has an attribute 'confidence' giving the confidence
111+
# score provided by the OCR tool. Confidence score depends entirely on
112+
# the OCR tool. Only supported with Tesseract and Libtesseract (always 0
113+
# with Cuneiform).
114+
#
115+
# Beware that some OCR tools (Tesseract for instance) may return boxes
116+
# with an empty content.
112117

113118
# Digits - Only Tesseract (not 'libtesseract' yet !)
114119
digits = tool.image_to_string(

setup.py

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -11,12 +11,13 @@
1111
# - ChangeLog
1212
# - push
1313
# - tag
14-
version="0.4.7",
14+
# - python3 ./setup.py sdist upload
15+
version="0.5",
1516
description=("A Python wrapper for OCR engines (Tesseract, Cuneiform,"
1617
" etc)"),
1718
keywords="tesseract cuneiform ocr",
1819
url="https://github.com/openpaperwork/pyocr",
19-
download_url="https://github.com/openpaperwork/pyocr/archive/0.4.7.zip",
20+
download_url="https://github.com/openpaperwork/pyocr/archive/0.5.zip",
2021
classifiers=[
2122
"Development Status :: 5 - Production/Stable",
2223
"Intended Audience :: Developers",

src/pyocr/pyocr.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -62,7 +62,7 @@
6262
cuneiform,
6363
]
6464

65-
VERSION = (0, 4, 7)
65+
VERSION = (0, 5, 0)
6666

6767

6868
def get_available_tools():

0 commit comments

Comments
 (0)