Skip to content

Commit 43c4f7f

Browse files
committed
add relation to the original
1 parent ff4aee3 commit 43c4f7f

File tree

3 files changed

+27
-18
lines changed

3 files changed

+27
-18
lines changed

DESCRIPTION

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -2,14 +2,17 @@ Package: cpp11tesseract
22
Type: Package
33
Title: Open Source OCR Engine
44
Version: 5.3.4
5-
Authors@R: c(person("Jeroen", "Ooms",
6-
role = c("aut"),
7-
email = "[email protected]",
8-
comment = c(ORCID = "0000-0002-4035-0289")),
9-
person("Mauricio", "Vargas Sepulveda",
5+
Authors@R: c(person("Mauricio", "Vargas Sepulveda",
106
role = c("aut", "cre"),
117
email = "[email protected]",
128
comment = c(ORCID = "0000-0003-1017-7574")),
9+
person("Jeroen", "Ooms",
10+
role = c("aut"),
11+
email = "[email protected]",
12+
comment = c("Author of tesseract R package",
13+
ORCID = "0000-0002-4035-0289")),
14+
person("HP", role = "cph", comment = "Author of tesseract"),
15+
person("Google", role = "cph", comment = "Author of tesseract"),
1316
person("Munk School of Global Affairs and Public Policy",
1417
role = "fnd")
1518
)

README.Rmd

Lines changed: 7 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -15,17 +15,19 @@ knitr::opts_chunk$set(
1515

1616
# cpp11tesseract <img src="man/figures/logo.svg" align="right" height="139" alt="" />
1717

18-
> Bindings to [Tesseract-OCR](https://opensource.google/projects/tesseract):
19-
a powerful optical character recognition (OCR) engine that supports over 100 languages.
20-
The engine is highly configurable in order to tune the detection algorithms and
21-
obtain the best possible results.
22-
2318
[![R-CMD-check](https://github.com/pachadotdev/cpp11tesseract/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/pachadotdev/cpp11tesseract/actions/workflows/R-CMD-check.yaml)
2419
[![codecov](https://codecov.io/gh/pachadotdev/cpp11tesseract/graph/badge.svg?token=mWfiUCgfNu)](https://app.codecov.io/gh/pachadotdev/cpp11tesseract)
2520
[![Project Status: Active – The project has reached a stable, usable state and is being actively developed.](https://www.repostatus.org/badges/latest/active.svg)](http://www.repostatus.org/#active)
2621
[![CRAN status](https://www.r-pkg.org/badges/version/cpp11tesseract)](https://CRAN.R-project.org/package=cpp11tesseract)
2722
[![BuyMeACoffee](https://raw.githubusercontent.com/pachadotdev/buymeacoffee-badges/main/bmc-donate-white.svg)](https://buymeacoffee.com/pacha)
2823

24+
`cpp11tesseract` is a fork of `tesseract` that uses `cpp11` for those that
25+
require it for licensing or security purposes. It provides bindings to
26+
[Tesseract-OCR](https://opensource.google/projects/tesseract), a powerful
27+
optical character recognition (OCR) engine that supports over 100 languages. The
28+
engine is highly configurable in order to tune the detection algorithms and
29+
obtain the best possible results.
30+
2931
- Upstream Tesseract-OCR documentation: https://tesseract-ocr.github.io/tessdoc/
3032
- Introduction: https://docs.ropensci.org/tesseract/articles/intro.html
3133
- Reference: https://docs.ropensci.org/tesseract/reference/ocr.html

README.md

Lines changed: 12 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -3,12 +3,6 @@
33

44
# cpp11tesseract <img src="man/figures/logo.svg" align="right" height="139" alt="" />
55

6-
> Bindings to
7-
> [Tesseract-OCR](https://opensource.google/projects/tesseract): a
8-
> powerful optical character recognition (OCR) engine that supports over
9-
> 100 languages. The engine is highly configurable in order to tune the
10-
> detection algorithms and obtain the best possible results.
11-
126
[![R-CMD-check](https://github.com/pachadotdev/cpp11tesseract/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/pachadotdev/cpp11tesseract/actions/workflows/R-CMD-check.yaml)
137
[![codecov](https://codecov.io/gh/pachadotdev/cpp11tesseract/graph/badge.svg?token=mWfiUCgfNu)](https://app.codecov.io/gh/pachadotdev/cpp11tesseract)
148
[![Project Status: Active – The project has reached a stable, usable
@@ -18,6 +12,13 @@ developed.](https://www.repostatus.org/badges/latest/active.svg)](http://www.rep
1812
status](https://www.r-pkg.org/badges/version/cpp11tesseract)](https://CRAN.R-project.org/package=cpp11tesseract)
1913
[![BuyMeACoffee](https://raw.githubusercontent.com/pachadotdev/buymeacoffee-badges/main/bmc-donate-white.svg)](https://buymeacoffee.com/pacha)
2014

15+
`cpp11tesseract` is a fork of `tesseract` that uses `cpp11` for those
16+
that require it for licensing or security purposes. It provides bindings
17+
to [Tesseract-OCR](https://opensource.google/projects/tesseract), a
18+
powerful optical character recognition (OCR) engine that supports over
19+
100 languages. The engine is highly configurable in order to tune the
20+
detection algorithms and obtain the best possible results.
21+
2122
- Upstream Tesseract-OCR documentation:
2223
<https://tesseract-ocr.github.io/tessdoc/>
2324
- Introduction:
@@ -46,11 +47,14 @@ package, which I need to comply with the Munk School IT standards. Using
4647
`cpp11` allows me to vendor the C++ headers into the package, and then I
4748
can conduct an offline installation in the Niagara Cluster.
4849

49-
The documentation changes a bit. I tried to expand the documentation and
50+
The documentation changed a bit. I tried to expand the documentation and
5051
compare with Amazon Textract output.
5152

5253
This package includes some changes requested by CRAN, and these are
53-
mostly about the package internals.
54+
mostly about the package internals. For example, this version lists the
55+
dependencies to install in Linux and Mac, that you can install using
56+
apt/yum/brew, while the original package uses autobrew to install the
57+
Mac dependencies as binaries.
5458

5559
## Installation
5660

0 commit comments

Comments
 (0)