kimlee87/ecpo_ocrd

 
 


OCR-D Pipeline for ECPO

Work in Progress

Prerequisites

  • Python 3.10 (other versions might work, but this is the one we tested; very recent versions do not work)
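
As a quick sanity check before installing, the interpreter version can be compared against the tested one. This is only a sketch: it assumes the interpreter is available as python3 (adjust the name if yours is python), and the variable names are illustrative rather than part of ecpo_ocrd.

```shell
# Tested version according to the prerequisites above.
required="3.10"

# Assumption: the interpreter is called python3; falls back to "none" if it is missing.
found="$(python3 -c 'import sys; print("%d.%d" % sys.version_info[:2])' 2>/dev/null || echo "none")"

if [ "$found" = "$required" ]; then
    status="ok"
else
    status="mismatch"
fi

echo "required=$required found=$found status=$status"
```

If the status is mismatch, a dedicated Python 3.10 environment is probably the safest route.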

Installation

git clone https://github.com/ssciwr/ecpo_ocrd.git
cd ecpo_ocrd
python -m pip install .
ecpo_ocrd install

Usage

Before doing anything, add the bin folder to PATH:

export PATH=$PWD/bin:$PATH
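
The export only affects the current shell session and depends on being run from the repository root. A small sketch for confirming that the entry actually ended up on PATH (the path_ok variable is just an illustrative name):

```shell
# Prepend the bin folder of the current directory, quoted in case the path contains spaces.
export PATH="$PWD/bin:$PATH"

# Look for the exact entry between colon separators.
case ":$PATH:" in
    *":$PWD/bin:"*) path_ok="yes" ;;
    *)              path_ok="no" ;;
esac

echo "bin folder on PATH: $path_ok"
```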

Then, create an OCR-D workspace for your data via:

mkdir myworkspace
cd myworkspace
ecpo_ocrd workspace create

For details, see ecpo_ocrd workspace create --help. For testing purposes, always add --sample n with a reasonably small n; otherwise you will process the entire subcorpus for the selected data source.
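
A test run with a small sample might look like the sketch below. The sample size of 5 is an arbitrary choice, and the command may need further options (for example, to select the data source) that are not shown here; check the --help output for those.

```shell
# Arbitrary small sample size for a quick test run (assumption, pick what suits you).
sample_size=5

if command -v ecpo_ocrd >/dev/null 2>&1; then
    # Run inside the workspace directory created earlier.
    ecpo_ocrd workspace create --sample "$sample_size"
else
    echo "ecpo_ocrd not found; complete the installation steps first"
fi
```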

Then, you can run any of the provided workflow shell scripts, e.g.:

../workflows/jingbao.sh
