You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
added a string linking stage to ticcl, this adds extra markup information (t-str/t-correction) using the foliatextcontent tool, this is in turn needed by FLAT for proper visualisation. #62
@@ -557,6 +561,53 @@ if (!params.containsKey('nofoliacorrect')) {
557
561
"""
558
562
}
559
563
564
+
if (!params.containsKey('nostringlinking')) {
565
+
process linkstrings {
566
+
/*
567
+
This invokes a tool that adds text markup information (t-str and t-correction) linking to the substrings. It adds a level of redundancy that is needed for proper visualisation in FLAT.
568
+
*/
569
+
570
+
publishDir params.outputdir, mode: 'copy', overwrite: true//publish the output for the end-user to see (this is the final output)
571
+
572
+
input:
573
+
file foliadoc from foliacorrect_documents
574
+
val virtualenv from params.virtualenv
575
+
576
+
output:
577
+
file "*.ticcl.folia.xml" into folia_ticcl_documents
Copy file name to clipboardExpand all lines: webservice/picclservice/picclservice.py
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -48,7 +48,7 @@
48
48
#An informative description for this system (this should be fairly short, about one paragraph, and may not contain HTML)
49
49
SYSTEM_DESCRIPTION="PICCL offers a workflow for corpus building and builds on a variety of tools. The primary component of PICCL is TICCL; a Text-induced Corpus Clean-up system, which performs spelling correction and OCR post-correction (normalisation of spelling variants etc)."
50
50
51
-
SYSTEM_VERSION="0.9.4"#also change in codemeta.json and setup.py
51
+
SYSTEM_VERSION="0.9.5"#also change in codemeta.json and setup.py
52
52
53
53
SYSTEM_AUTHOR="Martin Reynaert, Maarten van Gompel, Ko van der Sloot"
description= ("Webservice for PICCL; a set of workflows for corpus building through OCR, post-correction, modernization of historic language and Natural Language Processing"),
0 commit comments