Skip to content

Commit aeab7b0

Browse files
committed
Update README.md
- update with CLI changes RE: parse
1 parent e984a3c commit aeab7b0

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -125,7 +125,7 @@ the tika-server jar and start it if you haven't done so already.
125125
```
126126
tika.py [-v] [-o <outputDir>] [--server <TikaServerEndpoint>] [--install <UrlToTikaServerJar>] [--port <portNumber>] <command> <option> <urlOrPathToFile>
127127
128-
tika.py parse all test.pdf | python -mjson.tool (pretty print Tika JSON output)
128+
tika.py parse all test.pdf test2.pdf (write output JSON metadata files for test1.pdf_meta.json and test2.pdf_meta.json)
129129
tika.py detect type test.pdf (returns mime-type as text/plain)
130130
tika.py language file french.txt (returns language e.g., fr as text/plain)
131131
tika.py translate fr:en french.txt (translates the file french.txt from french to english)
@@ -141,7 +141,7 @@ To check the configuration of the Tika server, use:
141141
tika.py config <mime-types | detectors | parsers>
142142
143143
Commands:
144-
parse = parse the input file and return a JSON doc containing the extracted metadata, text, or both
144+
parse = parse the input file and write a JSON doc file.ext_meta.json containing the extracted metadata, text, or both
145145
detect type = parse the stream and 'detect' the MIME/media type, return in text/plain
146146
language file = parse the file stream and identify the language of the text, return its 2 character code in text/plain
147147
translate src:dest = parse and extract text and then translate the text from source language to destination language

0 commit comments

Comments
 (0)