Replies: 5 comments 2 replies
-
I am getting this same error. |
Beta Was this translation helpful? Give feedback.
-
docker automatically resolves service names to its ip addresses internally, so you mustn't worry about correct IP adressing. That is, as long as you are using the same docker-compose file (because it creates a seperate unrouted network for each docker-compose you run.) If you are not going to use gotenberg and tika somewhere else, it is easier just pasting the in short: just use service names and be sure to include both services in your paperlesses docker-compose.yml. |
Beta Was this translation helpful? Give feedback.
-
I get the following error when I import an Open Office document:
The error has occurred since I updated to version 1.5.0. Before that, Open Office documents were imported and processed flawlessly. Now it aborts with the above error message. Everything else works fine. There is a small difference in the error message. For me it is a 404 error instead of 400. For the container I use the docker-compose file from the repository. My environment:
I have already removed the container and the images and created them again. How can I proceed further to isolate the error? Translated with Deepl.com |
Beta Was this translation helpful? Give feedback.
-
Since this topic is not entirely about paperless, but tika / gotenberg, maybe you try other configurations for this? I would tag the docker images inside your docker-compose.yml with a fixed version number. Maybe the underlying framework or API calls have been changed and we don't know about it? For Tika and gotenberg, i wouldn't want to use the latest versions, since I'm unaware of when support for both has been added/tested.
But those are only hints, since I don't use any of those containers myself. |
Beta Was this translation helpful? Give feedback.
-
I experienced the same issue with V1.5.0 and can confirm that specifying image V1.27 for Tika and V6 for Gotenberg solves the issue for me. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi community,
for the environment variables in the docker compose file for gotenberg and tika, I am wondering what's the right/best setup for the endpoints: referencing localhost as shown below, entering the dedicated IP of the server, or going with name of the service as shown in the documentation https://paperless-ng.readthedocs.io/en/latest/configuration.html#tika-settings
I am getting sometimes errors like:
`[2021-04-13 13:02:01,497] [ERROR] [paperless.consumer] Error while consuming document ID(181746_2)_Date(Mon, 16 Jan 2017 11_53_04 +0100)_Bruno macantay - EN.DOC: Error while converting document to PDF: 400 Client Error: Bad Request for url: http://gotenberg:3000/convert/office
Traceback (most recent call last):
File "/usr/src/paperless/src/paperless_tika/parsers.py", line 79, in convert_to_pdf
File "/usr/local/lib/python3.7/site-packages/requests/models.py", line 943, in raise_for_status
requests.exceptions.HTTPError: 400 Client Error: Bad Request for url: http://gotenberg:3000/convert/office
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/usr/src/paperless/src/documents/consumer.py", line 248, in try_consume_file
File "/usr/src/paperless/src/paperless_tika/parsers.py", line 65, in parse
File "/usr/src/paperless/src/paperless_tika/parsers.py", line 82, in convert_to_pdf
documents.parsers.ParseError: Error while converting document to PDF: 400 Client Error: Bad Request for url: http://gotenberg:3000/convert/office`
or
[2021-04-13
11:25:02,896] [ERROR] [paperless.consumer] Error while consuming document ID(255_3)_Date(Fri, 10 Aug 2001 20_03_10 +0200)_ABI93A.XLS: Could not parse /usr/src/paperless/src/../consume/ID(255_3)_Date(Fri, 10 Aug 2001 20_03_10 +0200)_ABI93A.XLS with tika server at http://192.168.1.60:9998: HTTPConnectionPool(host='192.168.1.60', port=9998): Max retries exceeded with url: /rmeta/text (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f9c7715ab10>: Failed to establish a new connection: [Errno 111] Connectionrefused'))
and wondering if this is related to how those services are setup?
Can anybody help? Any suggestions to avoid those errors?
Cheers,
Volker
Beta Was this translation helpful? Give feedback.
All reactions