Hi there. Reasonable indeed 🤣. OK, I didn't realize this was so bad. I'll do a pass through the docs. As for convert with inline images, it's part of the documented API for
@gsakkis please give me as comprehensive a review as you can. You are welcome to raise PRs as well.
As much as this project is an awesome piece of work, I find it hard to navigate and rely on without a lot of trial and error, mostly due to lacking or (worse) misleading documentation. A few recent examples:
- "The `force_ocr` parameter has been deprecated in favor of the new `ocr` configuration object." This is not true, at least as of the latest 4.2.9 release a few hours ago. `force_ocr` is still required to always call tesseract.
- `html_options` (mentioned only in the migration guide) is a dict instead of a dataclass, making typos in keys/values undetectable.
- `HtmlConversionOptions` in `_internal_bindings` but this doesn't exist at runtime: `AttributeError: module 'kreuzberg._internal_bindings' has no attribute 'HtmlConversionOptions'`
- `convert_as_inline` option that seems to do the work, though I'm not sure if it's by design or by accident.

Perhaps that's the price of massive AI-generated code, but any improvement in the overall documentation & consistency would be amazing!
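To make the dict-versus-dataclass point concrete, here is a minimal sketch in plain Python. It does not use the library's real API: `HtmlOptions`, its `heading_style` field, and the `options_from_dict` helper are hypothetical names made up for illustration, and placing `convert_as_inline` on this class is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HtmlOptions:
    """Hypothetical options object; the field names are illustrative only."""

    heading_style: str = "atx"
    convert_as_inline: bool = False


def options_from_dict(raw: dict) -> HtmlOptions:
    # Unknown keys raise TypeError here instead of being silently ignored.
    return HtmlOptions(**raw)


# A plain dict accepts a misspelled key without complaint, so the
# intended setting is never applied and nothing reports the mistake.
dict_options = {"convert_as_inlin": True}
typo_ignored = dict_options.get("convert_as_inline", False)

# The dataclass rejects the same typo at construction time.
try:
    options_from_dict({"convert_as_inlin": True})
    typo_caught = False
except TypeError:
    typo_caught = True
```

With a typed options object, a static type checker can also flag a misspelled keyword before the code runs, which a dict of string keys cannot offer.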