Replies: 1 comment 4 replies
-
Can you please try with this model? https://huggingface.co/ds4sd/SmolDocling-256M-preview-mlx-bf16 |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I'm having an issue with docling/smoldocling and I'm hoping someone has some ideas. I'm on an m4max mbp and docling byitself can convert a PDF I'm using but it takes over an hour for a 20MB PDF with very few images. I looked into using smoldocling via LMStudio with MLX, the same model that docling has in their documentation. It pretty immediately fails after LMS loads the model.
Here's the error I'm getting:
INFO:docling.datamodel.document:detected formats: [<InputFormat.PDF: 'pdf'>] INFO:docling.document_converter:Going to convert document batch... INFO:docling.document_converter:Initializing pipeline for VlmPipeline with options hash 6987930c9c009d6c95a83692a1c3f3c0 INFO:docling.pipeline.base_pipeline:Processing document sfdc_object_reference.pdf ERROR:docling.utils.api_image_request:Error calling the API. Response was {"error":"Error in iterating prediction stream: ValueError: Failed to process inputs with error: PreTrainedTokenizerFast._batch_encode_plus() got an unexpected keyword argument 'images'"} ERROR:docling.utils.api_image_request:Error calling the API. Response was {"error":"Error in iterating prediction stream: ValueError: Failed to process inputs with error: PreTrainedTokenizerFast._batch_encode_plus() got an unexpected keyword argument 'images'"} HTTPError: 400 Client Error: Bad Request for url: http://localhost:1234/v1/chat/completions
My code is almost exactly: https://github.com/docling-project/docling/blob/1dc63d0aa9c3763fcd2b9a8da3d9b817bd8666cb/docs/examples/vlm_pipeline_api_model.py
I don't necessarily need any of the images in the PDF. But I'm really not sure what's going on here. The model at https://huggingface.co/ds4sd/SmolDocling-256M-preview-mlx-bf16-docling-snap (from the example) claims to support images.
Beta Was this translation helpful? Give feedback.
All reactions