VlmTableOCR: got multiple values for keyword argument 'resized_shape' error

Error:

`Error recognizing table: Vlm(model_name=openai/gpt-40-mini) got multiple values for keyword argument 'resized_shape'`

<img width="955" height="194" alt="Image" src="https://github.com/user-attachments/assets/1b535d4a-a15c-4ea5-a972-2e928c1814b1" />

Code example:

```python
p2t = Pix2Text(table_ocr=vlm_table_ocr)

total_config = {
    'layout': None,
    "table": {
        "model_type": "VlmTableOCR",  # 指定类名
        "resized_shape": 798,
        "model_name": "openai/gpt-4o-mini",
        "api_key": "key"
    },
}

p2t = Pix2Text.from_config(total_configs=total_config)
result = p2t.recognize_page(img, resized_shape=798)
```

P.S.: seems like non of those resized_shape have affect

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

VlmTableOCR: got multiple values for keyword argument 'resized_shape' error #195

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

VlmTableOCR: got multiple values for keyword argument 'resized_shape' error #195

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions