I noticed that the `DatasetEvaluation` class includes a field `evaluated_samples`:
```python
class DatasetEvaluation(BaseModel):
    evaluated_samples: int = -1
    rejected_samples: Dict[EvaluationRejectionType, int] = {}
```
However, it seems the current evaluator classes only ever set this field while processing the entire dataset (test split) of a benchmark. I'm wondering if we could allow an arbitrary sample count to be passed during the evaluation-dataset construction phase. This would help speed up evaluation for benchmarks like OmniDocBench, which currently takes about an hour to complete on my machine. A rough sketch of what I have in mind is below.
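For illustration only, here is a minimal sketch of the idea: a hypothetical `max_samples` argument on the dataset-construction side that truncates the test split before evaluation. The `build_evaluation_dataset` helper and the `EvaluationRejectionType` members below are made up for this example and do not reflect the repo's actual API.

```python
# Sketch of a hypothetical `max_samples` option for dataset construction.
# Names of the helper and enum members are illustrative, not the real API.
from enum import Enum
from typing import Dict, List, Optional

from pydantic import BaseModel


class EvaluationRejectionType(str, Enum):
    # Placeholder members for the example
    INVALID_CONVERSION = "invalid_conversion"
    MISSING_PREDICTION = "missing_prediction"


class DatasetEvaluation(BaseModel):
    evaluated_samples: int = -1
    rejected_samples: Dict[EvaluationRejectionType, int] = {}


def build_evaluation_dataset(
    samples: List[dict],
    max_samples: Optional[int] = None,  # hypothetical new parameter
) -> List[dict]:
    """Return the samples to evaluate, optionally capped at max_samples."""
    if max_samples is not None and max_samples > 0:
        return samples[:max_samples]
    return samples


# Example: evaluate only the first 50 samples instead of the full test split
# to get a quick signal without waiting for the full run.
subset = build_evaluation_dataset(
    samples=[{"id": i} for i in range(500)],
    max_samples=50,
)
evaluation = DatasetEvaluation(evaluated_samples=len(subset))
print(evaluation.evaluated_samples)  # 50
```

Even a simple truncation like this would make quick iteration on OmniDocBench much more practical, while the default (no cap) would keep the current full-split behavior unchanged.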