Skip to content

RPA Desktop OCR - Use confidence from Tesseract result #1303

@adombeck

Description

@adombeck

Currently, the find function in packages/recognition/src/RPA/recognition/ocr.py ignores the confidence values from the Tesseract result. Instead, a "confidence" value is calculated by calculating the similarity between the text it tries to find and the text returned by Tesseract. That is also useful, but it's different from the confidence returned by Tesseract. Ideally, it should return both the confidence value from Tesseract and the text similarity.

I’m willing to submit a PR to implement this. Before I proceed, could you confirm whether you’re open to such a contribution and whether the project is actively maintained enough for the PR to be reviewed?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions