An on-premises, OCR-free toolkit for unstructured data extraction, Markdown conversion, and benchmarking. (https://idp-leaderboard.org/)
nlp machine-learning ocr extraction document onprem document-analysis table-extraction unstructured-data rag onpremise llms vlms document-information-extraction ocr-onpremise document-data-extraction onprem-vision onprem-ocr llm-ocr ocr-benchmark
- Updated Aug 25, 2025
- Python