Skip to content

Releases: opendatalab/MinerU

mineru-2.7.6-released

06 Feb 03:39
e31c0ec

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: mineru-2.7.5-released...mineru-2.7.6-released

mineru-2.7.5-released

02 Feb 11:59
4dc45f6

Choose a tag to compare

What's Changed

  • Fix the issue where PDF rendering timeout detection fails under certain conditions.

Full Changelog: mineru-2.7.4-released...mineru-2.7.5-released

mineru-2.7.4-released

30 Jan 13:48
c2c998a

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: mineru-2.7.3-released...mineru-2.7.4-released

mineru-2.7.3-released

26 Jan 11:40
e2282af

Choose a tag to compare

mineru-2.7.2-released

23 Jan 13:41
d832388

Choose a tag to compare

What's Changed

  • 2026/01/23 2.7.2 Release

    • Cross-page table merging optimization, improving merge success rate and merge quality
  • 2026/01/23 2.7.2 发布

New Contributors

Full Changelog: mineru-2.7.1-released...mineru-2.7.2-released

mineru-2.7.1-released

06 Jan 06:59
45f8ad1

Choose a tag to compare

What's Changed

  • 2026/01/06 2.7.1 Release

    • fix bug: #4300
    • Updated pdfminer.six dependency version to resolve CVE-2025-64512
    • Support automatic correction of input image exif orientation to improve OCR recognition accuracy #4283
  • 2026/01/06 2.7.1 发布

    • fix bug: #4300
    • 更新pdfminer.six的依赖版本以解决 CVE-2025-64512
    • 支持输入图像的exif方向自动校正,提升OCR识别效果 #4283

New Contributors

Full Changelog: mineru-2.7.0-released...mineru-2.7.1-released

mineru-2.7.0-released

30 Dec 10:24
ed6894c

Choose a tag to compare

What's Changed

  • 2025/12/30 2.7.0 Release

    • Simplified installation process. No need to separately install vlm acceleration engine dependencies. Using uv pip install mineru[all] during installation will install all optional backend dependencies.
    • Added new hybrid backend, which combines the advantages of pipeline and vlm backends. Built on vlm, it integrates some capabilities of pipeline, adding extra extensibility on top of high accuracy:
      • Directly extracts text from text PDFs, natively supports multi-language recognition in text PDF scenarios, and greatly reduces parsing hallucinations;
      • Supports text recognition in 109 languages for scanned PDF scenarios by specifying OCR language;
      • Independent inline formula recognition switch, which can be disabled separately when inline formula recognition is not needed, improving the visual effect of parsing results.
    • Simplified engine selection logic for vlm/hybrid backends. Users only need to specify the backend as *-auto-engine, and the system will automatically select the appropriate engine for inference acceleration based on the current environment, improving usability.
    • Switched default parsing backend from pipeline to hybrid-auto-engine, improving out-of-the-box result consistency for new users and avoiding cognitive differences in parsing results.
    • Added i18n support to gradio application, supporting switching between Chinese and English languages.
  • 2025/12/30 2.7.0 发布

    • 简化安装流程,现在不再需要单独安装vlm加速引擎依赖包,安装时使用uv pip install mineru[all]即可安装所有可选后端的依赖包。
    • 增加全新后端hybrid,该后端结合了pipelinevlm后端的优势,在vlm的基础上,融入了pipeline的部分能力,在高精度的基础上增加了额外的扩展性:
      • 从文本pdf中直接抽取文本,在文本pdf场景原生支持多语言识别,并极大减少解析幻觉;
      • 通过指定ocr语言,在扫描pdf场景下支持109种语言的文本识别;
      • 独立的行内公式识别开关,在不需要行内公式识别的场景下可单独关闭,提升解析结果视觉效果。
    • 简化vlm/hybrid后端的引擎选择逻辑,用户只需指定后端为*-auto-engine,系统会根据当前环境自动选择合适的引擎进行推理加速,提升易用性.
    • 默认解析后端从pipeline切换至hybrid-auto-engine,提升新用户开箱即用的结果一致性,避免出现解析结果认知差异。
    • gradio应用增加i18n适配,支持中英文两种语言切换。

Full Changelog: mineru-2.6.8-released...mineru-2.7.0-released

mineru-2.6.8-released

15 Dec 10:25
4ccb5e0

Choose a tag to compare

mineru-2.6.7-released

12 Dec 09:25
f4e4050

Choose a tag to compare

mineru-2.6.6-released

01 Dec 19:54
cf489fe

Choose a tag to compare

What's Changed

  • 2025/12/02 2.6.6 Release

    • mineru-api tool optimizations
      • Added descriptive text to mineru-api interface parameters to improve API documentation readability.
      • You can use the environment variable MINERU_API_ENABLE_FASTAPI_DOCS to control whether the auto-generated interface documentation page is enabled (enabled by default).
      • Added concurrency configuration options for the vlm-vllm-async-engine, vlm-lmdeploy-engine, and vlm-http-client backends. Users can use the environment variable MINERU_API_MAX_CONCURRENT_REQUESTS to set the maximum number of concurrent API requests (unlimited by default).
  • 2025/12/02 2.6.6 发布

    • Ascend适配优化
      • 优化命令行工具初始化流程,使Ascend适配方案中vlm-vllm-engine后端在命令行工具中可用。
      • 为Atlas 300I Duo(310p)设备更新适配文档。
    • mineru-api工具优化
      • mineru-api接口参数增加描述性文本,优化接口文档可读性。
      • 可通过环境变量MINERU_API_ENABLE_FASTAPI_DOCS控制是否启用自动生成的接口文档页面,默认为启用。
      • vlm-vllm-async-enginevlm-lmdeploy-enginevlm-http-client后端增加并发数配置选项,用户可通过环境变量MINERU_API_MAX_CONCURRENT_REQUESTS控制api接口的最大并发请求数,默认为不限制数量。

New Contributors

Full Changelog: mineru-2.6.5-released...mineru-2.6.6-released