Skip to content
@SCUT-DLVCLab

SCUT-DLVCLab

华南理工大学深度学习与视觉计算实验室

About Us 🚀

The Deep Learning and Vision Computing Lab is dedicated to advanced theoretical research and innovative applications in the fields of artificial intelligence, computer vision, machine learning, and pattern recognition. Our current research focuses on deep learning, text detection and recognition, document analysis and understanding, and artificial intelligence. In recent years, our team has led more than 30 national and provincial research projects, making significant achievements in optical character recognition (OCR), handwriting recognition, gesture recognition and interaction technology, and innovative applications of deep learning. We have published over 300 SCI/EI papers, obtained more than 50 authorized invention patents, won 5 provincial and ministerial science and technology awards, and achieved first place in international academic competitions 4 times.

Pinned Loading

  1. OCR-Reasoning OCR-Reasoning Public

    [arXiv: 2505.17163] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning

    Python 64 3

  2. TongGu-LLM TongGu-LLM Public

    [EMNLP 2024] TongGu, a classical Chinese language model.

    46 3

  3. GPT-4V_OCR GPT-4V_OCR Public

    Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)

    Python 125 4

  4. Document-AI-Recommendations Document-AI-Recommendations Public

    Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

    200 9

  5. MegaHan97K MegaHan97K Public

    [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories"

    Python 63 5

  6. HisDoc1B HisDoc1B Public

    15 1

Repositories

Showing 10 of 22 repositories
  • MCCD Public

    [ICDAR 2025] The official GitHub page of "MCCD: A Multi-Attribute Chinese Calligraphy Character Dataset Annotated with Script Styles, Dynasties, and Calligraphers"

    SCUT-DLVCLab/MCCD’s past year of commit activity
    Python 6 0 2 0 Updated Sep 2, 2025
  • LongHisDoc Public

    A Comprehensive Benchmark for Chinese Long Historical Document Understanding

    SCUT-DLVCLab/LongHisDoc’s past year of commit activity
    Python 4 0 0 0 Updated Aug 6, 2025
  • OCR-Reasoning Public

    [arXiv: 2505.17163] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning

    SCUT-DLVCLab/OCR-Reasoning’s past year of commit activity
    Python 64 Apache-2.0 3 2 0 Updated Aug 4, 2025
  • DOLPHIN Public

    [IEEE TIFS 2024] Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach

    SCUT-DLVCLab/DOLPHIN’s past year of commit activity
    Python 54 GPL-3.0 0 1 0 Updated Aug 3, 2025
  • PAVENet Public

    [IEEE TPAMI 2025] Privacy-Preserving Biometric Verification With Handwritten Random Digit String

    SCUT-DLVCLab/PAVENet’s past year of commit activity
    Python 63 GPL-3.0 0 1 0 Updated Aug 3, 2025
  • AutoHDR Public

    [ACL 2025 main] The official GitHub page of "Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration"

    SCUT-DLVCLab/AutoHDR’s past year of commit activity
    Python 43 3 2 0 Updated Jul 21, 2025
  • MegaHan97K Public

    [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories"

    SCUT-DLVCLab/MegaHan97K’s past year of commit activity
    Python 63 5 2 0 Updated Jul 16, 2025
  • SigBench Public
    SCUT-DLVCLab/SigBench’s past year of commit activity
    0 GPL-3.0 0 0 0 Updated Jun 19, 2025
  • AutoScaler Public

    [PR 2026] The official GitHub page of "AutoScaler: Self Scale Alignment for Handwritten Mathematical Expression Recognition"

    SCUT-DLVCLab/AutoScaler’s past year of commit activity
    Python 7 0 0 0 Updated Jun 8, 2025
  • ACP-RAG Public

    [NAACL 2025] Large-Scale Corpus Construction and Retrieval-Augmented Generation for Ancient Chinese Poetry: New Method and Data Insights (ACP-Corpus; ACP-QA; ACP-RAG)

    SCUT-DLVCLab/ACP-RAG’s past year of commit activity
    Python 3 0 0 0 Updated May 6, 2025

Top languages

Loading…

Most used topics

Loading…