Pinned

  1. VLM-R1 Public

    Solve Visual Understanding with Reinforced VLMs

    Python, 5.8k stars, 378 forks

  2. OmDet Public

    Real-time and accurate open-vocabulary end-to-end object detection

    Python, 1.4k stars, 113 forks

  3. OmAgent Public

    [EMNLP-2024] Build multimodal language agents for fast prototype and production

    Python, 2.6k stars, 287 forks

  4. VLM-FO1 Public

    VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs

    Python, 237 stars, 13 forks

  5. OpenTrackVLA Public

    Open & Reproducible Research for Tracking VLAs

    Python, 133 stars, 8 forks

  6. ZoomEye Public

    [EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration

    Python, 75 stars, 7 forks

Repositories

Showing 10 of 21 repositories
  • ImageRAG Public

    Enhancing Ultrahigh Resolution Remote Sensing Imagery Analysis With ImageRAG [GRSM]

    Jupyter Notebook 28 MIT 1 1 0 Updated Feb 4, 2026
  • OpenTrackVLA Public

    Open & Reproducible Research for Tracking VLAs

    Python 133 8 3 0 Updated Dec 24, 2025
  • VLM-FO1 Public

    VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs

    Python 237 13 7 0 Updated Nov 28, 2025
  • ZoomEye Public

    [EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration

    Python 75 7 7 0 Updated Nov 20, 2025
  • VLM-R1 Public

    Solve Visual Understanding with Reinforced VLMs

    Python 5,836 Apache-2.0 378 164 0 Updated Oct 21, 2025
  • om-ai-lab.github.io Public

    Official website for the org

    HTML 0 1 0 0 Updated Aug 15, 2025
  • open-agent-leaderboard Public

    Reproducible Language Agent Research

    Python 33 2 0 0 Updated Jun 25, 2025
  • vlm-r1seg Public
    Python 4 1 0 0 Updated Apr 28, 2025
  • VLM-R1.github.io Public

    Blog Site for VLM-R1

    HTML 1 0 0 0 Updated Mar 20, 2025
  • OmAgent Public

    [EMNLP-2024] Build multimodal language agents for fast prototype and production

    Python 2,628 Apache-2.0 287 6 12 Updated Mar 19, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.