    Repositories list

    • InternVLA-N1: An Open Dual-System Vision-Language Navigation Foundation Model with Learned Latent Plans
      JavaScript
      1000 Updated Sep 11, 2025
    • InternNav

      Public
      InternRobotics' open platform for building generalized navigation foundation models.
      Python
      1920340 Updated Sep 11, 2025
    • F1-VLA

      Public
      F1: A Vision Language Action Model Bridging Understanding and Generation to Actions
      Python
      56510 Updated Sep 9, 2025
    • An All-in-one robot manipulation learning suite for policy models training and evaluation on various datasets and benchmarks.
      Python
      612820 Updated Sep 8, 2025
    • Documentation of Intern Robotics Platform & Toolkits
      Python
      1200 Updated Sep 5, 2025
    • NavDP

      Public
      Official implementation of the paper: "NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance"
      Python
      1022540 Updated Sep 5, 2025
    • A simulation platform for versatile Embodied AI research and developments.
      Python
      601k140 Updated Sep 4, 2025
    • OpenHomie

      Public
      Open-sourced code for "HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit".
      C++
      3540600 Updated Sep 1, 2025
    • StreamVLN

      Public
      Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
      Python
      1122391 Updated Aug 31, 2025
    • LaSP

      Public
      [EMNLP'25] Code for paper `Language-to-Space Programming for Training-Free 3D Visual Grounding`.
      Python
      0700 Updated Aug 28, 2025
    • Re3Sim

      Public
      Official implementation of "Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation"
      Jupyter Notebook
      411170 Updated Aug 21, 2025
    • GenManip

      Public
      [CVPR 2025] Official implementation of "GenManip: LLM-driven Simulation for Generalizable Instruction-Following Manipulation"
      Python
      06420 Updated Aug 15, 2025
    • PointLLM

      Public
      [ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large Language Models to Understand Point Clouds
      Python
      4488160 Updated Aug 14, 2025
    • A versatile, all-in-one toolbox for whole-body humanoid robot control.
      Python
      37900 Updated Aug 11, 2025
    • [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
      Python
      05100 Updated Aug 8, 2025
    • MeshCoder

      Public
      1627510 Updated Aug 5, 2025
    • AnySplat

      Public
      [SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views
      Python
      14394171 Updated Jul 31, 2025
    • CronusVLA

      Public
      [arXiv 2025] CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation
      03700 Updated Jul 27, 2025
    • InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation
      13920 Updated Jul 27, 2025
    • InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.
      Python
      29000 Updated Jul 25, 2025
    • C++
      617240 Updated Jul 25, 2025
    • .github

      Public
      2000 Updated Jul 25, 2025
    • InternSR

      Public
      InternRobotics' open-source toolbox for vision-based embodied spatial intelligence.
      Python
      03700 Updated Jul 25, 2025
    • OST-Bench

      Public
      OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
      Python
      15900 Updated Jul 24, 2025
    • PPI

      Public
      [RSS 2025] Gripper Keypose and Object Pointflow as Interfaces for Bimanual Robotic Manipulation
      Python
      06710 Updated Jul 22, 2025
    • UniHSI

      Public
      [ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts
      Python
      1323210 Updated Jul 15, 2025
    • Seer

      Public
      [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
      Python
      1122650 Updated Jul 8, 2025
    • Aether

      Public
      [ICCV 2025] Aether: Geometric-Aware Unified World Modeling
      Python
      447420 Updated Jul 7, 2025
    • HorizonGS

      Public
      [CVPR 2025] Horizon-GS: Unified 3D Gaussian Splatting for Large-Scale Aerial-to-Ground Scenes
      C++
      692120 Updated Jul 5, 2025
    • HoST

      Public
      [RSS 2025 Best Systems Paper Finalist] 💐Official implementation of "Learning Humanoid Standing-up Control across Diverse Postures"
      Python
      46384140 Updated Jun 17, 2025