Skip to content
Change the repository type filter

All

    Repositories list

    • AVI-Bench

      Public
      Toward Human-like Audio-Visual Intelligence of Omni-MLLMs
      0000Updated Feb 16, 2026Feb 16, 2026
    • FMBench

      Public
      Python
      0010Updated Feb 4, 2026Feb 4, 2026
    • SAM3-DMS

      Public
      Decoupled Memory Selection for Multi-target Video Segmentation of SAM3
      Python
      33600Updated Jan 16, 2026Jan 16, 2026
    • [NeurIPS 2025 (Spotlight)] SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation
      Python
      12910Updated Dec 19, 2025Dec 19, 2025
    • FFSE

      Public
      [AAAI 2026] Free-Form Scene Editor: Enabling Multi-Round Object Manipulation Like in a 3D Engine
      0310Updated Dec 13, 2025Dec 13, 2025
    • Ref-SAM3D

      Public
      Python
      01900Updated Dec 8, 2025Dec 8, 2025
    • SAAS

      Public
      [AAAI 2026] Segment Anything Across Shots: A Method and Benchmark
      Python
      32730Updated Nov 16, 2025Nov 16, 2025
    • Papers

      Public
      PDF
      0000Updated Nov 16, 2025Nov 16, 2025
    • OmniAVS

      Public
      [ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
      Python
      28330Updated Sep 29, 2025Sep 29, 2025
    • MOVE

      Public
      [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation
      Python
      38700Updated Sep 8, 2025Sep 8, 2025
    • AnyI2V

      Public
      [ICCV 2025] AnyI2V: Animating Any Conditional Image with Motion Control Generation
      Python
      711940Updated Aug 24, 2025Aug 24, 2025
    • A Survey of Image Editing
      1346520Updated Aug 24, 2025Aug 24, 2025
    • SynFMC

      Public
      [ICCV 2025] Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation
      Python
      25620Updated Aug 24, 2025Aug 24, 2025
    • .github

      Public
      0000Updated Jul 1, 2025Jul 1, 2025
    • MeViS

      Public
      [ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
      Python
      21000Updated Jun 24, 2024Jun 24, 2024
    • MOSE-api

      Public
      [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
      Python
      6000Updated Nov 23, 2023Nov 23, 2023
    • ReLA

      Public
      [CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation
      Python
      22000Updated Sep 5, 2023Sep 5, 2023
    • gRefCOCO

      Public
      A benchmark dataset for GRES and GREC [CVPR2023 Highlight]
      Python
      5000Updated Sep 4, 2023Sep 4, 2023