Skip to content

Fantasy AIGC Family

Fantasy AIGC Family is an open-source initiative exploring Human-centric AI, World Modeling, and Human-World Interaction, aiming to bridge perception, understanding, and generation in the real and digital worlds.

🔥🔥🔥 News!!

  • 📢 Jan 2026 – We released the training and inference code and model weights of FantasyVLN.
  • 🏆 Dec 2025 - FantasyWorld ranked 1st on the WorldScore Leaderboard (by Stanford Prof. Fei-Fei Li's Team), validating our approach against global state-of-the-art models.
  • 🏛 Nov 2025 – Two papers from our family, FantasyTalking2 and FantasyHSI, have been accepted to AAAI 2026.
  • 🏛 Nov 2025 – Two papers from our family, FantasyTalking2 and FantasyHSI, have been accepted to AAAI 2026.
  • 🏛 Jul 2025FantasyTalking is accepted by ACM MM 2025.
  • 📢 Apr 2025 – We released the inference code and model weights of FantasyTalking and FantasyID.

✨✨✨ Members

FantasyVLN

Project arXiv GitHub GitHub Stars HuggingFace Model ModelScope

A unified multimodal Chain-of-Thought (CoT) reasoning framework that enables efficient and precise navigation based on natural language instructions and visual observations.

FantasyWorld

Project arXiv GitHub

Corresponds to the "Worlds" dimension. A unified world model integrating video priors and geometric grounding for synthesizing explorable and geometrically consistent 3D scenes. It emphasizes spatiotemporal consistency driven by Action and serves as a verifiable structural anchor for spatial intelligence.

FantasyTalking

Conference Project arXiv GitHub GitHub Stars HuggingFace Model HuggingFace Space ModelScope

The first Wan-based high-fidelity audio-driven avatar system that synchronizes facial expressions, lip motion, and body gestures in dynamic scenes through dual-stage audio-visual alignment and controllable motion modulation.

FantasyTalking2

Conference Project arXiv GitHub

A novel Timestep-Layer Adaptive Multi-Expert Preference Optimization (TLPO) method enhances the quality of audio-driven avatar in three dimensions: lip-sync, motion naturalness, and visual quality.

FantasyPortrait

Project arXiv GitHub GitHub Stars

A novel expression-driven video-generation method that pairs emotion-enhanced learning with masked cross-attention, enabling the creation of high-quality, richly expressive animations for both single and multi-portrait scenarios.

FantasyHSI

Conference Project arXiv GitHub

Corresponds to the "Interaction" dimension. A graph-based multi-agent framework that grounds video generation within 3D world dynamics. It unifies the action space with a broader interaction loop, transforming video generation from a content endpoint into a control channel for interactive systems.

FantasyID

Project arXiv GitHub GitHub Stars HuggingFace Model ModelScope

A tuning-free text-to-video model that leverages 3D facial priors, multi-view augmentation, and layer-aware guidance injection to deliver dynamic, identity-preserving video generation.

🌟🌟🌟 Our wishes.

  1. Giving Back to the Community: In our daily work, we benefit immensely from the resources, expertise, and support of the open source community, and we aim to give back by making our own projects open source.
  2. Attracting More Contributors: By open sourcing our code, we invite developers worldwide to collaborate—making our models smarter, our engineering more robust, and extending benefits to even more users.
  3. Building an Open Ecosystem: We believe that open source brings together diverse expertise to create a collaborative innovation platform—driving technological progress, industry growth, and broader societal impact.

Pinned Loading

  1. fantasy-talking fantasy-talking Public

    [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

    Python 1.6k 125

  2. fantasy-portrait fantasy-portrait Public

    FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers

    Python 497 34

  3. fantasy-talking2 fantasy-talking2 Public

    [AAAI 2026] FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation

    64 2

  4. fantasy-hsi fantasy-hsi Public

    [AAAI 2026] FantasyHSI: Video-Generation-Centric 4D Human Synthesis In Any Scene through A Graph-based Multi-Agent Framework

    12 3

Repositories

Showing 9 of 9 repositories
  • fantasy-amap.github.io Public

    The homepage of Fantasy AIGC Family.

    Fantasy-AMAP/fantasy-amap.github.io’s past year of commit activity
    HTML 0 0 0 0 Updated Jan 22, 2026
  • .github Public
    Fantasy-AMAP/.github’s past year of commit activity
    HTML 0 0 0 0 Updated Jan 22, 2026
  • fantasy-vln Public

    Official implementation of FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-and-Language Navigation

    Fantasy-AMAP/fantasy-vln’s past year of commit activity
    Jupyter Notebook 7 Apache-2.0 0 0 0 Updated Jan 21, 2026
  • fantasy-world Public

    FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction

    Fantasy-AMAP/fantasy-world’s past year of commit activity
    49 Apache-2.0 1 1 0 Updated Jan 8, 2026
  • fantasy-talking Public

    [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

    Fantasy-AMAP/fantasy-talking’s past year of commit activity
    Python 1,617 Apache-2.0 125 43 0 Updated Jan 8, 2026
  • fantasy-hsi Public

    [AAAI 2026] FantasyHSI: Video-Generation-Centric 4D Human Synthesis In Any Scene through A Graph-based Multi-Agent Framework

    Fantasy-AMAP/fantasy-hsi’s past year of commit activity
    12 Apache-2.0 3 2 0 Updated Sep 3, 2025
  • fantasy-portrait Public

    FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers

    Fantasy-AMAP/fantasy-portrait’s past year of commit activity
    Python 497 Apache-2.0 34 13 0 Updated Aug 20, 2025
  • fantasy-talking2 Public

    [AAAI 2026] FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation

    Fantasy-AMAP/fantasy-talking2’s past year of commit activity
    64 2 4 0 Updated Aug 20, 2025
  • fantasy-id Public

    FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation

    Fantasy-AMAP/fantasy-id’s past year of commit activity
    Python 78 Apache-2.0 6 1 0 Updated Aug 20, 2025

Most used topics

Loading…