VISAGE

This is the implementation of our paper Perceiving exposure segregation with open urban imagery.

💡 Introduction

VISAGE is the first AI Urban Scientist framework designed for autonomous urban sensing. It addresses the challenge of measuring socioeconomic exposure segregation—the degree to which daily encounters cross different income groups—using only open satellite and street-level imagery.

By bridging high-level social theory with large-scale observation through automated reasoning, VISAGE completes a full scientific closed-loop:

Literature Agent: LLM agents distill cross-disciplinary theory from extensive literature into an interpretable visual codebook.
Experiment Agent: Automatically detects codebook cues in imagery and generates structured, stepwise reasoning templates.
Perception Agent: A domain-adapted Large Multi-modal Model (LMM) that reasons from scene semantics to infer exposure segregation.
Feedback Loop: Out-of-sample performance is fed back to the system to iteratively update hypotheses and reasoning protocols.

🌟 Framework

VISAGE reframes urban perception as an interpretable reasoning task by organizing three specialized agents into a closed-loop discovery process.

⚙️ Installation

Environment

OS: Linux (Ubuntu 20.04/22.04 recommended).
Python: >= 3.9.
GPU: Training requires 4 x NVIDIA A100 (80GB VRAM) for multi-modal processing.

Setup

# Clone the repository
git clone https://github.com/tsinghua-fib-lab/VISAGE.git
cd VISAGE

# Create and activate environment
conda create -n visage python=3.9
conda activate visage

# Install dependencies
pip install -r requirements.txt

Estimated installation time: ~10 minutes

🚀 Quick Start

1. Literature Agent Workflow

Automate the knowledge distillation process to generate a visual cue codebook:

python scripts/run_literature_agent.py --config configs/literature.yaml

2. Cue Detection & Aggregation

Process community imagery to generate normalized frequency tables:

python scripts/extract_cues.py --data_path ./data/communities/ --codebook ./codebooks/final_v1.json

3. Perception Inference

Run the domain-adapted LMM to infer segregation indices with Chain-of-Thought traces:

python scripts/run_perception.py --task inference --communities ./data/test_split.json

📊 Performance

VISAGE establishes a reliable, scalable pathway for urban perception using open data.

Predictive Reliability: Achieves a Pearson correlation of $r=0.770$ across 10,030 communities in 31 U.S. cities.
Mechanism Discovery: Unravels how "defensiveness" cues (e.g., high fences) drive segregation, while "interaction" cues (e.g., public spaces) foster mixing.
Policy Sensitivity: Successfully evaluates the impact of Inclusionary Housing programs, identifying lower segregation in policy-active areas.

📧 Contact

For questions regarding the code or data, please contact:

Yong Li: liyong07@tsinghua.edu.cn
FIB-Lab, Department of Electronic Engineering, Tsinghua University

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
agents		agents
assets		assets
data		data
evaluation		evaluation
experiments		experiments
scripts		scripts
tools		tools
workflows		workflows
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VISAGE

💡 Introduction

🌟 Framework

⚙️ Installation

Environment

Setup

🚀 Quick Start

1. Literature Agent Workflow

2. Cue Detection & Aggregation

3. Perception Inference

📊 Performance

📧 Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

VISAGE

💡 Introduction

🌟 Framework

⚙️ Installation

Environment

Setup

🚀 Quick Start

1. Literature Agent Workflow

2. Cue Detection & Aggregation

3. Perception Inference

📊 Performance

📧 Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages