sidewalk-validator-ai

This project trains and tests AI models to automatically validate crowdsourced accessibility labels—such as Crosswalks, Curb Ramps, Surface Problems, and Obstacles—for Project Sidewalk.

The system first creates a dataset by fetching Google Street View panoramas and utilizing the "Depth Anything V2" model to generate precise, depth-aware image crops of specific accessibility features.
It then fine-tunes a computer vision model based on the DINOv2 architecture to classify these labels as either "correct" or "incorrect" based on user agreement data.
Finally, the pipeline includes comprehensive evaluation tools that generate precision-recall curves and confidence statistics to determine the model's readiness for deployment

Setup

First, please setup the conda environment:

conda env create -f environment.yml

Now, download the depth anything checkpoint that we use for crop extraction:

wget https://huggingface.co/depth-anything/Depth-Anything-V2-Metric-VKITTI-Large/resolve/main/depth_anything_v2_metric_vkitti_vitl.pth

❗An important note

Please don't forget to activate your conda environment before running any slurm scripts or python files!

Dataset Generation

Before we can train any validator models, we need to create a dataset. To do this for all five label types (Crosswalk, CurbRamp, NoCurbRamp, Obstacle, SurfaceProblem), simply run:

sbatch data_generation.slurm

Once this slurm job starts, you should see the creation of a folder for each label type. It might take some time for the scripts to initialize, though, so don't be alarmed if it doesn't appear instantly. You can check the logs of your slurm job by looking in the log file that was generated immediately after the slurm job started.

Once the job is finished (which will take many hours, possibly even days), each label type folder should have a subfolder ds (which stands for dataset). This is the dataset we will use for the next step, training!

Model Training

Now that we have the datasets for each label type, we want to train a validator model for each label type.

To do this for all label types, simply run:

sbatch fine_tune.slurm

This will take a long time. By the end of it, each label type folder should have a dinov2 folder with model checkpoints for both the best model and the latest model.

Model Evaluation

To evaluate model performance for all label types on the test split, simply run:

sbatch get_stats.slurm

After the job is finished, you'll have some files generated in each label type folder:

confidence_precision_map.json
- This maps model confidence to actual precision.
pvr.png
- The precision-recall curve for the model (Remember, each label type gets its own binary classification model).
report.png
- Basically the precision recall curve, but with some additional stats included in text.

Model Deployment

To use the newly trained models in production, upload the contents the best checkpoint in the dinov2 folder into a new Hugging Face model. Example here. In the future, it might be helpful to have a script to automatically upload to Hugging Face instead of manually doing it.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Depth-Anything-V2		Depth-Anything-V2
.gitignore		.gitignore
README.md		README.md
data_generation.py		data_generation.py
data_generation.slurm		data_generation.slurm
environment.yml		environment.yml
fine_tune.slurm		fine_tune.slurm
fine_tune_classifier.py		fine_tune_classifier.py
get_stats.py		get_stats.py
get_stats.slurm		get_stats.slurm
split_data.py		split_data.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

sidewalk-validator-ai

Setup

❗An important note

Dataset Generation

Model Training

Model Evaluation

Model Deployment

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

sidewalk-validator-ai

Setup

❗An important note

Dataset Generation

Model Training

Model Evaluation

Model Deployment

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages