FEAT: Added VLSU Multimodal Dataset #1309

riyosha · 2026-01-07T09:18:31Z

Description

Added the VLSU multimodal dataset. It contains multimodal prompts (text + images) with safety grades and categories for the text, the image, and the two combined. The loader creates text prompts, image prompts, and multimodal prompts from the same data.

Closes #1285.

Files Changed:

pyrit/datasets/seed_datasets/remote/vlsu_multimodal_dataset.py (new)
pyrit/datasets/seed_datasets/remote/__init__.py (updated exports)
pyrit/datasets/__init__.py (updated exports)
tests/unit/datasets/test_vlsu_multimodal_dataset.py (new)
pyrit/models/seed.py (added optional prompt_text option in the image prompt)

Features:

Loads image-text pairs from the official ML-VLSU repository
Creates prompts based on safety grades:
-- Text-only prompts: When text_grade is unsafe/borderline
-- Image-only prompts: When image_grade is unsafe/borderline
-- Combined prompts: When combined_grade is unsafe/borderline (captures emergent harm)
Supports filtering by 15 harm categories (e.g., hate speech, discrimination, violence)
Configurable unsafe_grades parameter to control which grades trigger prompt creation (options are safe, unsafe, borderline, not_sure)
Supports random sampling with a configurable limit and seed (for quicker testing; I can remove this if undesirable)
Caches downloaded images locally for faster subsequent loads
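
For readers skimming the feature list, the grade-based selection described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `Row`, `prompt_kinds`, and `DEFAULT_UNSAFE_GRADES` are hypothetical names, and the default of unsafe plus borderline is an assumption taken from the bullets above.

```python
from dataclasses import dataclass

# Assumed default: the grades the description says trigger prompt creation.
# The PR exposes this as the configurable `unsafe_grades` parameter.
DEFAULT_UNSAFE_GRADES = ("unsafe", "borderline")


@dataclass
class Row:
    """Hypothetical stand-in for one VLSU record (not the real schema)."""

    text: str
    image_url: str
    text_grade: str
    image_grade: str
    combined_grade: str


def prompt_kinds(row, unsafe_grades=DEFAULT_UNSAFE_GRADES):
    """Return which prompt kinds this row would produce under the described rules."""
    kinds = []
    if row.text_grade in unsafe_grades:
        kinds.append("text")
    if row.image_grade in unsafe_grades:
        kinds.append("image")
    if row.combined_grade in unsafe_grades:
        # Emergent harm: the pair can be graded unsafe even when each part is safe.
        kinds.append("combined")
    return kinds
```

A row whose text and image are each graded safe but whose combination is graded unsafe would yield only a combined prompt, which is the emergent-harm case mentioned above.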

Tests and Documentation

Some essential unit tests added in tests/unit/datasets/test_vlsu_multimodal_dataset.py
-- Test dataset name property
-- Test initialization with categories
-- Test invalid categories raise ValueError
-- Test text-only prompt creation (unsafe text grade)
-- Test image-only prompt creation (unsafe image grade)
-- Test combined prompt creation (unsafe combined grade)
-- Test all prompts created when all grades unsafe
-- Test borderline grades trigger prompt creation
-- Test no prompts when all grades safe (expects error)
-- Test category filtering
-- Test handling of failed image downloads
pre-commit run --all-files passes

rlundeen2 · 2026-01-10T06:01:58Z

pyrit/datasets/seed_datasets/remote/vlsu_multimodal_dataset.py

+                group_id = uuid.uuid4()
+
+            # Create text prompt if text_grade is unsafe or borderline
+            if text_grade in self.unsafe_grades:


If I understand this dataset correctly, each line in the dataset has both a text part and an image part. So I would just say if the harm is sufficient, make sure both have the same group id and add them

riyosha · 2026-01-10

The text and image parts might each have a different safety grade than the combined prompt. For instance, the text and image might each be harmful on their own, but their combination can be a safe prompt. I thought it could be useful to create individual text and image prompts so they can be used in cases where the combined prompt would be skipped.

I could create the unsafe text/image prompts only when the combined prompt is safe, and skip them otherwise. Or if you think it's better to remove them completely, I can keep just the combined prompts.

rlundeen2 · 2026-01-11

I think they should always be added (or not) together, because we likely always want to send them together as part of a single Message.

I care less about how we decide which ones to add than about them being added together, so we can handle them consistently :)
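
A rough sketch of what this thread converges on: the text and image parts of a row are added together (or not at all) and share one group id. `SketchPrompt` and `build_pair` are hypothetical stand-ins, not PyRIT classes, and gating on the combined grade is only one possible rule, since the reviewer leaves the selection rule open.

```python
import uuid
from dataclasses import dataclass, field


@dataclass
class SketchPrompt:
    """Hypothetical stand-in for a seed prompt; not the PyRIT model."""

    value: str
    data_type: str  # "text" or "image_path" in this sketch
    group_id: uuid.UUID
    id: uuid.UUID = field(default_factory=uuid.uuid4)


def build_pair(text, image_path, combined_grade, unsafe_grades=("unsafe", "borderline")):
    """Return both parts sharing one group id, or nothing at all."""
    # Assumption: the combined grade decides inclusion for the whole pair.
    if combined_grade not in unsafe_grades:
        return []
    group_id = uuid.uuid4()  # one id per row, shared by both parts
    return [
        SketchPrompt(text, "text", group_id),
        SketchPrompt(image_path, "image_path", group_id),
    ]
```

Because both parts carry the same `group_id`, a consumer can regroup them and send them as a single message.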

rlundeen2 · 2026-01-10T06:02:33Z

pyrit/datasets/seed_datasets/remote/vlsu_multimodal_dataset.py

+                    continue
+
+            # Handle UUID
+            if uuid_str:


We should always generate a new uuid. Otherwise there will be weirdness. It's unique to pyrit for when we pull these from the database

riyosha · 2026-01-10

Right, I overlooked that. Makes sense!
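
The change agreed on here amounts to something like the following sketch. `assign_prompt_id` is a hypothetical helper used only for illustration; the point is that any UUID string carried by the dataset row is not reused as the prompt's id.

```python
import uuid


def assign_prompt_id(source_uuid_str=None):
    """Return a fresh id for a prompt.

    The UUID string from the dataset row (if any) is deliberately ignored,
    so ids stay unique to the local database.
    """
    return uuid.uuid4()
```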

rlundeen2

This is good! Thank you! I think once comments are addressed and CI passes it should be ready to merge.

riyosha added 4 commits January 7, 2026 00:27

added support for multimodal prompts

59d2f8f

added VLSU multimodal dataset functionality

3581b5c

added unit and integration tests for VLSU dataset

225b567

fixed formatting

40f84dd

rlundeen2 reviewed Jan 10, 2026

pyrit/datasets/seed_datasets/remote/__init__.py (outdated)

pyrit/datasets/seed_datasets/remote/vlsu_multimodal_dataset.py (outdated)

rlundeen2 self-assigned this Jan 10, 2026

riyosha added 3 commits January 10, 2026 06:15

added unique pyrit uuids

1461d1f

removed limit and random sampling

383a72f

removed exports

1278205
