Add benchmark results for google/gemini-3-pro-preview #170

github-actions · 2025-11-18T18:26:37Z

This PR adds benchmark results for the google/gemini-3-pro-preview model.

The following files have been updated:

src/benchmark/results.json - Raw benchmark results
src/benchmark/validation-results.json - Validation results against human baseline

This PR was automatically generated by the benchmark workflow.

Note: If you don't want to merge this PR, close it and the model will be added to the untested list to prevent re-processing.

@alrocar

Note

Adds google/gemini-3-pro-preview to the benchmark config and records its raw results and validation metrics across all benchmark queries.

Config:
- Add google model gemini-3-pro-preview to src/benchmark-config.json.
Benchmarks:
- Populate src/benchmark/results.json with raw outputs for gemini-3-pro-preview across numerous pipe_* queries (SQL, results, metrics, attempts).
Validation:
- Update src/benchmark/validation-results.json to include comparison entries for google/gemini-3-pro-preview, detailing match status, distances, and aggregate stats.

^{Written by Cursor Bugbot for commit 000d226. This will update automatically on new commits. Configure here.}

vercel · 2025-11-18T18:26:41Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Preview	Comments	Updated (UTC)
llm-benchmark	Ready	Preview	Comment	Nov 18, 2025 6:27pm

feat: add benchmark results for google/gemini-3-pro-preview

000d226

vercel bot deployed to Preview November 18, 2025 18:27 View deployment

alrocar merged commit 72eee19 into main Nov 19, 2025
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add benchmark results for google/gemini-3-pro-preview #170

Add benchmark results for google/gemini-3-pro-preview #170

Uh oh!

github-actions bot commented Nov 18, 2025 •

edited by cursor bot

Loading

Uh oh!

vercel bot commented Nov 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add benchmark results for google/gemini-3-pro-preview #170

Add benchmark results for google/gemini-3-pro-preview #170

Uh oh!

Conversation

github-actions bot commented Nov 18, 2025 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vercel bot commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

github-actions bot commented Nov 18, 2025 •

edited by cursor bot

Loading

vercel bot commented Nov 18, 2025 •

edited

Loading