Improve Autocomplete Suggestions by Experimenting with models/prompts #906
hassoncs started this conversation in 1. Feature requests
Summary
Experiment with multiple OpenRouter models for autocomplete to identify the best trade-off between quality, latency, and cost, then expose a dropdown UI allowing end users to select their preferred model.
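As a rough starting point, the comparison harness could look something like the sketch below: send the same fixed prompt to each candidate via OpenRouter's OpenAI-compatible chat completions endpoint and record wall-clock latency plus the returned completion. The model IDs, prompt fixture, and request parameters are placeholders to be swapped for the real candidates and the shared template, not a finished benchmark.

```ts
// Sketch of a latency comparison harness against OpenRouter's
// OpenAI-compatible chat completions endpoint (Node 18+, global fetch).

const OPENROUTER_URL = "https://openrouter.ai/api/v1/chat/completions";

// Candidate model IDs are examples only; confirm against https://openrouter.ai/models.
const CANDIDATE_MODELS = [
  "google/gemini-2.5-flash",
  "openai/gpt-4o-mini",
  "anthropic/claude-3.5-haiku",
];

// Fixed prompt fixture so every model sees identical input; in a real run this
// would come from the shared autocomplete prompt template.
const PROMPT =
  "Complete the TypeScript statement:\nconst total = items.reduce(";

interface TrialResult {
  model: string;
  latencyMs: number;
  completion: string;
}

async function runTrial(model: string, prompt: string): Promise<TrialResult> {
  const start = performance.now();
  const res = await fetch(OPENROUTER_URL, {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.OPENROUTER_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      model,
      messages: [{ role: "user", content: prompt }],
      max_tokens: 64, // autocomplete suggestions are short
      temperature: 0.2,
    }),
  });
  const data = await res.json();
  return {
    model,
    latencyMs: performance.now() - start,
    completion: data.choices?.[0]?.message?.content ?? "",
  };
}

async function main() {
  for (const model of CANDIDATE_MODELS) {
    const { latencyMs, completion } = await runTrial(model, PROMPT);
    console.log(`${model}: ${latencyMs.toFixed(0)} ms -> ${completion.slice(0, 60)}`);
  }
}

main().catch(console.error);
```

Since the concern is latency under load, each model should probably be sampled several times and compared on p50/p95 rather than a single request, but the structure above is enough to start collecting numbers.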
Background
We currently use Gemini 2.5 Flash for inline autocomplete. Accuracy is good, but latency under load is a problem, with suggestions taking at least 2 seconds. We need to determine whether Gemini 2.5 Flash is actually the best choice by comparing it against potentially cheaper and faster models using a consistent prompt template.
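One possible shape for that consistent prompt template is sketched below. The wording, the prefix/suffix framing, and the fixture snippet are assumptions to be tuned during the experiment, not the current production prompt.

```ts
// One possible shared prompt template for fill-in-the-middle style autocomplete,
// so every candidate model is judged on identical input.

interface AutocompleteContext {
  language: string; // e.g. "typescript"
  prefix: string;   // code before the cursor
  suffix: string;   // code after the cursor
}

function buildAutocompletePrompt({ language, prefix, suffix }: AutocompleteContext): string {
  return [
    `You are an inline autocomplete engine for ${language}.`,
    "Return only the code to insert at the cursor position,",
    "with no explanation, markdown fences, or repeated surrounding code.",
    "",
    "Code before the cursor:",
    prefix,
    "",
    "Code after the cursor:",
    suffix,
  ].join("\n");
}

// Fixed fixture used for every candidate model, so results are comparable.
const benchmarkPrompt = buildAutocompletePrompt({
  language: "typescript",
  prefix: "function clamp(value: number, min: number, max: number) {\n  return ",
  suffix: "\n}",
});

console.log(benchmarkPrompt);
```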
Goals
Acceptance Criteria
Tasks
Technical Notes