Skip to content

Add code search reranking support via external services #6620

@AbinandN

Description

@AbinandN

Problem Description

Current codebase indexing and search suffers from critical retrieval quality issues that degrade the AI's ability to provide accurate responses:

  • Context Pollution: Vector similarity alone retrieves syntactically similar but semantically irrelevant code snippets, polluting the context window with noise. For example, searching for "authentication logic" might retrieve every file that mentions "auth" but miss the actual authentication implementation.

  • Who is affected: All users relying on AI-assisted code understanding and generation

  • When it happens: During any codebase search operation, especially in large repositories

  • Current behavior: Codebase search retrieves top-k results based solely on vector similarity, leading to:

    • Semantically irrelevant results ranking high due to keyword overlap
    • Critical implementation details being pushed out by superficial matches
    • AI receiving diluted context that reduces response accuracy
  • Expected behavior: Retrieved context should be semantically relevant to the query intent, not just vector-similar

  • Impact:

    • AI provides less accurate responses due to poor context quality
    • Users must repeatedly refine queries to get relevant results
    • Reduced trust in AI suggestions when based on irrelevant context

Without reranking, our codebase search treats all vector-similar results equally, ignoring semantic relevance. This leads to context windows filled with marginally relevant code that drowns out the truly important implementations.

Additional Context

This feature introduces reranking as a critical second-stage filter that re-scores retrieval results based on semantic relevance rather than just vector similarity. This dramatically improves the quality of context provided to the AI.

Related PR: #6609 implements this feature

Metadata

Metadata

Assignees

No one assigned

    Labels

    Issue - In ProgressSomeone is actively working on this. Should link to a PR soon.enhancementNew feature or request

    Type

    No type

    Projects

    Status

    Done

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions