Search Relevance testing infrastructure #2243

Mpdreamz · 2025-11-20T16:01:43Z

Add Search Integration Tests with Elasticsearch Explain API

Why

Search relevance is critical for documentation discoverability, but debugging why certain documents rank higher than others has been a black box. When search results don't match expectations, we need detailed insights into Elasticsearch's scoring decisions to improve our search queries and boost factors.

Additionally, as we continue to refine our hybrid search implementation (combining lexical and semantic search with RRF), we need automated tests that not only verify correct behavior but also help us understand and improve search ranking over time.

How

This PR introduces search testing infrastructure in three parts: gateway and fixture changes, two complementary test classes, and test configuration:

1. Infrastructure Changes

ElasticsearchGateway Refactoring (ElasticsearchGateway.cs):

Extracted query building logic into reusable static methods to eliminate duplication:
- BuildLexicalQuery() - Encapsulates traditional text search with multiple match types and boost factors
- BuildSemanticQuery() - Handles semantic search using semantic_text fields
- NormalizeSearchQuery() - Query normalization (e.g., "dotnet" → "net")

This extraction serves two purposes: it removes duplication, and it lets the explain functionality use exactly the same queries as production searches.

Implemented Elasticsearch Explain API integration:
- ExplainDocumentAsync() - Uses Elasticsearch's _explain API to get detailed scoring breakdown for why a document matched (or didn't match) a query
- ExplainTopResultAndExpectedAsync() - Compares actual top result with expected result, providing side-by-side scoring analysis
- FormatExplanation() - Recursively formats Elasticsearch's ExplanationDetail tree into human-readable indented output
- ExplainResult record - Strongly-typed container for explain results (Found, Matched, Score, Explanation)
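
The recursive formatting described for FormatExplanation() can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `ExplanationNode` and `ExplanationFormatter` are hypothetical stand-ins for the client's ExplanationDetail type and the gateway method, and the two-space indent and four-decimal score format are assumptions chosen to resemble the sample output.

```csharp
#nullable enable
using System.Collections.Generic;
using System.Globalization;
using System.Text;

// Hypothetical stand-in for the client's ExplanationDetail type:
// a score value, a description, and optional nested details.
public sealed record ExplanationNode(
    double Value,
    string Description,
    IReadOnlyList<ExplanationNode>? Details = null);

public static class ExplanationFormatter
{
    // Renders the tree depth-first, one line per node.
    public static string Format(ExplanationNode node, int indent = 0)
    {
        var sb = new StringBuilder();
        Append(sb, node, indent);
        return sb.ToString();
    }

    private static void Append(StringBuilder sb, ExplanationNode node, int indent)
    {
        // Two spaces per nesting level (an assumption, not the PR's exact layout).
        sb.Append(' ', indent * 2)
          .Append(node.Value.ToString("F4", CultureInfo.InvariantCulture))
          .Append(" - ")
          .Append(node.Description)
          .Append('\n');

        if (node.Details is null)
            return;

        foreach (var child in node.Details)
            Append(sb, child, indent + 1);
    }
}
```

The sketch threads a single StringBuilder through the recursion so that deep explanation trees do not allocate a new string per child; whether that matters for the tree sizes seen here is a judgment call.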

SearchBootstrapFixture (SearchTestBase.cs):

- Implements an indexing strategy that first checks whether the remote Elasticsearch already has up-to-date data by comparing semantic channel hashes
- Triggers indexing only when necessary, which avoids re-indexing on every run and shortens test execution time
- Shares the fixture across multiple test classes using xUnit collection fixtures
- Validates successful indexing by checking exit codes and resource states
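
The "index only when necessary" decision could look something like the sketch below. How the fixture actually obtains the two hashes is not shown in this description, so both delegates are hypothetical placeholders, as is the `IndexingDecision` name. The catch branch mirrors the fallback visible in the diff: if the check itself fails, index anyway.

```csharp
#nullable enable
using System;
using System.Threading.Tasks;

public static class IndexingDecision
{
    // Returns true when indexing should run.
    // getLocalHash:  hypothetical, computes the hash of the local content.
    // getRemoteHash: hypothetical, reads the hash stored with the remote index
    //                (null when nothing has been indexed yet).
    public static async Task<bool> NeedsIndexingAsync(
        Func<Task<string>> getLocalHash,
        Func<Task<string?>> getRemoteHash)
    {
        try
        {
            var local = await getLocalHash();
            var remote = await getRemoteHash();

            // A missing or different remote hash means the index is stale.
            return !string.Equals(local, remote, StringComparison.Ordinal);
        }
        catch (Exception ex)
        {
            // If the state cannot be checked, it is safer to index.
            Console.WriteLine($"Error checking Elasticsearch state: {ex.Message}. Will proceed with indexing.");
            return true;
        }
    }
}
```

Defaulting to "index" on failure trades a slower run for never testing against stale data.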

2. Test Classes

SearchIntegrationTests - Black-box API testing:

- Tests the public HTTP API endpoint (/docs/_api/v1/search)
- Verifies end-to-end behavior including pagination, error handling, and response formatting
- Ensures the API layer correctly interfaces with ElasticsearchGateway

SearchRelevanceTests - White-box relevance testing with explain output:

- Uses ElasticsearchGateway directly to bypass the HTTP layer
- When a test fails (the first result doesn't match the expected one), automatically:
  1. Fetches a detailed explain for the actual top result
  2. Fetches a detailed explain for the expected result
  3. Outputs formatted scoring breakdowns showing exactly why Elasticsearch ranked them differently
  4. Calculates and displays score differences
- Includes Assert.SkipUnless(searchFixture.Connected) to skip gracefully when Elasticsearch is unavailable
- Uses the same test cases as SearchIntegrationTests for consistency
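
As a rough illustration of the side-by-side comparison and score difference, the sketch below builds a short mismatch summary. `ExplainSummary` is a hypothetical, trimmed-down mirror of the ExplainResult shape described in this PR, and `MismatchReport` and its output wording are assumptions rather than the test's real output.

```csharp
#nullable enable
using System;

// Hypothetical mirror of a subset of ExplainResult's fields.
public sealed record ExplainSummary(string DocumentUrl, bool Found, bool Matched, double Score);

public static class MismatchReport
{
    // Summarizes both results and the gap between their scores.
    public static string Describe(ExplainSummary actual, ExplainSummary expected)
    {
        var difference = actual.Score - expected.Score;
        var lines = new[]
        {
            FormattableString.Invariant($"ACTUAL TOP RESULT: {actual.DocumentUrl} (score {actual.Score:F4})"),
            FormattableString.Invariant($"EXPECTED RESULT: {expected.DocumentUrl} (score {expected.Score:F4})"),
            FormattableString.Invariant($"Score difference: {difference:F4}"),
        };
        return string.Join("\n", lines);
    }
}
```

A positive difference shows how far the expected document would need to move to overtake the current top result.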

3. Configuration & DI

TestParameterProvider:

- Implements the IParameterProvider interface for test scenarios
- Bridges the gap between test configuration (user secrets, Aspire config, environment variables) and production parameter providers (AWS Parameter Store, Lambda Extension)
- Follows the same fallback chain pattern used in production
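
A fallback chain of this kind can be sketched as "ask each source in order, take the first non-empty value". The sketch below does not reproduce IParameterProvider, whose signature is not shown here. `FallbackChain` and the delegate-based sources are hypothetical, and throwing when nothing is found is an assumption.

```csharp
#nullable enable
using System;
using System.Collections.Generic;

public static class FallbackChain
{
    // Tries each source in order and returns the first non-empty value.
    // Each source is a hypothetical lookup (for example user secrets,
    // then Aspire config, then environment variables).
    public static string Resolve(string name, IEnumerable<Func<string, string?>> sources)
    {
        foreach (var source in sources)
        {
            var value = source(name);
            if (!string.IsNullOrWhiteSpace(value))
                return value;
        }

        throw new InvalidOperationException($"Parameter '{name}' was not found in any source.");
    }
}
```

Passing sources as an ordered sequence keeps the precedence explicit at the call site, so a test can swap or reorder them without touching the lookup logic.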

Test Output Example

When a search relevance test fails, developers see:

❌ FIRST RESULT MISMATCH - Fetching detailed explanations...

═══════════════════════════════════════════════════════════════
ACTUAL TOP RESULT: /docs/reference/elasticsearch/clients/python/getting-started
Score: 0.0315
Matched: True
───────────────────────────────────────────────────────────────
Scoring Breakdown:
  0.0315 - max of:
    0.0234 - weight(title:elasticsearch in 1234) [PerFieldSimilarity], result of:
      0.0234 - score(freq=1.0), computed from:
        10.0000 - boost
        2.3400 - idf, computed as log(1 + (N - n + 0.5) / (n + 0.5))
        0.0010 - tf, computed as freq / (freq + k1 * (1 - b + b * dl / avgdl))
    0.0189 - weight(abstract:elasticsearch in 1234) [PerFieldSimilarity]
      ...

═══════════════════════════════════════════════════════════════
EXPECTED RESULT: /docs/reference/elasticsearch/clients/java/getting-started
Score: 0.0289
Matched: True
───────────────────────────────────────────────────────────────
Scoring Breakdown:
  0.0289 - max of:
    0.0212 - weight(title:elasticsearch in 5678) [PerFieldSimilarity]
      ...

This enables data-driven decisions about boost factors, query types, and field weights.
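
When reading such a breakdown, it helps to know that a BM25 term score in Lucene's explain output is the product of the three leaf values (boost, idf, tf). The snippet below simply redoes that arithmetic; `Bm25Check` is a hypothetical helper name, and the values in the note that follows are the ones from the sample output above.

```csharp
public static class Bm25Check
{
    // A BM25 term score as reported by explain: boost * idf * tf.
    public static double TermScore(double boost, double idf, double tf) => boost * idf * tf;
}
```

With the sample values, `Bm25Check.TermScore(10.0, 2.34, 0.001)` comes to 0.0234 at four decimals, matching the score(freq=1.0) line. Because boost is a plain multiplier, doubling it doubles that term's contribution.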

Current Status

All 15 search integration tests passing:

- 7 SearchIntegrationTests (HTTP API)
- 8 SearchRelevanceTests (direct gateway with explain capability)

The tests currently pass because search results match expectations, but the infrastructure is ready to provide detailed diagnostics the moment search relevance needs improvement.

(cherry picked from commit 40b0f53)

github-actions · 2025-11-20T16:08:23Z

🔍 Preview links for changed docs

aspire/README.md

tests-integration/Elastic.Assembler.IntegrationTests/Search/SearchTestBase.cs

src/services/Elastic.Documentation.Assembler/Building/AssemblerBuildService.cs

+		// Early return if --assume-build is specified and output already exists
+		if (assumeBuild.GetValueOrDefault(false))
+		{
+			var indexHtmlPath = Path.Combine(assembleContext.OutputDirectory.FullName, "docs", "index.html");


src/api/Elastic.Documentation.Api.Infrastructure/Adapters/Search/ElasticsearchGateway.cs

+		if (explanation.Details != null && explanation.Details.Count > 0)
+		{
+			foreach (var detail in explanation.Details)
+				result += FormatExplanation(detail, indent + 1);


src/api/Elastic.Documentation.Api.Infrastructure/Adapters/Search/ElasticsearchGateway.cs

+		catch (Exception ex)
+		{
+			_logger.LogError(ex, "Error explaining document '{Url}' for query '{Query}'", documentUrl, query);
+			return new ExplainResult
+			{
+				DocumentUrl = documentUrl,
+				Found = false,
+				Explanation = $"Exception during explain: {ex.Message}"
+			};
+		}


tests-integration/Elastic.Assembler.IntegrationTests/Search/SearchTestBase.cs

+		catch (Exception ex)
+		{
+			Console.WriteLine($"Error checking Elasticsearch state: {ex.Message}. Will proceed with indexing.");
+			return true; // If we can't check, safer to index
+		}


Mpdreamz added 4 commits November 20, 2025 09:39

Add Api.IntegrationTests to slnx

dcc6f15

(cherry picked from commit 40b0f53)

Add initial infrastructure for search integration tests

abb271d

Add SearchRelevanceTests with detailed explain results for scoring

0e6d022

SkipUnless connected to Elasticsearch

ed76ec2

Mpdreamz requested a review from a team as a code owner November 20, 2025 16:01

Mpdreamz requested a review from cotti November 20, 2025 16:01

Mpdreamz added the feature label Nov 20, 2025

Mpdreamz self-assigned this Nov 20, 2025

Mpdreamz changed the title ~~feature/search integration tests~~ Search Relevance testing infrastructure Nov 20, 2025

Merge remote-tracking branch 'origin/main' into feature/search-integr…

31d4140

…ation-tests

github-actions bot deployed to docs-preview November 20, 2025 16:05 View deployment

github-code-quality bot found potential problems Nov 20, 2025

fix integration tests and starting the elasticsearch-ingest through f…

21a2de9

…ixture

github-actions bot deployed to docs-preview November 20, 2025 17:15 View deployment

github-code-quality bot found potential problems Nov 20, 2025

tests-integration/Elastic.Assembler.IntegrationTests/Search/SearchTestBase.cs

Comment on lines +208 to +212

catch (Exception ex)
{
	Console.WriteLine($"Error checking Elasticsearch state: {ex.Message}. Will proceed with indexing.");
	return true; // If we can't check, safer to index
}
