Add RerankRequestChunker #130485

dan-rubinstein · 2025-07-02T18:02:00Z

This change adds the ability to use chunking for the elastic reranker as an alternative long-document handling strategy to the existing truncation method. To enable chunking, include long_document_strategy with the value chunk in the service_settings of the rerank inference endpoint used to perform inference. The value can also be set explicitly to truncate to force truncation, but this is currently the default behavior. The optional max_chunks_per_doc value limits the number of chunks sent for inference per document; if it is not set, all chunks generated for the document are sent. For example:

PUT _inference/rerank/my-elasticrerank-endpoint
{
  "service": "elasticsearch",
  "service_settings": {
    "model_id": ".rerank-v1",
    "num_threads": 1,
    "num_allocations": 1,
    "long_document_strategy": "chunk",
    "max_chunks_per_doc": 2
  }
}

When using chunking, documents will be chunked before inference and the chunks (either all or some depending on whether max_chunks_per_doc is set) will be sent for inference. For each document, the relevance score returned to the user will be the maximum score for any given chunk within the document.
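The behavior described above can be sketched roughly as follows. This is a minimal illustration, not the code in RerankRequestChunker: the class, record, and method names are hypothetical, and keeping the first N chunks when max_chunks_per_doc is set is an assumption, since the description does not say which chunks are selected.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch; names here are illustrative and not taken from the PR.
final class ChunkedRerankSketch {

    /** Score returned for one chunk, tagged with the index of its source document. */
    record ChunkScore(int docIndex, double score) {}

    /**
     * Applies an optional per-document chunk limit.
     * Assumption: when a limit is set, the first N chunks are kept.
     */
    static List<String> limitChunks(List<String> chunks, Integer maxChunksPerDoc) {
        if (maxChunksPerDoc == null || maxChunksPerDoc >= chunks.size()) {
            return chunks; // no limit configured, or nothing to trim
        }
        return chunks.subList(0, maxChunksPerDoc);
    }

    /** Collapses chunk scores to one score per document: the maximum over its chunks. */
    static double[] maxScorePerDoc(int docCount, List<ChunkScore> chunkScores) {
        double[] best = new double[docCount];
        // Documents with no scored chunk keep NEGATIVE_INFINITY in this sketch.
        Arrays.fill(best, Double.NEGATIVE_INFINITY);
        for (ChunkScore chunk : chunkScores) {
            best[chunk.docIndex()] = Math.max(best[chunk.docIndex()], chunk.score());
        }
        return best;
    }
}
```

With this shape, a document split into three chunks and max_chunks_per_doc set to 2 would have only two chunks scored, and the higher of those two scores would be reported for the document.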

Testing

Unit tests + integration tests
Created an elastic reranker endpoint with no chunking configuration values and ensured that truncation worked as expected.
Created an elastic reranker endpoint with truncate selected for long document strategy and ensured that truncation worked as expected.
Created an elastic reranker endpoint with chunk selected for long document strategy and ensured that documents were chunked and all chunks were sent for inference.
Created an elastic reranker endpoint with chunk selected for long document strategy and max_chunks_per_doc set and ensured that a subset of chunks was sent for inference.
(TODO) Created a non-elastic reranker elasticsearch service endpoint and ensured that inference is still working.

dan-rubinstein · 2025-07-03T18:17:16Z

@elasticmachine merge upstream

...ference/src/main/java/org/elasticsearch/xpack/inference/action/TransportInferenceAction.java

...pack/inference/rank/textsimilarity/TextSimilarityRankFeaturePhaseRankCoordinatorContext.java

dan-rubinstein · 2025-07-30T15:13:32Z

@elasticmachine merge upstream

dan-rubinstein · 2025-08-13T13:29:16Z

@elasticmachine merge upstream

…um chunks per document

dan-rubinstein · 2025-09-11T18:53:11Z

@elasticmachine merge upstream

...pack/inference/rank/textsimilarity/TextSimilarityRankFeaturePhaseRankCoordinatorContext.java

...org/elasticsearch/xpack/inference/services/elasticsearch/ElasticRerankerServiceSettings.java

...inference/src/main/java/org/elasticsearch/xpack/inference/chunking/RerankRequestChunker.java

dan-rubinstein · 2025-09-22T13:43:45Z

@elasticmachine merge upstream

elasticsearchmachine · 2025-09-22T17:18:06Z

Pinging @elastic/ml-core (Team:ML)

elasticsearchmachine · 2025-09-22T17:18:38Z

Hi @dan-rubinstein, I've created a changelog YAML for you.

davidkyle

LGTM

davidkyle · 2025-09-23T13:40:31Z

...inference/src/main/java/org/elasticsearch/xpack/inference/chunking/RerankRequestChunker.java

+    private RankedDocsResults parseRankedDocResultsForChunks(RankedDocsResults rankedDocsResults) {
+        List<RankedDocsResults.RankedDoc> updatedRankedDocs = new ArrayList<>();
+        Set<Integer> docIndicesSeen = new HashSet<>();
+        for (RankedDocsResults.RankedDoc rankedDoc : rankedDocsResults.getRankedDocs()) {


To be safe and ensure the highest-scoring chunk is used, rankedDocsResults should be sorted. The results will almost certainly be sorted already, but just in case.

The sorting could be done in the RankedDocsResults constructor

Right, good catch. I added the sort at the end of this function, but it needs to be applied to the result of the rankedDocsResults.getRankedDocs() call, so that we take the top result for each doc even when the results aren't sorted. I'll update this to sort the ranked docs before looping, and I'll also rename updatedRankedDocs to topRankedDocs, as I think that's a bit clearer about what we're trying to store.
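A rough sketch of the sort-before-looping approach discussed in this thread, under stated assumptions: the RankedDoc record below is a hypothetical stand-in for RankedDocsResults.RankedDoc, and the method does not reproduce the merged code. It sorts a copy of the chunk results by descending score, then keeps the first entry seen for each document index.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch; RankedDoc here is a simplified stand-in, not the real class.
final class TopRankedDocsSketch {

    record RankedDoc(int docIndex, double relevanceScore) {}

    /** Returns the highest-scoring entry per document, ordered by descending score. */
    static List<RankedDoc> topRankedDocs(List<RankedDoc> chunkResults) {
        // Sort a copy so the result is correct even if the input is unsorted.
        List<RankedDoc> sorted = new ArrayList<>(chunkResults);
        sorted.sort((a, b) -> Double.compare(b.relevanceScore(), a.relevanceScore()));

        Set<Integer> docIndicesSeen = new HashSet<>();
        List<RankedDoc> topRankedDocs = new ArrayList<>();
        for (RankedDoc doc : sorted) {
            // Set.add returns false for an index that is already present,
            // so only the first (highest-scoring) chunk per document is kept.
            if (docIndicesSeen.add(doc.docIndex())) {
                topRankedDocs.add(doc);
            }
        }
        return topRankedDocs;
    }
}
```

Without the sort, the first chunk encountered for a document would win regardless of its score, which is the case the review comment guards against.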

...pack/inference/rank/textsimilarity/TextSimilarityRankFeaturePhaseRankCoordinatorContext.java

...a/org/elasticsearch/xpack/inference/services/elasticsearch/ElasticsearchInternalService.java

test/test-clusters/src/main/java/org/elasticsearch/test/cluster/FeatureFlag.java

...ence/src/test/java/org/elasticsearch/xpack/inference/chunking/RerankRequestChunkerTests.java

…equest-chunker

Add RerankRequestChunker

849147e

dan-rubinstein added :ml Machine learning Team:ML Meta label for the ML team v9.2.0 labels Jul 2, 2025

elasticmachine and others added 2 commits July 3, 2025 20:17

Merge branch 'main' into rerank-request-chunker

c41d54c

Add chunking strategy generation

da4c939

dan-rubinstein commented Jul 18, 2025

...ference/src/main/java/org/elasticsearch/xpack/inference/action/TransportInferenceAction.java (outdated)

dan-rubinstein commented Jul 18, 2025

...pack/inference/rank/textsimilarity/TextSimilarityRankFeaturePhaseRankCoordinatorContext.java (outdated)

davidkyle added the cloud-deploy Publish cloud docker image for Cloud-First-Testing label Jul 18, 2025

Merge branch 'main' into rerank-request-chunker

004ca8f

elasticmachine and others added 2 commits July 30, 2025 17:13

Merge branch 'main' into rerank-request-chunker

5ec620a

Adding unit tests and fixing token/word ratio

4ff8eb0

Merge branch 'main' into rerank-request-chunker

ec78b87

This was referenced Aug 26, 2025

[ML] Evaluate reranking all chunks from docs with the elastic reranker #133588

Closed

[ML] Evaluate reranking first N chunks from docs #133589

Open

[ML] Evaluate chunked rerank against third-party integrations #133590

Open

dan-rubinstein added 2 commits September 9, 2025 10:30

Add configurable values for long document handling strategy and maxim…

9ef8917

…um chunks per document

Adding back sentence overlap for rerank chunking strategy

24497ae

Merge branch 'main' into rerank-request-chunker

1fea365

davidkyle reviewed Sep 22, 2025

elasticmachine and others added 2 commits September 22, 2025 15:43

Merge branch 'main' into rerank-request-chunker

8396214

Adding unit tests, transport version, and feature flag

8b97711

dan-rubinstein marked this pull request as ready for review September 22, 2025 17:17

dan-rubinstein added the >enhancement label Sep 22, 2025

Update docs/changelog/130485.yaml

833ef02

davidkyle approved these changes Sep 23, 2025

dan-rubinstein added 2 commits September 25, 2025 12:05

Merge branch 'main' of github.com:elastic/elasticsearch into rerank-r…

77701e1

…equest-chunker

Adding unit tests and refactoring code with clearer naming conventions

344e121

dan-rubinstein requested a review from davidkyle September 25, 2025 19:23

davidkyle approved these changes Sep 26, 2025

Merge branch 'main' of github.com:elastic/elasticsearch into rerank-r…

02c9d0a

…equest-chunker

dan-rubinstein requested a review from davidkyle September 29, 2025 13:30

davidkyle approved these changes Sep 29, 2025

Merge branch 'main' of github.com:elastic/elasticsearch into rerank-r…

d68bf09

…equest-chunker

dan-rubinstein enabled auto-merge (squash) September 29, 2025 14:49

dan-rubinstein merged commit 0cee213 into elastic:main Sep 29, 2025
35 checks passed

This was referenced Oct 1, 2025

[ML] Add rerank chunking for elastic reranker #135791

Closed

[ML] Add configurable settings for rerank chunking #133084

Closed

kosabogi mentioned this pull request Oct 8, 2025

[9.2] Add new parameters to inference PUT API for the rerank task type elastic/elasticsearch-specification#5451

Closed
