Add SegmentOrder utility class for per-query segment re-ordering #15591

romseygeek · 2026-01-20T17:04:39Z

This allows client code that is querying sorted indexes to ensure that segments most likely to produce entries in a top-k search are queried first, allowing efficient skipping of other segments.

romseygeek · 2026-01-21T10:08:29Z

lucene/core/src/test/org/apache/lucene/search/TestSegmentReordering.java

+
+    @Override
+    public LeafCollector getLeafCollector(LeafReaderContext context) throws IOException {
+      String segmentId = context.toString().substring(19, 20);


This is something of a hack, but I couldn't think of an easier way to work out what the 'original' ordinal for a segment is. Less hacky solution ideas are welcome!

Isn't ord a public field in LeafReaderContext?

It is, but it is generated by BaseCompositeReader after the segment sorter has been applied. So the ord of the first segment in the leaves array is always 0, and we can't use it to see if the segments have been re-ordered.

Ok, segment id - nevermind. Don't know. The substring hack seems... very fragile.

I updated the test to just compare the order of leaves before and after the re-ordering. The collector checks were necessary when I was wiring everything through IndexSearcher but aren't needed now.

) This allows client code that is querying sorted indexes to ensure that segments most likely to produce entries in a top-k search are queried first, allowing efficient skipping of other segments.

…che#15591) This allows client code that is querying sorted indexes to ensure that segments most likely to produce entries in a top-k search are queried first, allowing efficient skipping of other segments.

Add SegmentOrder utility class for per-query segment re-ordering

91bbb0f

This allows client code that is querying sorted indexes to ensure that segments most likely to produce entries in a top-k search are queried first, allowing efficient skipping of other segments.

romseygeek self-assigned this Jan 20, 2026

github-actions bot added module:core/index module:core/search module:test-framework labels Jan 20, 2026

romseygeek mentioned this pull request Jan 20, 2026

Allow Collectors to re-order segments for non-exhaustive searches #15436

Closed

github-actions bot added this to the 10.4.0 milestone Jan 20, 2026

romseygeek added 3 commits January 20, 2026 17:07

changes

56e2eb8

lint

1c89856

cleanup

7d16245

github-actions bot removed the module:test-framework label Jan 21, 2026

romseygeek commented Jan 21, 2026

View reviewed changes

romseygeek added 2 commits January 21, 2026 14:53

Improve test

518c2a1

tidy

53ff583

romseygeek merged commit d455fef into apache:main Jan 23, 2026
12 checks passed

romseygeek deleted the sort/reader-resorter branch January 23, 2026 10:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add SegmentOrder utility class for per-query segment re-ordering #15591

Add SegmentOrder utility class for per-query segment re-ordering #15591

Uh oh!

romseygeek commented Jan 20, 2026

Uh oh!

romseygeek Jan 21, 2026

Uh oh!

dweiss Jan 21, 2026

Uh oh!

romseygeek Jan 21, 2026

Uh oh!

dweiss Jan 21, 2026

Uh oh!

romseygeek Jan 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add SegmentOrder utility class for per-query segment re-ordering #15591

Add SegmentOrder utility class for per-query segment re-ordering #15591

Uh oh!

Conversation

romseygeek commented Jan 20, 2026

Uh oh!

romseygeek Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

dweiss Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

romseygeek Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

dweiss Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

romseygeek Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants