Rare Terms Aggregation Performance Optimization

Unsure about existing performance of Rare Terms Aggregation at the moment, but looking through initial code at high level, it looks like that this aggregation also utilizes iterating through each document.

The idea is to utilize the terms frequency from Lucene similar to https://github.com/opensearch-project/OpenSearch/pull/11643 and avoid iterating through individual documents.

Next Steps:
- Measure/gather existing performance of rare terms aggregation
- Improve upon the implementation if it can be done with above ideation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Rare Terms Aggregation Performance Optimization #13122

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Rare Terms Aggregation Performance Optimization #13122

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions