[ES|QL] Add CHUNK function #134320

kderusso · 2025-09-08T18:37:34Z

Adds a new function, CHUNK, that takes text from a field and returns chunks based on the requested chunking strategy.

For this PR, the function accepts a size, which corresponds to the default number of words per chunk in a sentence-based chunking strategy. Planned follow-up PRs will add support for explicit chunking settings or an inference ID on top of these defaults. Possible future optimizations include supporting a max chunk size of LIMIT and optimizations for semantic text fields.

Examples of how to call this function:

FROM wikipedia
 | WHERE MATCH(content, "churchill")
 | EVAL chunks = chunk(content, 3, 20)
 | MV_EXPAND chunks
 | KEEP chunks
 | LIMIT 10

FROM wikipedia
 | WHERE MATCH(content, "churchill")
 | EVAL chunks = chunk(content, 3)
 | MV_EXPAND chunks
 | KEEP chunks
 | LIMIT 10

FROM wikipedia
 | WHERE MATCH(content, "churchill")
 | EVAL chunks = chunk(content)
 | MV_EXPAND chunks
 | KEEP chunks
 | LIMIT 10
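
To make the idea of a sentence-based chunking strategy with a word limit concrete, here is a minimal Python sketch. It is not the implementation in this PR. It assumes the two numeric arguments in the calls above are a maximum number of chunks and a target word count per chunk, which the description does not spell out. The names `num_chunks` and `chunk_size`, the default of 300 words, and the naive regex sentence splitter are all hypothetical.

```python
import re
from typing import List, Optional


def chunk(text: str, num_chunks: Optional[int] = None, chunk_size: int = 300) -> List[str]:
    """Hypothetical sketch: pack whole sentences into chunks of at most chunk_size words."""
    # Naive sentence split: break after '.', '!' or '?' followed by whitespace.
    # (Assumption: a real implementation would likely use a proper sentence iterator.)
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]

    chunks: List[str] = []
    current: List[str] = []
    current_words = 0
    for sentence in sentences:
        n = len(sentence.split())
        # Start a new chunk when adding this sentence would exceed the word limit.
        if current and current_words + n > chunk_size:
            chunks.append(" ".join(current))
            current, current_words = [], 0
        current.append(sentence)
        current_words += n
    if current:
        chunks.append(" ".join(current))

    # With no limit given, return every chunk; otherwise keep only the first num_chunks.
    return chunks if num_chunks is None else chunks[:num_chunks]
```

In this sketch a single sentence longer than `chunk_size` is kept whole as its own oversized chunk rather than split mid-sentence, and omitting `num_chunks` returns all chunks, mirroring the "Default to returning all chunks" commit below.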

kderusso added 3 commits September 3, 2025 16:16

Add new function to chunk strings

98739d7

Refactor CHUNK function to support multiple values

6ae1cdc

Default to returning all chunks

1f4342c

elasticsearchmachine added the v9.2.0 label Sep 8, 2025

kderusso changed the title ~~Kderusso/esql chunk function~~ [ES|QL] Add CHUNK function Sep 8, 2025

elasticsearchmachine and others added 3 commits September 8, 2025 18:44

[CI] Auto commit changes from spotless

528c12c

Handle warnings

04307f2

Loosen export restrictions to try to get compile error working

66a13bb

elasticsearchmachine added v9.3.0 and removed v9.2.0 labels Oct 2, 2025

kderusso added 4 commits October 13, 2025 14:43

Merge main

6b3191e

Remove inference dependencies

693ea01

Fix compilation errors

fde0368

Remove more inference deps

a70d5b1
