@dan-rubinstein

Backports the following commits to 8.19:

* Add recursive chunker

* Update docs/changelog/126866.yaml

* Clean up separator sets and add asMap function for RecursiveChunkingSettings

* Add javadoc for chunker, add tests, reduce word counting operations

* Remove split merging and add long document unit test

* [CI] Auto commit changes from spotless

* Add markdown chunking tests and reduce substring calls

* Clean up matcher logic

* Add testing for not splitting after valid chunk is found

---------

Co-authored-by: elasticsearchmachine <[email protected]>
Co-authored-by: Elastic Machine <[email protected]>
@dan-rubinstein added the :ml, >enhancement, auto-merge-without-approval, backport, and Team:ML labels on Jun 18, 2025
@dan-rubinstein removed the auto-merge-without-approval label on Jun 18, 2025
@dan-rubinstein dan-rubinstein merged commit e92de38 into elastic:8.19 Jun 18, 2025
22 checks passed

Labels

backport, >enhancement, :ml, Team:ML, v8.19.0

3 participants