Skip to content

Conversation

@dan-rubinstein
Copy link
Member

@dan-rubinstein dan-rubinstein commented Jun 25, 2025

TODO

Questions:

  1. In Elasticsearch we limit chunking to at most 512 chunks. When running through the EmbeddingRequestChunker we provide a maxNumberOfChunksPerBatch that the third-party service can handle. For late chunking, this value has to be at least 512 as we must pass all of the chunks for a given document into a single call downstream. Is this an assumption we can make?

@dan-rubinstein dan-rubinstein added :ml Machine learning Team:ML Meta label for the ML team >enhancement labels Jun 25, 2025
@elasticsearchmachine
Copy link
Collaborator

Hi @dan-rubinstein, I've created a changelog YAML for you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

>enhancement :ml Machine learning Team:ML Meta label for the ML team v9.3.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants