[ES|QL] Text embedding function constant folding #135710

afoucret · 2025-09-30T16:12:22Z

This PR introduces a constant folding optimization for the TEXT_EMBEDDING function in ESQL.

Key components:

InferenceFunctionEvaluator: a new piece of infrastructure that uses an InferenceOperator to evaluate an inference function. Currently, only constant folding is supported, but it could be extended to more use cases in the future.
A rules mechanism for the LogicalPlanPreOptimizer, plus a rule that folds inference functions (FoldInferenceFunctions).
CSV tests for the TEXT_EMBEDDING function, including usage with other vector functions such as KNN.
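
As a rough illustration of what folding a constant inference call means, here is a minimal, self-contained sketch. It is not the Elasticsearch implementation: `FoldingSketch`, `EmbeddingCall`, `Column`, and `fakeEmbedding` are hypothetical names, the expression tree is reduced to three node types, and the evaluator is a synchronous stand-in for whatever the real InferenceFunctionEvaluator does through an InferenceOperator.

```java
import java.util.function.BiFunction;

/** Hypothetical, simplified model of folding a constant inference call; not the Elasticsearch classes. */
final class FoldingSketch {
    interface Expr {}

    /** A constant value, such as a string or an embedding vector. */
    static final class Literal implements Expr {
        final Object value;
        Literal(Object value) { this.value = value; }
    }

    /** A reference to a column, which cannot be folded at planning time. */
    static final class Column implements Expr {
        final String name;
        Column(String name) { this.name = name; }
    }

    /** A TEXT_EMBEDDING-like call with two arguments: input text and inference endpoint id. */
    static final class EmbeddingCall implements Expr {
        final Expr input;
        final Expr inferenceId;
        EmbeddingCall(Expr input, Expr inferenceId) {
            this.input = input;
            this.inferenceId = inferenceId;
        }
        /** The call is foldable only when both arguments are constants. */
        boolean foldable() {
            return input instanceof Literal && inferenceId instanceof Literal;
        }
    }

    /** Replaces a foldable call with the literal produced by the evaluator; returns anything else unchanged. */
    static Expr fold(Expr expr, BiFunction<String, String, float[]> evaluator) {
        if (expr instanceof EmbeddingCall) {
            EmbeddingCall call = (EmbeddingCall) expr;
            if (call.foldable()) {
                String text = (String) ((Literal) call.input).value;
                String id = (String) ((Literal) call.inferenceId).value;
                return new Literal(evaluator.apply(text, id));
            }
        }
        return expr;
    }

    /** Stand-in for a real inference call (assumption): builds a fake 3-dimensional vector from the text length. */
    static float[] fakeEmbedding(String text, String inferenceId) {
        float n = text.length();
        return new float[] { n, n / 2, n / 4 };
    }
}
```

In this sketch, `FoldingSketch.fold(new FoldingSketch.EmbeddingCall(new FoldingSketch.Literal("hello world"), new FoldingSketch.Literal("model1")), FoldingSketch::fakeEmbedding)` returns a `Literal` holding a vector, while a call whose input is a `Column` is returned untouched because it has no constant value at planning time.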

Part of #131022

elasticsearchmachine · 2025-09-30T16:14:08Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

github-actions · 2025-10-01T08:41:18Z

ℹ️ Important: Docs version tagging

👋 Thanks for updating the docs! Just a friendly reminder that our docs are now cumulative. This means all 9.x versions are documented on the same page and published off of the main branch, instead of creating separate pages for each minor version.

We use applies_to tags to mark version-specific features and changes.

When to use applies_to tags:

✅ At the page level to indicate which products/deployments the content applies to (mandatory)
✅ When features change state (e.g. preview, ga) in a specific version
✅ When availability differs across deployments and environments

What NOT to do:

❌ Don't remove or replace information that applies to an older version
❌ Don't add new information that applies to a specific version without an applies_to tag
❌ Don't forget that applies_to tags can be used at the page, section, and inline level

🤔 Need help?

Check out the cumulative docs guidelines
Reach out in the #docs Slack channel

afoucret · 2025-10-01T08:41:22Z

...test/java/org/elasticsearch/xpack/esql/expression/function/inference/TextEmbeddingTests.java

+import static org.hamcrest.Matchers.equalTo;
+
+@FunctionName("text_embedding")
+public class TextEmbeddingTests extends AbstractFunctionTestCase {


ℹ️ These tests were not added earlier because they caused issues while the dense vector type was under construction.

afoucret · 2025-10-01T08:42:05Z

docs/reference/query-languages/esql/_snippets/functions/description/text_embedding.md

@@ -0,0 +1,6 @@
+% This is generated by ESQL's AbstractFunctionTestCase. Do not edit it. See ../README.md for how to regenerate it.


ℹ️ The doc updates are caused by the dense vector type no longer being snapshot-only.

ioanatia · 2025-10-03T12:20:26Z

x-pack/plugin/esql/qa/testFixtures/src/main/resources/text-embedding.csv-spec

 ;
 // end::embedding-eval[]

+input:keyword       | embedding:dense_vector


Can we get a test with multiple text_embedding calls with different query strings?

Plus, it tests inference functions in the context of FORK.

...in/esql/src/main/java/org/elasticsearch/xpack/esql/inference/InferenceFunctionEvaluator.java

carlosdelest

LGTM 💯

ioanatia · 2025-10-07T07:52:45Z

...rg/elasticsearch/xpack/esql/optimizer/rules/logical/preoptimizer/FoldInferenceFunctions.java

+ * Example transformation:
+ * {@code TEXT_EMBEDDING("hello world", "model1")} → {@code [0.1, 0.2, 0.3, ...]}
+ */
+public class FoldInferenceFunctions implements LogicalPlanPreOptimizerRule {


I guess something like this would also be used for folding COMPLETION when it's used with a foldable prompt?

You are right. There will be something similar for inference plans.
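
To make the idea of a reusable rules mechanism more concrete, here is a small sketch of a pre-optimizer that applies rules in order, so that a second rule (for example one for COMPLETION) could be added next to the folding rule. Everything here is an assumption for illustration: the plan is modeled as a plain `String`, rules are modeled as functions returning a `CompletableFuture` because inference calls are asynchronous, and `PreOptimizerSketch` and `Rule` are hypothetical names rather than the signatures of LogicalPlanPreOptimizer or LogicalPlanPreOptimizerRule.

```java
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.function.Function;

/** Hypothetical sketch of a rule-based pre-optimizer; names and signatures are illustrative only. */
final class PreOptimizerSketch {
    /** A rule rewrites a plan (modeled here as a String) and may complete asynchronously. */
    interface Rule extends Function<String, CompletableFuture<String>> {}

    /** Applies each rule to the output of the previous one, in list order. */
    static CompletableFuture<String> preOptimize(String plan, List<Rule> rules) {
        CompletableFuture<String> result = CompletableFuture.completedFuture(plan);
        for (Rule rule : rules) {
            // thenCompose waits for the previous rule before starting the next one.
            result = result.thenCompose(rule);
        }
        return result;
    }
}
```

Chaining with `thenCompose` keeps the rules independent of each other: each one sees only the plan produced by its predecessor, which is one way a later rule for inference plans could be slotted in without changing the folding rule.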

ioanatia · 2025-10-07T07:54:34Z

...ql/src/test/java/org/elasticsearch/xpack/esql/inference/InferenceFunctionEvaluatorTests.java

+import static org.mockito.Mockito.mock;
+import static org.mockito.Mockito.when;
+
+public class InferenceFunctionEvaluatorTests extends ComputeTestCase {


Should these be skipped for release tests?

I double-checked and it is not necessary.
The test passes in release builds since it does not use features that differ in a release build (function registry, writeable, ...).

afoucret added 3 commits September 30, 2025 15:40

Introducing InferenceFunctionEvaluator to allow folding of constant i…

4c8db9e

…nference functions.

Fold text embedding function to a constant during logical plan pre-op…

28e1185

…timization.

Add CSV tests for the TEXT_EMBEDDING function.

ffb3ad4

elasticsearchmachine added the needs:triage and v9.2.0 labels Sep 30, 2025

afoucret added the >non-issue, Team:Search Relevance, and :Search Relevance/ES|QL labels and removed the needs:triage label Sep 30, 2025

afoucret requested review from a team, carlosdelest and ioanatia October 1, 2025 06:45

afoucret added 3 commits October 1, 2025 09:20

Merge branch 'main' of https://github.com/elastic/elasticsearch into …

0c9b903

…esql_text_embedding_function_evaluator

Update the doc generation now that dense vector is enabled.

dd5ec23

lint

aaec4e1

afoucret force-pushed the esql_text_embedding_function_evaluator branch from dc7ac08 to aaec4e1 Compare October 1, 2025 08:39

afoucret commented Oct 1, 2025

afoucret added v9.3.0 and removed v9.2.0 labels Oct 2, 2025

afoucret added 3 commits October 2, 2025 09:28

Merge branch 'main' into esql_text_embedding_function_evaluator

2674d6c

Merge branch 'main' into esql_text_embedding_function_evaluator

3976822

Merge branch 'main' into esql_text_embedding_function_evaluator

4872c0c

ioanatia reviewed Oct 3, 2025

carlosdelest reviewed Oct 3, 2025

x-pack/plugin/esql/qa/testFixtures/src/main/resources/text-embedding.csv-spec (outdated, resolved)

...in/esql/src/main/java/org/elasticsearch/xpack/esql/inference/InferenceFunctionEvaluator.java (resolved)

afoucret added 3 commits October 3, 2025 18:20

Merge branch 'main' of https://github.com/elastic/elasticsearch into …

7f9e9f4

…esql_text_embedding_function_evaluator

Update renamed capability dense_vector_field_type_released in CSV tests.

69928fb

Adding a CSV tests with fork.

bf7d5df

carlosdelest approved these changes Oct 3, 2025

afoucret and others added 7 commits October 6, 2025 10:01

Merge branch 'main' into esql_text_embedding_function_evaluator

8656357

Merge branch 'main' into esql_text_embedding_function_evaluator

029cee6

Merge branch 'main' into esql_text_embedding_function_evaluator

7e97208

Merge branch 'main' into esql_text_embedding_function_evaluator

5fc7461

Merge branch 'main' into esql_text_embedding_function_evaluator

a2822e3

Fixing flakiness in CSV tests.

2b45433

Merge branch 'main' into esql_text_embedding_function_evaluator

f475dbb

ioanatia approved these changes Oct 7, 2025

afoucret enabled auto-merge (squash) October 7, 2025 08:10

afoucret merged commit f9c72c1 into elastic:main Oct 7, 2025
34 checks passed

afoucret mentioned this pull request Oct 8, 2025

ES|QL: Add TEXT_EMBEDDING function #131022

Closed

6 tasks

astefan mentioned this pull request Oct 8, 2025

[CI] EsqlSpecIT test {csv-spec:math.NegateIntLongDouble} failing #136089

Closed

4 participants