Adding support for specifying embedding type to Jina AI service settings #121548

ymao1 · 2025-02-03T17:49:44Z

Summary

Adding the ability to specify embedding_type in the Jina AI service settings when creating a Jina AI inference endpoint.

Usage

If embedding_type is not specified, it defaults to the float embedding type (same as the previous behavior).
Allowed types: ['float', 'bit', 'binary']
Specifying embedding_type: 'bit' or embedding_type: 'binary' returns results under the text_embedding_bits key (same as Cohere).

# Create a Jina AI binary inference endpoint
PUT /_inference/text_embedding/jina_embeddings_bit
{
    "service": "jinaai",
    "service_settings": {
        "api_key": "<apiKey>",
        "model_id": "jina-embeddings-v3",
        "embedding_type": "bit"
    }
}

# Perform an inference task
POST /_inference/text_embedding/jina_embeddings_bit
{
    "input": "hello",
    "task_settings": {
        "input_type": "ingest"
    }
}

# Response
{
    "text_embedding_bits": [
        {
            "embedding": [
                -55,
                74,
                101,
                67,
                83,
                1,
                53,
                -101,
                -71,
                -98,
                -116,
                -99,
                80,
                -49,
                65,
                .
                .
                .
            ]
        }
    ]
}
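
The usage rules above can be summarized in a short Java sketch. It is illustrative only and is not the code added by this PR: the class name, the EmbeddingType enum, and the fromString and responseKey helpers are all hypothetical, and the "text_embedding" key used for float results is an assumption that this description does not state.

```java
public class JinaEmbeddingTypeSketch {
    // Hypothetical enum for illustration; not the class added by this PR.
    enum EmbeddingType {
        FLOAT, BIT, BINARY;

        // A missing value falls back to FLOAT, matching the previous behavior.
        static EmbeddingType fromString(String value) {
            if (value == null) {
                return FLOAT;
            }
            switch (value) {
                case "float":
                    return FLOAT;
                case "bit":
                    return BIT;
                case "binary":
                    return BINARY;
                default:
                    throw new IllegalArgumentException(
                        "unknown embedding_type [" + value + "], allowed: [float, bit, binary]");
            }
        }

        // 'bit' and 'binary' share the text_embedding_bits key.
        // The "text_embedding" key for float is an assumption, not taken from this PR.
        String responseKey() {
            return this == FLOAT ? "text_embedding" : "text_embedding_bits";
        }
    }

    public static void main(String[] args) {
        System.out.println(EmbeddingType.fromString(null).responseKey());
        System.out.println(EmbeddingType.fromString("binary").responseKey());
    }
}
```

Treating 'bit' and 'binary' as two names for the same result shape keeps the response handling identical for both, which is what the usage notes describe.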

Notes

As with Cohere, requesting a binary embedding from Jina AI returns an array of binary embeddings packed as bytes with int8 precision. Since this matches the input expected by a dense_vector mapping with element_type: bit, we do not perform any bit unpacking on the response and handle the bytes as-is.
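
To make the packing concrete, here is a small sketch of what unpacking would look like if it were done, using the first two values from the example response. The PR itself does no unpacking. The class and method names are hypothetical, and the most-significant-bit-first ordering is an assumption made for illustration.

```java
public class BitEmbeddingSketch {
    // Expands packed int8 values into individual bits.
    // MSB-first ordering within each byte is an assumption for illustration.
    static int[] unpackBits(byte[] packed) {
        int[] bits = new int[packed.length * 8];
        for (int i = 0; i < packed.length; i++) {
            int unsigned = packed[i] & 0xFF; // reinterpret the signed byte as 0..255
            for (int b = 0; b < 8; b++) {
                bits[i * 8 + b] = (unsigned >> (7 - b)) & 1;
            }
        }
        return bits;
    }

    public static void main(String[] args) {
        // First two values from the example response: -55 and 74.
        byte[] packed = { -55, 74 };
        StringBuilder sb = new StringBuilder();
        for (int bit : unpackBits(packed)) {
            sb.append(bit);
        }
        System.out.println(sb); // prints 1100100101001010
    }
}
```

Each signed byte carries eight dimensions, so -55 (0xC9 as an unsigned value) stands for the bits 11001001. Keeping the bytes packed avoids this expansion entirely.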

elasticsearchmachine · 2025-02-04T19:28:33Z

Hi @ymao1, I've created a changelog YAML for you.

elasticsearchmachine · 2025-02-05T12:15:26Z

Pinging @elastic/ml-core (Team:ML)

jonathan-buttner · 2025-02-05T19:16:04Z

...lasticsearch/xpack/inference/services/jinaai/embeddings/JinaAIEmbeddingsServiceSettings.java

        out.writeOptionalVInt(dimensions);
        out.writeOptionalVInt(maxInputTokens);
+
+        if (out.getTransportVersion().onOrAfter(TransportVersions.JINA_AI_EMBEDDING_TYPE_SUPPORT_ADDED)


Since we use this if logic a few times, how about we move it into its own function?

Something like isEmbeddingTypeSupported(TransportVersion version).
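
A rough sketch of the suggested helper follows. To keep it self-contained, it uses a minimal stand-in for TransportVersion rather than the real Elasticsearch class, and the numeric id given to JINA_AI_EMBEDDING_TYPE_SUPPORT_ADDED is a placeholder, not the real value. Only the helper name and the onOrAfter check come from the diff and the comment above.

```java
public class EmbeddingTypeVersionGateSketch {
    // Minimal stand-in for Elasticsearch's TransportVersion so the sketch compiles on its own.
    static final class TransportVersion {
        private final int id;

        TransportVersion(int id) {
            this.id = id;
        }

        boolean onOrAfter(TransportVersion other) {
            return id >= other.id;
        }
    }

    // Placeholder id; the real constant lives in TransportVersions and its value is not shown here.
    static final TransportVersion JINA_AI_EMBEDDING_TYPE_SUPPORT_ADDED = new TransportVersion(100);

    // One place for the version check that is otherwise repeated in several spots.
    static boolean isEmbeddingTypeSupported(TransportVersion version) {
        return version.onOrAfter(JINA_AI_EMBEDDING_TYPE_SUPPORT_ADDED);
    }

    public static void main(String[] args) {
        System.out.println(isEmbeddingTypeSupported(new TransportVersion(99)));
        System.out.println(isEmbeddingTypeSupported(new TransportVersion(100)));
    }
}
```

With a helper like this, each call site would reduce to a single readable condition such as isEmbeddingTypeSupported(out.getTransportVersion()).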


ymao1 · 2025-03-04T14:51:04Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Questions ?

Please refer to the Backport tool documentation

…ngs (elastic#121548)
* Adding embeddings type to Jina AI service settings
* Update docs/changelog/121548.yaml
* Setting default similarity to L2 norm for binary embedding type
(cherry picked from commit 6b2e566)
# Conflicts:
# server/src/main/java/org/elasticsearch/TransportVersions.java

… settings (#121548) (#124010)
* Adding support for specifying embedding type to Jina AI service settings (#121548)
* Adding embeddings type to Jina AI service settings
* Update docs/changelog/121548.yaml
* Setting default similarity to L2 norm for binary embedding type
(cherry picked from commit 6b2e566)
# Conflicts:
# server/src/main/java/org/elasticsearch/TransportVersions.java
* [CI] Auto commit changes from spotless
---------
Co-authored-by: elasticsearchmachine <[email protected]>

…ngs (elastic#121548)
* Adding embeddings type to Jina AI service settings
* Update docs/changelog/121548.yaml
* Setting default similarity to L2 norm for binary embedding type

elasticsearchmachine added the v9.1.0 label Feb 3, 2025

ymao1 force-pushed the jina-bit-embeddings branch 3 times, most recently from 51f5fce to 00ba02e Compare February 4, 2025 19:18

ymao1 changed the title ~~Jina bit embeddings~~ Adding support for specifying embedding type to Jina AI service settings Feb 4, 2025

Adding embeddings type to Jina AI service settings

d4e7a92

ymao1 force-pushed the jina-bit-embeddings branch from 00ba02e to d4e7a92 Compare February 4, 2025 19:26

ymao1 added v8.19.0 >enhancement :ml Machine learning Team:ML Meta label for the ML team labels Feb 4, 2025

Update docs/changelog/121548.yaml

6c856cb

ymao1 marked this pull request as ready for review February 5, 2025 12:15

jonathan-buttner approved these changes Feb 5, 2025


ymao1 added 5 commits February 21, 2025 11:22

Merging in main

3ca707e

Merge branch 'jina-bit-embeddings' of github.com:ymao1/elasticsearch into jina-bit-embeddings

6b3d4f1

Setting default similarity to L2 norm for binary embedding type

ca90d62

Merging in main

f0e1e36

Merging in main

8146e60

ymao1 merged commit 6b2e566 into elastic:main Mar 3, 2025
17 checks passed

ymao1 deleted the jina-bit-embeddings branch March 3, 2025 18:00

ymao1 added the auto-backport Automatically create backport pull requests when merged label Mar 3, 2025

ymao1 mentioned this pull request Mar 3, 2025

Support for bit precision in the Inference API text_embedding task #111747

Open

davidkyle mentioned this pull request Mar 4, 2025

[ML] Support binary embeddings for Voyage AI #123983

Open

ymao1 mentioned this pull request Mar 4, 2025

[8.x] Adding support for specifying embedding type to Jina AI service settings (#121548) #124010

Merged
