Skip to content

text-embeddings-inference 1.9.0#267964

Merged
chenrui333 merged 2 commits intomainfrom
bump-text-embeddings-inference-1.9.0
Feb 17, 2026
Merged

text-embeddings-inference 1.9.0#267964
chenrui333 merged 2 commits intomainfrom
bump-text-embeddings-inference-1.9.0

Conversation

@BrewTestBot
Copy link
Contributor

Created by brew bump


Created with brew bump-formula-pr.

Details

release notes
text-embeddings-inference-v1 9 0

What's changed?

🚨 Breaking changes

Default GeLU implementation is now GeLU + tanh approximation instead of exact GeLU (aka. GeLU erf) to make sure that the CPU and CUDA embeddings are the same (as cuBLASlt only supports GeLU + tanh), which represents a slight misalignment from how Transformers handles it, as when hidden_act="gelu" is set in config.json, GeLU erf should be used. The numerical differences between GeLU + tanh and GeLU erf should have negligible impact on inference quality.

--auto-truncate now defaults to true, meaning that the sequences will be truncated to the lower value between the --max-batch-tokens or the maximum model length, to prevent the --max-batch-tokens from being lower than the actual maximum supported length.

🎉 Additions

🐛 Fixes

⚡ Improvements

📄 Other

🆕 New Contributors

Full Changelog: huggingface/text-embeddings-inference@v1.8.3...v1.9.0

View the full release notes at https://github.com/huggingface/text-embeddings-inference/releases/tag/v1.9.0.


@github-actions github-actions bot added rust Rust use is a significant feature of the PR or issue bump-formula-pr PR was created using `brew bump-formula-pr` labels Feb 17, 2026
@github-actions
Copy link
Contributor

🤖 An automated task has requested bottles to be published to this PR.

Caution

Please do not push to this PR branch before the bottle commits have been pushed, as this results in a state that is difficult to recover from. If you need to resolve a merge conflict, please use a merge commit. Do not force-push to this PR branch.

@github-actions
Copy link
Contributor

⚠️ Bottle publish failed.

@github-actions
Copy link
Contributor

🤖 An automated task has requested bottles to be published to this PR.

Caution

Please do not push to this PR branch before the bottle commits have been pushed, as this results in a state that is difficult to recover from. If you need to resolve a merge conflict, please use a merge commit. Do not force-push to this PR branch.

@github-actions
Copy link
Contributor

⚠️ Bottle publish failed.

@chenrui333 chenrui333 added the CI-published-bottle-commits The commits for the built bottles have been pushed to the PR branch. label Feb 17, 2026
@chenrui333 chenrui333 enabled auto-merge February 17, 2026 16:43
@chenrui333 chenrui333 added this pull request to the merge queue Feb 17, 2026
Merged via the queue into main with commit ce0f1d3 Feb 17, 2026
38 of 46 checks passed
@chenrui333 chenrui333 deleted the bump-text-embeddings-inference-1.9.0 branch February 17, 2026 16:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bump-formula-pr PR was created using `brew bump-formula-pr` CI-published-bottle-commits The commits for the built bottles have been pushed to the PR branch. rust Rust use is a significant feature of the PR or issue

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants