Release 2025-05-08: Default to text-embedding-3-large with compression, GlobalStandard SKU · Azure-Samples/azure-search-openai-demo

This release upgrades the infrastructure and code to default to the text-embedding-3-large model from OpenAI. The model has a maximum dimensions of 3072, but we are using BinaryQuantizationCompression and truncating the dimensions to 1024, with oversampling and rescoring enabled. That means the embeddings will be stored efficiently, but search quality should remain high.
Learn more about compression from this RAG time episode or Azure AI Search documentation.

If you are already using the repository and don't wish to use the new embedding model, you can continue to use text-embedding-ada-002. You may need to set azd environment variables if they aren't already set, see the embedding models customization guide. If you want to switch over to the new embedding model, you will either need to re-ingest your data from scratch in a new index, or you will need to add a new field for the new model and re-generate embeddings for just that field. The code now has a variable for the embedding column field, so it should be possible to have a search index with fields for two different embedding models.

As part of this change, all model deploments now default to the GlobalStandard SKU. We made that change since it is easier to find regions in common across the many models used by this repository when using the GlobalStandard SKU. However, if you can't use that SKU for whatever reason, you can still customize the SKU using the parameters described in the documentation.

Please let us know in the issue tracker if you encounter any issues with the new default embedding model configuration.

What's Changed

Upgrade syntax to Python 3.9 by @tonybaloney in #2484
Remove outdated docs by @pamelafox in #2492
Use ENFORCE_ACCESS_CONTROL to decide whether to make acls by @pamelafox in #2494
Bump idna from 3.8 to 3.10 by @dependabot in #2464
Bump vite from 5.4.14 to 5.4.18 in /app/frontend by @dependabot in #2486
Bump types-html5lib from 1.1.11.20240806 to 1.1.11.20241018 by @dependabot in #2462
Bump msal-extensions from 1.2.0 to 1.3.1 by @dependabot in #2463
Update reasoning docs to include API version by @pamelafox in #2499
Bump @babel/runtime from 7.25.6 to 7.27.0 in /app/frontend by @dependabot in #2497
Upgrade Bicep versions of resources by @pamelafox in #2500
Add missing output for reasoning effort, updated evals including o3-mini by @pamelafox in #2501
Resolve datetime deprecation warnings by @emmanuel-ferdman in #2502
Upgrade to text-embedding-3-large model as default, with vector storage optimizations by @pamelafox in #2470
Update evals requirements by @pamelafox in #2528
Raise minimum node version by @pamelafox in #2519
Add migration script for Azure Cosmos DB, old container to new container by @pamelafox in #2442
Bump astral-sh/setup-uv from 5 to 6 in the github-actions group by @dependabot in #2512

New Contributors

@emmanuel-ferdman made their first contribution in #2502

Full Changelog: 2025-04-02...2025-05-08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

2025-05-08: Default to text-embedding-3-large with compression, GlobalStandard SKU

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

What's Changed

New Contributors

Contributors

Uh oh!