2025-05-08: Default to text-embedding-3-large with compression, GlobalStandard SKU
This release upgrades the infrastructure and code to default to the text-embedding-3-large model from OpenAI. The model supports up to 3072 dimensions, but we use BinaryQuantizationCompression and truncate the dimensions to 1024, with oversampling and rescoring enabled. As a result, the embeddings are stored efficiently while search quality should remain high.
Learn more about compression from this RAG time episode or Azure AI Search documentation.
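To make the compression settings above more concrete, here is a minimal conceptual sketch in plain Python. It is not the Azure AI Search implementation: the function names, the Hamming-distance first pass, and the choice to rescore against the full original vectors are all assumptions made for illustration. It shows the general idea of truncating a vector, reducing it to one bit per dimension, oversampling candidates with the cheap binary comparison, and then rescoring those candidates at full precision.

```python
import math
import random


def truncate_and_normalize(vector, dimensions):
    """Keep the first `dimensions` components and rescale to unit length."""
    head = vector[:dimensions]
    norm = math.sqrt(sum(x * x for x in head)) or 1.0
    return [x / norm for x in head]


def binary_quantize(vector):
    """Pack one bit per component (1 if positive, else 0) into a Python int."""
    bits = 0
    for x in vector:
        bits = (bits << 1) | (1 if x > 0 else 0)
    return bits


def hamming_distance(a, b):
    """Count the bit positions where two packed codes differ."""
    return bin(a ^ b).count("1")


def cosine_similarity(a, b):
    """Full-precision cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def search(query, documents, k=3, oversampling=4, dimensions=1024):
    """Return indices of the top-k documents for `query`.

    Illustrative only: a cheap binary pass picks k * oversampling
    candidates, which are then rescored with the original vectors.
    """
    query_code = binary_quantize(truncate_and_normalize(query, dimensions))
    codes = [
        binary_quantize(truncate_and_normalize(doc, dimensions))
        for doc in documents
    ]
    # First pass: rank every document by Hamming distance on the binary codes.
    by_hamming = sorted(
        range(len(documents)),
        key=lambda i: hamming_distance(query_code, codes[i]),
    )
    candidates = by_hamming[: k * oversampling]
    # Second pass: rescore only the candidates at full precision.
    rescored = sorted(
        candidates,
        key=lambda i: cosine_similarity(query, documents[i]),
        reverse=True,
    )
    return rescored[:k]


# Usage with synthetic 3072-dimensional vectors (random data, not real embeddings).
random.seed(0)
docs = [[random.gauss(0, 1) for _ in range(3072)] for _ in range(50)]
top = search(docs[5], docs, k=3)
```

In this sketch each stored code takes 1024 bits instead of 3072 floating-point values, which is where the storage savings come from; the oversampling factor trades a little extra rescoring work for a better chance that the true nearest neighbors survive the coarse first pass.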
If you are already using the repository and don't wish to use the new embedding model, you can continue to use text-embedding-ada-002. You may need to set azd environment variables if they aren't already set; see the embedding models customization guide. If you want to switch over to the new embedding model, you will need to either re-ingest your data from scratch into a new index, or add a new field for the new model and re-generate embeddings for just that field. The code now has a variable for the embedding field name, so it should be possible to have a search index with fields for two different embedding models.
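As a rough illustration of the two-field option, the sketch below keeps one configuration per embedding model and builds a vector query that targets the matching field. Everything named here is hypothetical: the `EmbeddingConfig` class, the field names `embedding` and `embedding3`, and the `build_vector_query` helper are not taken from the repository, and the query dictionary is only loosely shaped after a vector query payload. Check the customization guide for the actual variable names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EmbeddingConfig:
    """Hypothetical pairing of an embedding model with its index field."""
    model: str
    dimensions: int
    field: str


# Field names below are assumptions chosen for illustration only.
LEGACY = EmbeddingConfig("text-embedding-ada-002", 1536, "embedding")
UPGRADED = EmbeddingConfig("text-embedding-3-large", 3072, "embedding3")


def build_vector_query(config, query_vector, k=50):
    """Build a vector query dict aimed at the field for the given model."""
    if len(query_vector) != config.dimensions:
        raise ValueError(
            f"{config.model} expects {config.dimensions} dimensions, "
            f"got {len(query_vector)}"
        )
    return {
        "kind": "vector",
        "vector": query_vector,
        "fields": config.field,
        "k": k,
    }


# Usage: the same index can be queried through either field,
# as long as the query vector comes from the matching model.
legacy_query = build_vector_query(LEGACY, [0.0] * 1536)
upgraded_query = build_vector_query(UPGRADED, [0.0] * 3072)
```

The dimension check matters because a query embedded with one model cannot be compared meaningfully against a field populated by the other.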
As part of this change, all model deployments now default to the GlobalStandard SKU. We made that change because the GlobalStandard SKU makes it easier to find regions in common across the many models used by this repository. If you can't use that SKU, you can still customize it using the parameters described in the documentation.
Please let us know in the issue tracker if you run into any problems with the new default embedding model configuration.
What's Changed
- Upgrade syntax to Python 3.9 by @tonybaloney in #2484
- Remove outdated docs by @pamelafox in #2492
- Use ENFORCE_ACCESS_CONTROL to decide whether to make acls by @pamelafox in #2494
- Bump idna from 3.8 to 3.10 by @dependabot in #2464
- Bump vite from 5.4.14 to 5.4.18 in /app/frontend by @dependabot in #2486
- Bump types-html5lib from 1.1.11.20240806 to 1.1.11.20241018 by @dependabot in #2462
- Bump msal-extensions from 1.2.0 to 1.3.1 by @dependabot in #2463
- Update reasoning docs to include API version by @pamelafox in #2499
- Bump @babel/runtime from 7.25.6 to 7.27.0 in /app/frontend by @dependabot in #2497
- Upgrade Bicep versions of resources by @pamelafox in #2500
- Add missing output for reasoning effort, updated evals including o3-mini by @pamelafox in #2501
- Resolve datetime deprecation warnings by @emmanuel-ferdman in #2502
- Upgrade to text-embedding-3-large model as default, with vector storage optimizations by @pamelafox in #2470
- Update evals requirements by @pamelafox in #2528
- Raise minimum node version by @pamelafox in #2519
- Add migration script for Azure Cosmos DB, old container to new container by @pamelafox in #2442
- Bump astral-sh/setup-uv from 5 to 6 in the github-actions group by @dependabot in #2512
New Contributors
- @emmanuel-ferdman made their first contribution in #2502
Full Changelog: 2025-04-02...2025-05-08