Summary: Meilisearch Indexing Changes #722

## Summary of Meilisearch Indexing Changes

### Workflow Changes

* **New populate-search-engine command** (#685): Replaced the old `Daily Build Embeddings` matrix job with a single command that fetches pre-built docs from the `hf-doc-build/doc-build` dataset and splits the markdown into chunks at heading boundaries.
* **Improved document ID generation** (#719): Changed to a readable `{library}-{page}-{hash}` format instead of full SHA1 hashes.
* **Migration scripts for index management** (#718): Added scripts for clearing and creating Meilisearch indexes.
* **Index swap script** (#720): Added script to swap indexes atomically.
* **Simplified workflow** (#717): Removed the cleanup job that previously handled success/failure scenarios with automatic index swapping.
* **Refactored embedding inference** (#711): Updated to use a direct URL and token approach (`HF_IE_URL`) instead of the previous name and namespace pattern.
* **Incremental embeddings mode** (#737): Added an `--incremental` flag that processes only new or changed documents. Document IDs are tracked in the `hf-doc-build/doc-builder-embeddings-tracker` dataset, and stale entries are removed automatically when pages are updated or deleted. This reduces cost by not re-embedding unchanged documents.
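
The readable ID format from #719 can be illustrated with a minimal sketch. The helper name `make_document_id`, the use of a truncated SHA-1 of the chunk text as the hash component, the 8-character length, and the slug rule are all assumptions for illustration, not the code from the PR. The slug rule keeps IDs within the characters Meilisearch accepts for string primary keys (letters, digits, hyphens, underscores).

```python
import hashlib
import re


def make_document_id(library: str, page: str, text: str, hash_len: int = 8) -> str:
    """Build a readable ID of the form {library}-{page}-{hash} (hypothetical helper)."""

    def slug(value: str) -> str:
        # Collapse every run of characters outside [A-Za-z0-9_-] into one hyphen.
        return re.sub(r"[^A-Za-z0-9_-]+", "-", value).strip("-")

    # Assumption: the hash component is a truncated SHA-1 of the chunk text.
    digest = hashlib.sha1(text.encode("utf-8")).hexdigest()[:hash_len]
    return f"{slug(library)}-{slug(page)}-{digest}"
```

For example, `make_document_id("transformers", "main_classes/pipeline", chunk_text)` yields an ID that starts with `transformers-main_classes-pipeline-` and ends with eight hex characters, which is easier to trace back to a page than a bare SHA-1 digest.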

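The bookkeeping behind an incremental run can be sketched as a set difference between the IDs recorded in the tracker and the IDs produced by the current build. This is only an illustration: `plan_incremental_update` and `IncrementalPlan` are hypothetical names, and the sketch assumes each ID changes whenever the chunk content changes, so an edited chunk appears as one new ID plus one stale ID.

```python
from typing import NamedTuple


class IncrementalPlan(NamedTuple):
    to_embed: set[str]   # IDs in the current build but not in the tracker
    to_delete: set[str]  # stale IDs in the tracker but not in the current build
    unchanged: set[str]  # IDs present in both; no re-embedding needed


def plan_incremental_update(tracked_ids: set[str], current_ids: set[str]) -> IncrementalPlan:
    """Compare tracked IDs with the current build's IDs (hypothetical helper)."""
    return IncrementalPlan(
        to_embed=current_ids - tracked_ids,
        to_delete=tracked_ids - current_ids,
        unchanged=tracked_ids & current_ids,
    )


# Example: chunk "b" was edited (new hash) and chunk "c" was deleted.
plan = plan_incremental_update(
    tracked_ids={"lib-a-1111", "lib-b-2222", "lib-c-3333"},
    current_ids={"lib-a-1111", "lib-b-4444"},
)
# plan.to_embed  == {"lib-b-4444"}
# plan.to_delete == {"lib-b-2222", "lib-c-3333"}
# plan.unchanged == {"lib-a-1111"}
```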
---

## TODO

- [x] Initial population of the docs semantic search index
- [x] Use this new vector index for hf.co/docs embedding endpoints
- [x] Create PR that will efficiently add vectors only to changed pages (https://github.com/huggingface/doc-builder/pull/759)
