Leonardo Xavier Kuffo Rivero edited this page Mar 25, 2026 · 10 revisions

Super K-Means

SuperKMeans is a fast library for clustering high-dimensional vector embeddings. Its main use case is building partition-based indexes (e.g., IVF) over large vector collections.
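For readers new to partition-based indexes, here is a minimal sketch of the idea in plain Python. It does not use the SuperKMeans API: every function name here (`kmeans`, `build_ivf`, `search`) and the `nprobe` parameter are hypothetical, chosen for illustration, and the clustering is a textbook Lloyd's k-means rather than the algorithm this library implements.

```python
import random

def squared_l2(a, b):
    # Squared Euclidean distance between two equal-length vectors.
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(centroids, v):
    # Index of the centroid closest to v (first one wins on ties).
    return min(range(len(centroids)), key=lambda c: squared_l2(centroids[c], v))

def kmeans(vectors, k, iters=10, seed=0):
    # Textbook Lloyd's k-means (illustrative only, not the SuperKMeans algorithm).
    rng = random.Random(seed)
    centroids = [list(v) for v in rng.sample(vectors, k)]
    for _ in range(iters):
        buckets = [[] for _ in range(k)]
        for v in vectors:
            buckets[nearest(centroids, v)].append(v)
        for c, members in enumerate(buckets):
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids

def build_ivf(vectors, centroids):
    # Inverted lists: for each centroid, the ids of the vectors assigned to it.
    lists = [[] for _ in centroids]
    for i, v in enumerate(vectors):
        lists[nearest(centroids, v)].append(i)
    return lists

def search(query, vectors, centroids, lists, nprobe=2):
    # Scan only the nprobe partitions whose centroids are closest to the query.
    order = sorted(range(len(centroids)), key=lambda c: squared_l2(centroids[c], query))
    candidates = [i for c in order[:nprobe] for i in lists[c]]
    return min(candidates, key=lambda i: squared_l2(vectors[i], query), default=None)

# Toy data: 200 random 8-dimensional vectors split into 4 partitions.
rng = random.Random(1)
vectors = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(200)]
centroids = kmeans(vectors, k=4)
lists = build_ivf(vectors, centroids)
hit = search(vectors[0], vectors, centroids, lists, nprobe=1)
```

The clustering step dominates the cost of building such an index, which is the step a dedicated k-means library is meant to accelerate. At query time, a larger `nprobe` scans more partitions, trading speed for recall.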

We offer two clustering options: SuperKMeans and Hierarchical SuperKMeans.

Use Super K-Means if:

  • You need to index a large collection of high-dimensional (d > 128) vector embeddings
  • You need a lightweight and much faster alternative to FAISS clustering

How to Install

Check INSTALL.md.

Documentation (C++ API)

Home

Quickstart

Usage example in C++
Usage example in Python

C++ API Documentation

SuperKMeans
Hierarchical SuperKMeans

Comparisons

(coming soon)
