Skip to content

Commit ea837a5

Browse files
Sentence Transformers v5.0 - Sparse Encoder Models (#2924)
* Add an initial draft for Sentence Transformers v5.0 * Finish the Trainer section, add evaluation section * Add a section on what Sparse Embedding models are * Add table of contents * Relocate Contrastive Sparse Representation (CSR) section and Inference-free Splade for clarity * Update table of contents * Add Vector Database Integration section with Qdrant example + cite CSR paper * Add training examples and resources for sparse embedding models * Add some commas * Unindent inference script * Use human-first titles in additional resources * Add section on pretrained sparse encoders and SPLADE Models collection * Update inference-free Splade implementation to use SparseStaticEmbedding module and chnage lambda associated code * corpus_regularizer -> document_regularizer * Add _blog.yml with release date * Remove incorrect documentation list item * Update model link * Update SPLADE Models collection link * CSRSparsity -> SparseAutoEncoder * "generally" use expansion * Update variable name * Fix link * Update link to Loss Overview * Update HPO link --------- Co-authored-by: arthurbr11 <[email protected]>
1 parent bfcde35 commit ea837a5

File tree

2 files changed

+864
-0
lines changed

2 files changed

+864
-0
lines changed

_blog.yml

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6228,3 +6228,14 @@
62286228
- community
62296229
- research
62306230
- multimodal
6231+
6232+
- local: train-sparse-encoder
6233+
title: "Training and Finetuning Sparse Embedding Models with Sentence Transformers v5"
6234+
author: tomaarsen
6235+
thumbnail: /blog/assets/train-sentence-transformers/st-hf-thumbnail.png
6236+
date: Jul 1, 2025
6237+
tags:
6238+
- nlp
6239+
- guide
6240+
- community
6241+
- open-source

0 commit comments

Comments
 (0)