Skip to content

Commit 2ac056f

Browse files
committed
Added Indo-e5 Model
1 parent 889b2ae commit 2ac056f

File tree

2 files changed

+18
-0
lines changed

2 files changed

+18
-0
lines changed

README.md

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -54,6 +54,7 @@ Like SimCSE, [ConGen: Unsupervised Control and Generalization Distillation For S
5454
| [ConGen-IndoBERT Lite Base](https://huggingface.co/LazarusNLP/congen-indobert-lite-base) | 12M | [IndoBERT Lite Base](https://huggingface.co/indobenchmark/indobert-lite-base-p1) | [paraphrase-multilingual-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | [Wikipedia](https://huggingface.co/datasets/LazarusNLP/wikipedia_id_20230520) | |
5555
| [ConGen-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-indobert-base) | 125M | [IndoBERT Base](https://huggingface.co/indobenchmark/indobert-base-p1) | [paraphrase-multilingual-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | [Wikipedia](https://huggingface.co/datasets/LazarusNLP/wikipedia_id_20230520) | |
5656
| [ConGen-SimCSE-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-simcse-indobert-base) | 125M | [SimCSE-IndoBERT Base](https://huggingface.co/LazarusNLP/simcse-indobert-base) | [paraphrase-multilingual-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | [Wikipedia](https://huggingface.co/datasets/LazarusNLP/wikipedia_id_20230520) | |
57+
| [ConGen-Indo-e5 Small](https://huggingface.co/LazarusNLP/congen-indo-e5-small) | 118M | [multilingual-e5-small](https://huggingface.co/intfloat/multilingual-e5-small) | [paraphrase-multilingual-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | [Wikipedia](https://huggingface.co/datasets/LazarusNLP/wikipedia_id_20230520) | |
5758
| [SCT-IndoBERT Base](https://huggingface.co/LazarusNLP/sct-indobert-base) | 125M | [IndoBERT Base](https://huggingface.co/indobenchmark/indobert-base-p1) | [paraphrase-multilingual-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | [Wikipedia](https://huggingface.co/datasets/LazarusNLP/wikipedia_id_20230520) | |
5859
| [S-IndoBERT Base mMARCO](https://huggingface.co/LazarusNLP/s-indobert-base-mmarco) | 125M | [IndoBERT Base](https://huggingface.co/indobenchmark/indobert-base-p1) | N/A | [mMARCO](https://huggingface.co/datasets/unicamp-dl/mmarco) ||
5960
| [distiluse-base-multilingual-cased-v2](https://huggingface.co/sentence-transformers/distiluse-base-multilingual-cased-v2) | 134M | [DistilBERT Base Multilingual](https://huggingface.co/distilbert-base-multilingual-cased) | mUSE | See: [SBERT](https://www.sbert.net/docs/pretrained_models.html#model-overview) ||
@@ -76,6 +77,7 @@ Like SimCSE, [ConGen: Unsupervised Control and Generalization Distillation For S
7677
| [ConGen-IndoBERT Lite Base](https://huggingface.co/LazarusNLP/congen-indobert-lite-base) | 79.97 |
7778
| [ConGen-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-indobert-base) | 80.47 |
7879
| [ConGen-SimCSE-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-simcse-indobert-base) | 81.16 |
80+
| [ConGen-Indo-e5 Small](https://huggingface.co/LazarusNLP/congen-indo-e5-small) | 80.94 |
7981
| [SCT-IndoBERT Base](https://huggingface.co/LazarusNLP/sct-indobert-base) | 74.56 |
8082
| [S-IndoBERT Base mMARCO](https://huggingface.co/LazarusNLP/s-indobert-base-mmarco) | 72.95 |
8183
| [distiluse-base-multilingual-cased-v2](https://huggingface.co/sentence-transformers/distiluse-base-multilingual-cased-v2) | 75.08 |
@@ -94,6 +96,7 @@ Like SimCSE, [ConGen: Unsupervised Control and Generalization Distillation For S
9496
| [ConGen-IndoBERT Lite Base](https://huggingface.co/LazarusNLP/congen-indobert-lite-base) | 46.04 | 59.06 | 51.01 |
9597
| [ConGen-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-indobert-base) | 45.93 | 58.58 | 49.95 |
9698
| [ConGen-SimCSE-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-simcse-indobert-base) | 45.83 | 58.27 | 49.91 |
99+
| [ConGen-Indo-e5 Small](https://huggingface.co/LazarusNLP/congen-indo-e5-small) | 55.00 | 66.74 | 58.95 |
97100
| [SCT-IndoBERT Base](https://huggingface.co/LazarusNLP/sct-indobert-base) | 40.41 | 47.29 | 40.68 |
98101
| [distiluse-base-multilingual-cased-v2](https://huggingface.co/sentence-transformers/distiluse-base-multilingual-cased-v2) | 41.35 | 54.93 | 48.79 |
99102
| [paraphrase-multilingual-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | 52.81 | 65.07 | 57.97 |
@@ -109,6 +112,7 @@ Like SimCSE, [ConGen: Unsupervised Control and Generalization Distillation For S
109112
| [ConGen-IndoBERT Lite Base](https://huggingface.co/LazarusNLP/congen-indobert-lite-base) | 75.22 | 81.55 | 84.13 |
110113
| [ConGen-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-indobert-base) | 73.09 | 80.32 | 83.29 |
111114
| [ConGen-SimCSE-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-simcse-indobert-base) | 72.38 | 79.37 | 82.51 |
115+
| [ConGen-Indo-e5 Small](https://huggingface.co/LazarusNLP/congen-indo-e5-small) | 84.60 | 89.30 | 91.27 |
112116
| [SCT-IndoBERT Base](https://huggingface.co/LazarusNLP/sct-indobert-base) | 76.81 | 83.16 | 85.87 |
113117
| [distiluse-base-multilingual-cased-v2](https://huggingface.co/sentence-transformers/distiluse-base-multilingual-cased-v2) | 70.44 | 77.94 | 81.56 |
114118
| [paraphrase-multilingual-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | 81.41 | 87.05 | 89.44 |
@@ -126,6 +130,7 @@ Like SimCSE, [ConGen: Unsupervised Control and Generalization Distillation For S
126130
| [ConGen-IndoBERT Lite Base](https://huggingface.co/LazarusNLP/congen-indobert-lite-base) | 62.41 | 60.94 |
127131
| [ConGen-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-indobert-base) | 61.14 | 60.02 |
128132
| [ConGen-SimCSE-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-simcse-indobert-base) | 60.93 | 59.50 |
133+
| [ConGen-Indo-e5 Small](https://huggingface.co/LazarusNLP/congen-indo-e5-small) | 62.92 | 60.18 |
129134
| [SCT-IndoBERT Base](https://huggingface.co/LazarusNLP/sct-indobert-base) | 55.66 | 54.48 |
130135
| [distiluse-base-multilingual-cased-v2](https://huggingface.co/sentence-transformers/distiluse-base-multilingual-cased-v2) | 55.99 | 52.44 |
131136
| [paraphrase-multilingual-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | 65.43 | 63.55 |
@@ -141,6 +146,7 @@ Like SimCSE, [ConGen: Unsupervised Control and Generalization Distillation For S
141146
| [ConGen-IndoBERT Lite Base](https://huggingface.co/LazarusNLP/congen-indobert-lite-base) | 67.25 | 66.53 |
142147
| [ConGen-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-indobert-base) | 67.72 | 67.32 |
143148
| [ConGen-SimCSE-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-simcse-indobert-base) | 67.12 | 66.64 |
149+
| [ConGen-Indo-e5 Small](https://huggingface.co/LazarusNLP/congen-indo-e5-small) | 66.92 | 66.29 |
144150
| [SCT-IndoBERT Base](https://huggingface.co/LazarusNLP/sct-indobert-base) | 61.89 | 60.97 |
145151
| [distiluse-base-multilingual-cased-v2](https://huggingface.co/sentence-transformers/distiluse-base-multilingual-cased-v2) | 65.25 | 63.45 |
146152
| [paraphrase-multilingual-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | 70.72 | 70.58 |
@@ -156,6 +162,7 @@ Like SimCSE, [ConGen: Unsupervised Control and Generalization Distillation For S
156162
| [ConGen-IndoBERT Lite Base](https://huggingface.co/LazarusNLP/congen-indobert-lite-base) | 58.18 | 58.84 |
157163
| [ConGen-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-indobert-base) | 57.04 | 57.06 |
158164
| [ConGen-SimCSE-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-simcse-indobert-base) | 59.54 | 60.37 |
165+
| [ConGen-Indo-e5 Small](https://huggingface.co/LazarusNLP/congen-indo-e5-small) | 60.00 | 60.52 |
159166
| [SCT-IndoBERT Base](https://huggingface.co/LazarusNLP/sct-indobert-base) | 61.13 | 61.70 |
160167
| [distiluse-base-multilingual-cased-v2](https://huggingface.co/sentence-transformers/distiluse-base-multilingual-cased-v2) | 63.63 | 64.13 |
161168
| [paraphrase-multilingual-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | 63.18 | 63.78 |
@@ -171,6 +178,7 @@ Like SimCSE, [ConGen: Unsupervised Control and Generalization Distillation For S
171178
| [ConGen-IndoBERT Lite Base](https://huggingface.co/LazarusNLP/congen-indobert-lite-base) | 81.2 | 75.59 |
172179
| [ConGen-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-indobert-base) | 85.4 | 82.12 |
173180
| [ConGen-SimCSE-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-simcse-indobert-base) | 83.0 | 78.74 |
181+
| [ConGen-Indo-e5 Small](https://huggingface.co/LazarusNLP/congen-indo-e5-small) | 84.2 | 80.21 |
174182
| [SCT-IndoBERT Base](https://huggingface.co/LazarusNLP/sct-indobert-base) | 82.0 | 76.92 |
175183
| [distiluse-base-multilingual-cased-v2](https://huggingface.co/sentence-transformers/distiluse-base-multilingual-cased-v2) | 78.8 | 73.64 |
176184
| [paraphrase-multilingual-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | 89.6 | **86.56** |
@@ -188,6 +196,7 @@ Like SimCSE, [ConGen: Unsupervised Control and Generalization Distillation For S
188196
| [ConGen-IndoBERT Lite Base](https://huggingface.co/LazarusNLP/congen-indobert-lite-base) | 69.44 | 53.74 |
189197
| [ConGen-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-indobert-base) | 71.14 | 56.35 |
190198
| [ConGen-SimCSE-IndoBERT Base](https://huggingface.co/LazarusNLP/congen-simcse-indobert-base) | 70.80 | 56.59 |
199+
| [ConGen-Indo-e5 Small](https://huggingface.co/LazarusNLP/congen-indo-e5-small) | 70.51 | 55.67 |
191200
| [SCT-IndoBERT Base](https://huggingface.co/LazarusNLP/sct-indobert-base) | 59.82 | 53.41 |
192201
| [distiluse-base-multilingual-cased-v2](https://huggingface.co/sentence-transformers/distiluse-base-multilingual-cased-v2) | 58.48 | 50.50 |
193202
| [paraphrase-multilingual-mpnet-base-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2) | **74.87** | **57.96** |

0 commit comments

Comments
 (0)