Skip to content

Commit 889b2ae

Browse files
committed
Updated ConGen Code
1 parent 2d5c6f6 commit 889b2ae

File tree

2 files changed

+19
-1
lines changed

2 files changed

+19
-1
lines changed

unsupervised_learning/ConGen/README.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -60,6 +60,24 @@ python train_con_gen.py \
6060
--teacher-temp 0.5
6161
```
6262

63+
### Multilingual e5 Small
64+
65+
```sh
66+
python train_con_gen.py \
67+
--model-name intfloat/multilingual-e5-small \
68+
--train-dataset-name LazarusNLP/wikipedia_id_20230520 \
69+
--max-seq-length 128 --min-text-length 150 --max-text-length 500 \
70+
--max-train-samples 1000000 \
71+
--num-epochs 20 \
72+
--train-batch-size 128 \
73+
--early-stopping-patience 7 \
74+
--learning-rate 1e-4 \
75+
--teacher-model-name sentence-transformers/paraphrase-multilingual-mpnet-base-v2 \
76+
--queue-size 65536 \
77+
--student-temp 0.5 \
78+
--teacher-temp 0.5
79+
```
80+
6381
## Results
6482

6583
### STSB-MT-ID

unsupervised_learning/ConGen/train_con_gen.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -183,7 +183,7 @@ def main(args: Args):
183183
epochs=args.num_epochs,
184184
warmup_steps=warmup_steps,
185185
show_progress_bar=True,
186-
optimizer_params={"lr": args.learning_rate, "eps": 1e-6, "correct_bias": False},
186+
optimizer_params={"lr": args.learning_rate, "eps": 1e-6},
187187
output_path=args.output_path,
188188
save_best_model=True,
189189
early_stopping_patience=args.early_stopping_patience,

0 commit comments

Comments
 (0)