Skip to content

Commit a684257

Browse files
Update biencoder example config to use hf dataset
Signed-off-by: Oliver Holworthy <1216955+oliverholworthy@users.noreply.github.com>
1 parent 080ad70 commit a684257

File tree

1 file changed

+1
-3
lines changed

1 file changed

+1
-3
lines changed

examples/biencoder/llama3_2_1b_biencoder.yaml

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -53,9 +53,7 @@ dataloader:
5353
dataset:
5454
_target_: nemo_automodel.components.datasets.llm.make_retrieval_dataset
5555
data_dir_list:
56-
- /adasif/retriever_models_research/training_datasets/nqsh_shuffled_50k.json
57-
- /adasif/retriever_models_research/training_datasets/mldr_en_perc95_small.json
58-
- /adasif/retriever_models_research/training_datasets/miracl_train_es_llama3_1b_4m_512len.json
56+
- hf://nvidia/embed-nemotron-dataset-v1/FEVER
5957
data_type: train
6058
train_n_passages: 5
6159
eval_negative_size: 4

0 commit comments

Comments
 (0)