Skip to content

Commit 1836bad

Browse files
committed
fix
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
1 parent 842db72 commit 1836bad

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

nemo_automodel/components/datasets/llm/retrieval_dataset_inline.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -520,9 +520,9 @@ def make_retrieval_dataset(
520520

521521
# Apply same processing as _get_processed_dataset
522522
if data_type == "train":
523+
if do_shuffle:
524+
dataset = dataset.shuffle(seed=seed)
523525
if max_train_samples is not None:
524-
if do_shuffle:
525-
dataset = dataset.shuffle(seed=seed)
526526
dataset = dataset.select(
527527
range(train_data_select_offset, min(train_data_select_offset + max_train_samples, len(dataset)))
528528
)

0 commit comments

Comments
 (0)