Skip to content

Commit 42109b9

Browse files
committed
Turning off shuffling for datasets to decrease initialization time
1 parent 489a775 commit 42109b9

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

axlearn/experiments/text/gpt/common.py

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -504,7 +504,8 @@ def mixture_train_input_source(
504504
config_for_function(input_tf_data.tfds_dataset).set(
505505
dataset_name=component.name,
506506
split=component.split,
507-
train_shuffle_buffer_size=64 * component.shuffle_buffer_size,
507+
# train_shuffle_buffer_size=64 * component.shuffle_buffer_size,
508+
train_shuffle_buffer_size=0,
508509
read_config=tfds_read_config(),
509510
)
510511
)

0 commit comments

Comments
 (0)