Skip to content
This repository was archived by the owner on Jan 15, 2024. It is now read-only.

Commit ded0f99

Browse files
authored
fix prepare_openwebtext (#1289)
1 parent 20af58f commit ded0f99

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

scripts/datasets/pretrain_corpus/prepare_openwebtext.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -57,7 +57,7 @@ def extract_files(full_name, output_dir, shuffle=False):
5757
with tarfile.open(full_name) as t:
5858
txt_names = t.getnames()
5959
if shuffle:
60-
txt_names = random.shuffle(txt_names)
60+
random.shuffle(txt_names)
6161
for txt_name in txt_names:
6262
f = t.extractfile(txt_name)
6363
for line in f.readlines():

0 commit comments

Comments
 (0)