
Commit faee51e

added changes to support mmapped file.
1 parent 52011b4 commit faee51e

2 files changed: 2 additions and 2 deletions

dlio_benchmark/configs/workload/megatron_deepspeed.yaml

Lines changed: 1 addition & 1 deletion
@@ -12,7 +12,7 @@ dataset:
   format: mmap_indexed_binary
   num_files_train: 1
   num_samples_per_file: 277203535
-  record_length: 4096
+  record_length: 1024
 
 reader:
   data_loader: pytorch
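
To see what the `record_length` change means for the generated dataset, the arithmetic can be sketched as below. This is an illustration only: it assumes `record_length` is the number of bytes per sample and that the file holds exactly `num_files_train * num_samples_per_file` samples with no index or header overhead. `dataset_bytes` is a hypothetical helper, not part of dlio_benchmark.

```python
GIB = 2**30  # bytes per GiB


def dataset_bytes(num_files: int, samples_per_file: int, record_length: int) -> int:
    """Total sample payload in bytes (assumes record_length is bytes per sample)."""
    return num_files * samples_per_file * record_length


# Values taken from the hunk above.
before = dataset_bytes(1, 277203535, 4096)
after = dataset_bytes(1, 277203535, 1024)

print(f"record_length=4096: {before} bytes ({before / GIB:.2f} GiB)")
print(f"record_length=1024: {after} bytes ({after / GIB:.2f} GiB)")
```

Under these assumptions the change shrinks the single training file to a quarter of its previous size.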

dlio_benchmark/data_generator/indexed_binary_generator.py

Lines changed: 1 addition & 1 deletion
@@ -55,7 +55,7 @@ def generate(self):
             sample_size = dim1 * dim2
             total_size = sample_size * self.num_samples
             write_size = total_size
-            MEMORY_SIZE = 2*GB
+            MEMORY_SIZE = 16*GB
             if total_size > MEMORY_SIZE:
                 write_size = MEMORY_SIZE - (MEMORY_SIZE % sample_size)
             out_path_spec = self.storage.get_uri(self._file_list[i])
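
The buffer-size logic in this hunk can be tried in isolation with the sketch below. It reproduces only the `write_size` computation shown in the diff. Several things are assumptions: `GB` is taken to be 2**30 (its definition is not shown here), `sample_size` is taken to be 1024 bytes to match the new `record_length`, and the generator is assumed to write the file in successive chunks of `write_size`. `buffer_write_size` and `num_writes` are hypothetical names.

```python
GB = 2**30  # assumption: the diff does not show how GB is defined


def buffer_write_size(sample_size: int, num_samples: int, memory_size: int) -> int:
    """Bytes per write: the whole file if it fits, otherwise the largest
    multiple of sample_size that does not exceed memory_size."""
    total_size = sample_size * num_samples
    write_size = total_size
    if total_size > memory_size:
        write_size = memory_size - (memory_size % sample_size)
    return write_size


def num_writes(sample_size: int, num_samples: int, memory_size: int) -> int:
    """Number of chunks needed, assuming the file is written chunk by chunk."""
    total_size = sample_size * num_samples
    write_size = buffer_write_size(sample_size, num_samples, memory_size)
    return -(-total_size // write_size)  # integer ceiling division


samples = 277203535
print(num_writes(1024, samples, 2 * GB))   # old buffer size
print(num_writes(1024, samples, 16 * GB))  # new buffer size
```

Rounding down to a multiple of `sample_size` keeps every chunk aligned to whole samples, so no sample is split across two writes. With the assumed values, the larger buffer cuts the number of writes from 133 to 17.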
