Skip to content

Commit

Permalink
added changes to support mmapped file.
Browse files Browse the repository at this point in the history
  • Loading branch information
hariharan-devarajan committed Nov 28, 2023
1 parent 52011b4 commit faee51e
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion dlio_benchmark/configs/workload/megatron_deepspeed.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@ dataset:
format: mmap_indexed_binary
num_files_train: 1
num_samples_per_file: 277203535
record_length: 4096
record_length: 1024

reader:
data_loader: pytorch
Expand Down
2 changes: 1 addition & 1 deletion dlio_benchmark/data_generator/indexed_binary_generator.py
Original file line number Diff line number Diff line change
Expand Up @@ -55,7 +55,7 @@ def generate(self):
sample_size = dim1 * dim2
total_size = sample_size * self.num_samples
write_size = total_size
MEMORY_SIZE = 2*GB
MEMORY_SIZE = 16*GB
if total_size > MEMORY_SIZE:
write_size = MEMORY_SIZE - (MEMORY_SIZE % sample_size)
out_path_spec = self.storage.get_uri(self._file_list[i])
Expand Down

0 comments on commit faee51e

Please sign in to comment.