-
Notifications
You must be signed in to change notification settings - Fork 3k
Closed
Description
Describe the bug
I'm getting the following exception:
lib/python3.10/zipfile.py:1353 in _RealGetContents β
β β
β 1350 β β # self.start_dir: Position of start of central directory β
β 1351 β β self.start_dir = offset_cd + concat β
β 1352 β β if self.start_dir < 0: β
β β± 1353 β β β raise BadZipFile("Bad offset for central directory") β
β 1354 β β fp.seek(self.start_dir, 0) β
β 1355 β β data = fp.read(size_cd) β
β 1356 β β fp = io.BytesIO(data) β
β°βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ―
BadZipFile: Bad offset for central directory
Extracting data files: 35%|ββββββββββββββββββ | 38572/110812 [00:10<00:20, 3576.26it/s]
Steps to reproduce the bug
load_dataset(
args.dataset_name,
args.dataset_config_name,
cache_dir=args.cache_dir,
),
Expected behavior
loads the dataset
Environment info
datasets==2.8.0
Python 3.10.8
Linux 129-146-3-202 5.15.0-52-generic #58~20.04.1-Ubuntu SMP Thu Oct 13 13:09:46 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
Metadata
Metadata
Assignees
Labels
No labels