-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Closed
Description
Describe the bug
When using load_datasetto load HuggingFaceM4/VQAv2, I am getting FSTimeoutError.
Error
TimeoutError:
The above exception was the direct cause of the following exception:
FSTimeoutError Traceback (most recent call last)
[/usr/local/lib/python3.10/dist-packages/fsspec/asyn.py](https://klh9mr78js-496ff2e9c6d22116-0-colab.googleusercontent.com/outputframe.html?vrz=colab_20240924-060116_RC00_678132060#) in sync(loop, func, timeout, *args, **kwargs)
99 if isinstance(return_result, asyncio.TimeoutError):
100 # suppress asyncio.TimeoutError, raise FSTimeoutError
--> 101 raise FSTimeoutError from return_result
102 elif isinstance(return_result, BaseException):
103 raise return_result
FSTimeoutError:
It usually fails around 5-6 GB.
Steps to reproduce the bug
To reproduce it, run this in colab notebook:
!pip install -q -U datasets
from datasets import load_dataset
ds = load_dataset('HuggingFaceM4/VQAv2', split="train[:10%]")
Expected behavior
It should download properly.
Environment info
Using Colab Notebook.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels