-
Notifications
You must be signed in to change notification settings - Fork 29
Open
Description
Hello!
We are trying out scan_fastq and read_fastq atm and the problem is it crashes on files coming from basespace.
It does work when we use our own compressed files.
8192rows [00:00, 13066.77rows/s]
Traceback (most recent call last):
File "/home/ubuntu/working_dir/notebooks/pipeline_dev/COMB-243/test.py", line 7, in <module>
print(df.collect())
~~~~~~~~~~^^
File "/home/ubuntu/miniforge3/envs/dev/lib/python3.13/site-packages/polars/_utils/deprecation.py", line 97, in wrapper
return function(*args, **kwargs)
File "/home/ubuntu/miniforge3/envs/dev/lib/python3.13/site-packages/polars/lazyframe/opt_flags.py", line 330, in wrapper
return function(*args, **kwargs)
File "/home/ubuntu/miniforge3/envs/dev/lib/python3.13/site-packages/polars/lazyframe/frame.py", line 2407, in collect
return wrap_df(ldf.collect(engine, callback))
~~~~~~~~~~~^^^^^^^^^^^^^^^^^^
polars.exceptions.ComputeError: caught exception during execution of a Python source, exception: Exception: DataFusion error: IoError(Kind(UnexpectedEof))
The script basically reads a file (tested s3 and locally).
Any idea of how to confirm this suspicion?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels