How can I string-split a string field into a list field in Arrow Datafusion? #4605
Unanswered
WijnandOntoforce
asked this question in
Q&A
Replies: 1 comment
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I have a CSV-file people.csv which looks like this:
I can read the file like this:
This results in the field 'hobbies' being a single string (Utf8) field. However, its values are comma-separated strings. I would like to transform that field into a "list-of-string" field by splitting the strings on comma.
Is this possible in DataFusion? I know that Arrow has list-type, but I'm not sure whether DataFusion supports or allows this.
(I'm using the dataframe API, not SQL)
Beta Was this translation helpful? Give feedback.
All reactions