BUG: fix writing some non-string object columns with arrow by theroggy · Pull Request #630 · geopandas/pyogrio

theroggy · 2026-01-19T12:49:02Z

In write_dataframe with use_arrow=False, object dtype columns are implicitly serialised to string to be able to write them to the output file.

With use_arrow=True, the pyarrow to_table function handles some datatypes in object columns (e.g. lists), but for other cases such columns raise an error. For datatypes that pyarrow does not support, a good default behaviour would be to convert them to string, as is done without arrow.

This PR explicitly converts to string those object columns that pyarrow cannot interpret.

resolves #631


pyogrio/geopandas.py


jorisvandenbossche · 2026-02-23T20:50:32Z

pyogrio/tests/test_geopandas_io.py

+    # Verify that object_col is actually inferred as object dtype for this test.
+    str_dtype = (
+        "str"
+        if PANDAS_GE_30 or (PANDAS_GE_23 and pd.options.future.infer_string)
+        else "object"


Instead of verifying this here, I would specify dtype=object in the construction above to ensure the input data always uses object dtype (also for pure strings).

Strings stored with the str dtype are already covered by other tests (and do not go through the code path you changed), so I would have this test focus on object dtype only (and we should cover the case of all strings as object dtype as well anyway).
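A small sketch of what that suggestion amounts to, assuming the test data is built with `pd.Series` (the variable names here are illustrative and not taken from the test file). Passing `dtype=object` explicitly fixes the dtype regardless of the pandas version or the `future.infer_string` option, so no version-dependent check is needed afterwards.

```python
import pandas as pd

# Without an explicit dtype, pure strings may be inferred as "str" or "object"
# depending on the pandas version and the future.infer_string option.
inferred = pd.Series(["a", "b"])

# With dtype=object the column is object dtype on every pandas version.
all_strings = pd.Series(["a", "b"], dtype=object)
mixed_values = pd.Series([1, "a"], dtype=object)
```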

jorisvandenbossche · 2026-02-23T20:52:32Z

pyogrio/tests/test_geopandas_io.py

+            ]
+        elif isinstance(object_col_data[0], bytes):
+            # byte objects are read back as byte objects with arrow
+            expected_dtype = "object"


Maybe you can move the expected_dtype into the parametrization as an additional value to avoid most of this whole if/else block?

The expected dtype as well as the expected data depends on use_arrow as well as the pandas version, so I don't think it will become more readable that way...


theroggy added 2 commits January 19, 2026 13:42

ENH: in write_dataframe, convert object columns to string for use_arrow=True

ce12356

Improve tests

5f07b4e

theroggy changed the title ~~ENH: in write_dataframe, convert object columns to string with arrow~~ ENH: improve support of writing object columns with arrow Jan 19, 2026

theroggy self-assigned this Jan 19, 2026

theroggy changed the title ~~ENH: improve support of writing object columns with arrow~~ BUG: fix writing non-string object columns with arrow Jan 19, 2026

theroggy added 10 commits January 19, 2026 18:41

Fix for lists

81bc475

Remove mixed not supported test

420741c

Fix tests

d1204ee

Update geopandas.py

a6354f9

Fix test for older pandas versions

23e2391

Fix test for pandas 3

53d302c

Update test_geopandas_io.py

87ce012

Update test_geopandas_io.py

e2bfc4c

Fixes to tests for pandas 3

4f12245

Update test_geopandas_io.py

62fa8db

theroggy marked this pull request as ready for review January 19, 2026 23:31

Update CHANGES.md

32d4815

jorisvandenbossche reviewed Jan 29, 2026

pyogrio/geopandas.py (two review comments, outdated and resolved)

theroggy added 8 commits February 18, 2026 15:51

Move df.copy so it is always applied to avoid passed in df to be changed

bc65ad8

Convert all object columns that cannot be handled by pyarrow to str

a214688

Merge remote-tracking branch 'upstream/main' into ENH-also-try-to-convert-object-columns-to-string-for-use_arrow=True

d3e8d31

Update geopandas.py

6c51aa0

Update geopandas.py

9e3219c

Update geopandas.py

f22ae15

Update geopandas.py

692b357

Update geopandas.py

eda9233

theroggy closed this Feb 19, 2026

theroggy reopened this Feb 19, 2026

theroggy added this to the 0.12.2 milestone Feb 19, 2026

theroggy requested a review from jorisvandenbossche February 19, 2026 09:08

theroggy changed the title ~~BUG: fix writing non-string object columns with arrow~~ BUG: fix writing some non-string object columns with arrow Feb 20, 2026

jorisvandenbossche reviewed Feb 23, 2026


theroggy added 5 commits March 4, 2026 11:06

Always write object dtype in test

f25f4d0

Update test_geopandas_io.py

35d43c9

Merge remote-tracking branch 'upstream/main' into ENH-also-try-to-convert-object-columns-to-string-for-use_arrow=True

d12c0a9

Fix tests for old pandas versions

558867a

Another fix to test to support old pandas versions

230557a

theroggy requested a review from jorisvandenbossche March 5, 2026 07:46

theroggy wants to merge 26 commits into geopandas:main from theroggy:ENH-also-try-to-convert-object-columns-to-string-for-use_arrow=True

2 participants
