MAINT: Prepare patsy for pandas3 StringDtype #229

bashtage · 2025-07-22T11:03:01Z

Adds support for StringDtype
Fixes tests that are not valid with copy-on-write

Copilot

Pull Request Overview

This PR prepares the patsy library for pandas 3 compatibility by adding support for StringDtype and fixing tests that are incompatible with pandas 3's copy-on-write behavior. The changes ensure the library continues to work with newer pandas versions while maintaining backward compatibility.

Adds detection logic for pandas 3 and StringDtype support
Updates test assertions to account for copy-on-write behavior in pandas 3
Extends dtype checking functions to handle StringDtype alongside existing categorical dtype support
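For readers following along, the detection logic described above could look roughly like the sketch below. This is not the code from the PR: the names `is_pandas_string_dtype` and `pandas_major_version` are hypothetical, and the only assumption carried over from patsy is that pandas is an optional import.

```python
# Hypothetical sketch; the actual helpers in patsy/util.py are not shown here.
try:
    import pandas as pd
except ImportError:  # treat pandas as an optional dependency
    pd = None


def is_pandas_string_dtype(dt):
    """Return True only if dt is an instance of pandas' StringDtype."""
    # Older pandas releases have no StringDtype at all, so check first.
    if pd is None or not hasattr(pd, "StringDtype"):
        return False
    return isinstance(dt, pd.StringDtype)


def pandas_major_version():
    """Return the pandas major version as an int, or None without pandas."""
    if pd is None:
        return None
    # "3.0.0.dev0+123" -> "3"; only the leading component is needed.
    return int(pd.__version__.split(".")[0])
```

Guarding on `hasattr` keeps the check safe on pandas versions that predate `StringDtype`, which is one way to stay backward compatible.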

matthewwardrop · 2025-07-23T04:04:30Z

patsy/missing.py

@@ -183,7 +183,7 @@ def _handle_NA_drop(self, values, is_NAs, origins):
            total_mask |= is_NA
        good_mask = ~total_mask
        # "..." to handle 1- versus 2-dim indexing
-        return [v[good_mask, ...] for v in values]
+        return [v[good_mask] if v.ndim == 1 else v[good_mask, ...] for v in values]


Is this due to upstream indexing changes? It would be kind of annoying if "..." (Ellipsis) no longer supports "zero" expansion, i.e. expanding to no additional dimensions.

Yes, it appears that indexing is now stricter and no longer supports this expansion. I only found this by running against the statsmodels test suite.
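The 1-d versus n-d branching in the hunk above can be tried in isolation with the sketch below. It uses plain NumPy arrays, where both indexing forms give the same result, so it only illustrates the shape handling and does not reproduce the stricter behavior reported in this thread. The function name `drop_flagged_rows` and the sample data are made up for illustration.

```python
import numpy as np


def drop_flagged_rows(values, is_NAs):
    """Drop each row flagged in any mask from every array in values."""
    total_mask = np.zeros(is_NAs[0].shape[0], dtype=bool)
    for is_NA in is_NAs:
        total_mask |= is_NA
    good_mask = ~total_mask
    # Index 1-d inputs with the mask alone; append "..." only when there
    # are trailing dimensions for it to expand over.
    return [v[good_mask] if v.ndim == 1 else v[good_mask, ...] for v in values]


y = np.array([1.0, np.nan, 3.0])
X = np.array([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
kept_y, kept_X = drop_flagged_rows(
    [y, X], [np.isnan(y), np.isnan(X).any(axis=1)]
)
```

The middle row is dropped from both `y` and `X` because `y` is missing there, and each output keeps its original number of dimensions.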

matthewwardrop · 2025-07-23T04:10:15Z

patsy/util.py

@@ -799,7 +820,7 @@ def test_safe_is_pandas_categorical():
 #   https://github.com/pydata/pandas/issues/9581
 #   https://github.com/pydata/pandas/issues/9581#issuecomment-77099564
 def safe_issubdtype(dt1, dt2):
-    if safe_is_pandas_categorical_dtype(dt1):
+    if safe_is_pandas_categorical_dtype(dt1) or safe_is_pandas_string_dtype(dt1):


Hmmm... aren't there other places where string dtypes should be treated as categorical? I'll need to take a look, since I haven't looked at the patsy code for a while.
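To make the intent of the changed condition concrete, here is a rough sketch of a guarded subdtype check. It assumes the guarded branch returns False, which the hunk above does not show. `_is_pandas_extension_like` and `issubdtype_guarded` are hypothetical stand-ins, not patsy's `safe_is_pandas_categorical_dtype`, `safe_is_pandas_string_dtype`, or `safe_issubdtype`.

```python
import numpy as np

try:
    import pandas as pd
except ImportError:  # treat pandas as an optional dependency
    pd = None


def _is_pandas_extension_like(dt):
    """Hypothetical stand-in for the categorical and string dtype checks."""
    if pd is None:
        return False
    kinds = tuple(
        t
        for t in (
            getattr(pd, "CategoricalDtype", None),
            getattr(pd, "StringDtype", None),
        )
        if t is not None
    )
    return isinstance(dt, kinds)


def issubdtype_guarded(dt1, dt2):
    """Like np.issubdtype, but answer False for pandas-only dtypes."""
    # Assumption: pandas extension dtypes are never NumPy subdtypes, so
    # short-circuit before NumPy is asked to interpret them.
    if _is_pandas_extension_like(dt1):
        return False
    return np.issubdtype(dt1, dt2)
```

With this shape, callers such as `issubdtype_guarded(series.dtype, np.number)` get a plain False for categorical or string columns instead of handing NumPy a dtype it does not understand.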

MAINT: Prepare patsy for pandas3 StringDtype

6a1e812

Adds support for StringDtype Fixes tests that are not valid with copy-on-write

bashtage force-pushed the pandas-3-support branch from 4001d9e to 6a1e812 Compare July 22, 2025 11:11

bashtage requested review from Copilot and matthewwardrop and removed request for Copilot July 22, 2025 11:20

bashtage force-pushed the pandas-3-support branch from 2ceaeab to f282fae Compare July 22, 2025 11:25

bashtage requested a review from Copilot July 22, 2025 11:37

Copilot AI reviewed Jul 22, 2025

CLN: Fix copilot identified issues

a964dab

bashtage force-pushed the pandas-3-support branch from f94b74d to a964dab Compare July 22, 2025 11:44

MAINT: Fix issue with 1d indexing

f15e36c

Special 1 and 2 d cases when indexing for pandas 3 support

bashtage force-pushed the pandas-3-support branch from c15476b to f15e36c Compare July 22, 2025 15:21

[pre-commit.ci] auto fixes from pre-commit.com hooks

cb2f01c

for more information, see https://pre-commit.ci

matthewwardrop reviewed Jul 23, 2025
