|
| 1 | +.. _whatsnew_231: |
| 2 | + |
| 3 | +What's new in 2.3.1 (Month XX, 2025) |
| 4 | +------------------------------------ |
| 5 | + |
| 6 | +These are the changes in pandas 2.3.1. See :ref:`release` for a full changelog |
| 7 | +including other versions of pandas. |
| 8 | + |
| 9 | +{{ header }} |
| 10 | + |
| 11 | +.. --------------------------------------------------------------------------- |
| 12 | +.. _whatsnew_231.string_fixes: |
| 13 | + |
| 14 | +Improvements and fixes for the StringDtype |
| 15 | +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ |
| 16 | + |
| 17 | +.. _whatsnew_231.string_fixes.string_comparisons: |
| 18 | + |
| 19 | +Comparisons between different string dtypes |
| 20 | +^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
| 21 | + |
| 22 | +In previous versions, comparing :class:`Series` of different string dtypes (e.g. ``pd.StringDtype("pyarrow", na_value=pd.NA)`` against ``pd.StringDtype("python", na_value=np.nan)``) would result in inconsistent resulting dtype or incorrectly raise. pandas will now use the hierarchy |
| 23 | + |
| 24 | + object < (python, NaN) < (pyarrow, NaN) < (python, NA) < (pyarrow, NA) |
| 25 | + |
| 26 | +in determining the result dtype when there are different string dtypes compared. Some examples: |
| 27 | + |
| 28 | +- When ``pd.StringDtype("pyarrow", na_value=pd.NA)`` is compared against any other string dtype, the result will always be ``boolean[pyarrow]``. |
| 29 | +- When ``pd.StringDtype("python", na_value=pd.NA)`` is compared against ``pd.StringDtype("pyarrow", na_value=np.nan)``, the result will be ``boolean``, the NumPy-backed nullable extension array. |
| 30 | +- When ``pd.StringDtype("python", na_value=pd.NA)`` is compared against ``pd.StringDtype("python", na_value=np.nan)``, the result will be ``boolean``, the NumPy-backed nullable extension array. |
| 31 | + |
| 32 | +.. _whatsnew_231.string_fixes.ignore_empty: |
| 33 | + |
| 34 | +Index set operations ignore empty RangeIndex and object dtype Index |
| 35 | +^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
| 36 | + |
| 37 | +When enabling the ``future.infer_string`` option, :class:`Index` set operations (like |
| 38 | +union or intersection) will now ignore the dtype of an empty :class:`RangeIndex` or |
| 39 | +empty :class:`Index` with ``object`` dtype when determining the dtype of the resulting |
| 40 | +Index (:issue:`60797`). |
| 41 | + |
| 42 | +This ensures that combining such empty Index with strings will infer the string dtype |
| 43 | +correctly, rather than defaulting to ``object`` dtype. For example: |
| 44 | + |
| 45 | +.. code-block:: python |
| 46 | +
|
| 47 | + >>> pd.options.mode.infer_string = True |
| 48 | + >>> df = pd.DataFrame() |
| 49 | + >>> df.columns.dtype |
| 50 | + dtype('int64') # default RangeIndex for empty columns |
| 51 | + >>> df["a"] = [1, 2, 3] |
| 52 | + >>> df.columns.dtype |
| 53 | + <StringDtype(na_value=nan)> # new columns use string dtype instead of object dtype |
| 54 | +
|
| 55 | +.. _whatsnew_231.string_fixes.bugs: |
| 56 | + |
| 57 | +Bug fixes |
| 58 | +^^^^^^^^^ |
| 59 | +- Bug in :meth:`.DataFrameGroupBy.min`, :meth:`.DataFrameGroupBy.max`, :meth:`.Resampler.min`, :meth:`.Resampler.max` where all NA values of string dtype would return float instead of string dtype (:issue:`60810`) |
| 60 | +- Bug in :meth:`DataFrame.sum` with ``axis=1``, :meth:`.DataFrameGroupBy.sum` or :meth:`.SeriesGroupBy.sum` with ``skipna=True``, and :meth:`.Resampler.sum` with all NA values of :class:`StringDtype` resulted in ``0`` instead of the empty string ``""`` (:issue:`60229`) |
| 61 | +- Fixed bug in :meth:`DataFrame.explode` and :meth:`Series.explode` where methods would fail with ``dtype="str"`` (:issue:`61623`) |
| 62 | + |
| 63 | + |
| 64 | +.. _whatsnew_231.regressions: |
| 65 | + |
| 66 | +Fixed regressions |
| 67 | +~~~~~~~~~~~~~~~~~ |
| 68 | +- |
| 69 | + |
| 70 | +.. --------------------------------------------------------------------------- |
| 71 | +.. _whatsnew_231.bug_fixes: |
| 72 | + |
| 73 | +Bug fixes |
| 74 | +~~~~~~~~~ |
| 75 | +- |
| 76 | + |
| 77 | +.. --------------------------------------------------------------------------- |
| 78 | +.. _whatsnew_231.other: |
| 79 | + |
| 80 | +Other |
| 81 | +~~~~~ |
| 82 | +- |
| 83 | + |
| 84 | +.. --------------------------------------------------------------------------- |
| 85 | +.. _whatsnew_231.contributors: |
| 86 | + |
| 87 | +Contributors |
| 88 | +~~~~~~~~~~~~ |
0 commit comments