1 file changed: +7 -6 lines changed

@@ -96,14 +96,15 @@ is intended to become the default in pandas 3.0).

To be able to move forward with a string data type in pandas 3.0, this PDEP proposes:

- 1. For pandas 3.0, a `"str"` string dtype is enabled by default, which will use PyArrow
- if installed, and otherwise falls back to an in-house functionally-equivalent
- (but slower) version.
+ 1. For pandas 3.0, a `"str"` string dtype is enabled by default, i.e. this
+ string dtype will be used as the default dtype for text data when creating
+ pandas objects (e.g. inference in constructors, I/O functions).
2. This default string dtype will follow the same behaviour for missing values
as other default data types, and use `NaN` as the missing value sentinel.
- 3. The version that is not backed by PyArrow can reuse (with minor code
- additions) the existing numpy object-dtype backed StringArray for its
- implementation.
+ 3. The string dtype will use PyArrow if installed, and otherwise falls back to
+ an in-house functionally-equivalent (but slower) version. This fallback can
+ reuse (with minor code additions) the existing numpy object-dtype backed
+ StringArray for its implementation.
4. Installation guidelines are updated to clearly encourage users to install
pyarrow for the default user experience.
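
The "use PyArrow if installed, otherwise fall back" rule in item 3 could be sketched roughly as follows. This is only an illustration using the standard library, not the pandas implementation: the helper name `pick_string_storage` and its `is_installed` parameter are hypothetical, and the return values `"pyarrow"` and `"python"` are assumed labels for the two backends.

```python
import importlib.util


def pick_string_storage(is_installed=None):
    """Return "pyarrow" when PyArrow is importable, else "python".

    Hypothetical helper for illustration only; not part of the pandas API.
    """
    if is_installed is None:
        # find_spec returns None when the top-level module cannot be located,
        # so this checks availability without importing pyarrow itself.
        is_installed = importlib.util.find_spec("pyarrow") is not None
    # Assumed labels: "pyarrow" for the Arrow-backed version,
    # "python" for the object-dtype backed fallback.
    return "pyarrow" if is_installed else "python"


# Which backend would be chosen in the current environment.
print(pick_string_storage())
```

Passing `is_installed` explicitly makes both branches easy to exercise without changing the environment, which is the only reason the parameter exists in this sketch.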