BUG: Take method of NumpyExtensionArray now returns another extension array with the correct dtype. #62502

aijams · 2025-09-29T17:44:04Z

In an edge case, when called on an integer array and asked to access non-existent entries (which are filled with NaN), the take method of NumpyExtensionArray produces an array whose dtype doesn't match its underlying data.
Specifically, take promotes the underlying data to a floating-point type but doesn't promote the dtype of the extension array to match.
These changes ensure that the result of this method has the correct dtype for its data.
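
To illustrate the promotion described above, here is a minimal sketch using the public `pandas.api.extensions.take` function on a plain NumPy array. It is not the `NumpyExtensionArray.take` code changed in this PR; it only shows why integer data has to be promoted when missing entries are filled with NaN.

```python
import numpy as np
import pandas as pd

data = np.array([10, 20, 30], dtype=np.int64)

# With allow_fill=True, -1 marks a missing entry that is filled with NaN,
# so the int64 data must be promoted to float64 to hold it.
filled = pd.api.extensions.take(data, [0, 2, -1], allow_fill=True)

# With allow_fill=False, -1 is an ordinary negative index (the last element),
# so no promotion is needed and the integer dtype is preserved.
plain = pd.api.extensions.take(data, [0, 2, -1], allow_fill=False)

print(filled.dtype, plain.dtype)  # float64 int64
```

The bug report concerns the case where the wrapping extension array keeps reporting the original integer dtype even though its data has been promoted in this way.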

closes #62448
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.


aijams · 2025-10-02T17:52:40Z

Several tests are currently failing due to a new version of numexpr (see #62545).
All other tests pass.
I hard-coded a list of dtypes in take to check for the integer types that can't store NaN values, since setting the dtype on an extension array after it's created isn't allowed. I have yet to think of another way to correct this issue without modifying the base extension array class to allow its dtype to be modified.
Let me know if you can think of a better approach.
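
For readers following the discussion, one way to test whether a NumPy dtype can store NaN without enumerating dtypes is to inspect `dtype.kind`. This is only a sketch: `can_hold_nan` is a hypothetical helper written for illustration, not a pandas function and not the approach this PR ended up using.

```python
import numpy as np


def can_hold_nan(dtype) -> bool:
    """Hypothetical helper: report whether a NumPy dtype can store NaN directly."""
    kind = np.dtype(dtype).kind
    # 'f' = float, 'c' = complex, 'O' = object can all hold NaN;
    # signed/unsigned integers ('i', 'u') and bool ('b') cannot.
    return kind in "fcO"


for dt in (np.int8, np.uint16, np.int64, np.bool_, np.float32, object):
    print(np.dtype(dt).name, can_hold_nan(dt))
```

The kind check covers every integer width at once, which is the part a hard-coded list has to spell out by hand.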


jbrockmendel · 2025-11-03T16:40:43Z

pandas/tests/arrays/numpy_/test_numpy.py

+        np.int64,
+    ],
+)
+def test_take_assigns_integer_dtype_when_fill_disallowed(dtype):


This test is unnecessary.

aijams · 2025-11-04

I added this test to make sure that take does the expected thing when it is not allowed to fill missing values. I did a quick search but didn't find any other tests for this. Other tests exercise this method, but I would like a test explicitly for this behavior, both to catch regressions and to make it clear that take should preserve the dtype when it can. If you still think this test isn't needed, can you explain why?

jbrockmendel · 2025-11-04

This is tested in the extension tests.

jbrockmendel · 2025-11-10

@aijams if you address this we can get this merged.

aijams · 2025-11-11

I found a test in BaseGetitemTests that covers the same case I tested, except for the case of boolean data. However, those tests didn't run under pytest when I ran it. I removed my test, but I still think the boolean case should be tested.
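
As a rough illustration of why the boolean case behaves differently from the integer one, the sketch below again uses the public `pandas.api.extensions.take` on a plain NumPy array rather than the extension array method under review. A boolean array has no floating-point representation that keeps its values as booleans, so filling with NaN falls back to object dtype, while the no-fill path keeps the boolean dtype.

```python
import numpy as np
import pandas as pd

flags = np.array([True, False, True])

# Filling a missing entry (-1) with NaN forces the result to object dtype,
# since a bool array cannot store NaN.
filled = pd.api.extensions.take(flags, [0, -1], allow_fill=True, fill_value=np.nan)

# Without filling, -1 is a regular negative index and the bool dtype is kept.
plain = pd.api.extensions.take(flags, [0, -1], allow_fill=False)

print(filled.dtype, plain.dtype)
```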


jbrockmendel · 2025-11-16T18:39:53Z

thanks @aijams


aijams added 2 commits September 29, 2025 13:25

Added test to check the dtype of the result of take method. Modified take method to return correct dtype.

7a72a06

Added bug info to docs.

65511cd

aijams marked this pull request as draft September 29, 2025 18:11

rhshadrach added the Bug, ExtensionArray (Extending pandas with custom dtypes or arrays), and Algos (Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff) labels Sep 30, 2025

aijams added 2 commits October 2, 2025 10:48

Added conditional to handle integer arrays in take method as special case.

42898c6

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

333ba45

aijams added 2 commits October 6, 2025 14:10

Added cases for smaller integer types to mixins take function.

3ca2e84

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

b6e45ac

aijams marked this pull request as ready for review October 7, 2025 16:33

aijams added 2 commits October 7, 2025 12:34

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

e0ff0d0

Added note to take method.

eeb3d85

jbrockmendel reviewed Oct 9, 2025

pandas/core/arrays/_mixins.py (outdated, resolved)

jbrockmendel reviewed Oct 9, 2025

pandas/tests/arrays/numpy_/test_numpy.py (outdated, resolved)

aijams added 5 commits October 14, 2025 13:46

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

cd2f2aa

Added link to issue in test for take method.

468de3f

Moved changes to take method to NumpyExtensionArray class.

91442a8

Cleaned up work comments.

aa3c228

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

8bacb94

jbrockmendel reviewed Oct 16, 2025

pandas/core/arrays/numpy_.py (outdated, resolved)

jbrockmendel reviewed Oct 16, 2025

pandas/tests/arrays/numpy_/test_numpy.py (outdated, resolved)

aijams added 7 commits October 17, 2025 11:45

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

56a329a

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

040c127

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

1bdb1f8

Tests for take method check against object dtype for boolean inputs.

3e5836c

Removed TODO comment.

b4258f2

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

e4c3a5b

Updated references to numpy.bool to include underscore.

acdfb62

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

fd56024

aijams mentioned this pull request Oct 24, 2025

CI: Pyodide checks failing with NotImplemented error #62820

Closed

aijams added 2 commits October 28, 2025 10:39

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

8dcf8b2

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

f0886de

jbrockmendel reviewed Nov 3, 2025

pandas/core/arrays/numpy_.py (outdated, resolved)

jbrockmendel reviewed Nov 3, 2025

aijams added 4 commits November 4, 2025 11:12

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

c174833

Take method now gets underlying array of result to build a new extension array with.

aa1c58b

Removed dtype preservation tests for take, except for boolean case.

79c3116

Merge remote-tracking branch 'upstream/main' into aijams-take-function-invalid-dtype

3edbc31

jbrockmendel merged commit 2d73d62 into pandas-dev:main Nov 16, 2025
41 checks passed

rustamali9183 pushed a commit to rustamali9183/pandas that referenced this pull request Nov 17, 2025

BUG: Take method of NumpyExtensionArray now returns another extension array with the correct dtype. (pandas-dev#62502)

2f39c1b

mittal-aakriti pushed a commit to mittal-aakriti/pandas that referenced this pull request Nov 19, 2025

BUG: Take method of NumpyExtensionArray now returns another extension array with the correct dtype. (pandas-dev#62502)

5b11b68

mittal-aakriti pushed a commit to mittal-aakriti/pandas that referenced this pull request Nov 19, 2025

BUG: Take method of NumpyExtensionArray now returns another extension array with the correct dtype. (pandas-dev#62502)

93bad06
