@Veneel77 Veneel77 commented Jun 16, 2025

What does this PR do?

Replaces manual deduplication logic in the Python parser with the dedup_names utility from pandas.io.common.
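For readers unfamiliar with the renaming involved, here is a minimal pure-Python sketch. It assumes the usual pandas convention of appending `.1`, `.2`, ... to repeated column names. The function name `mangle_dupes_sketch` is hypothetical, and this is an illustration only, not the implementation of `dedup_names`.

```python
from collections import defaultdict
from typing import Hashable, Sequence


def mangle_dupes_sketch(names: Sequence[Hashable]) -> list:
    """Rename repeated names as name, name.1, name.2, ... (illustrative only)."""
    result = list(names)
    # How many times each (possibly already renamed) name has been seen so far.
    counts = defaultdict(int)
    for i, col in enumerate(result):
        cur_count = counts[col]
        # Keep appending a suffix until the candidate name is unused.
        while cur_count > 0:
            counts[col] = cur_count + 1
            col = f"{col}.{cur_count}"
            cur_count = counts[col]
        result[i] = col
        counts[col] = cur_count + 1
    return result


print(mangle_dupes_sketch(["a", "a", "b", "a"]))
```

The loop re-checks each candidate name, so a generated name such as `a.1` cannot collide with a column that already carries that name.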

Why is this important?


Checklist

@Veneel77
Author

Hi maintainers 👋

This PR replaces manual deduplication logic with the shared dedup_names utility. Parser-related tests pass as expected ✅

Some CI checks failed due to network-based tests (test_network.py, test_url, etc.), which appear unrelated to my change and are known to be flaky or environment-dependent.

Please let me know if you’d like anything else adjusted. Thanks!

@jbrockmendel
Member

Can you merge main and see if the CI passes?

self.dtype.update({col: self.dtype.get(old_col)})
this_columns[i] = col
counts[col] = cur_count + 1
from pandas.io.common import dedup_names
Member

Can this import go at the top of the file?

else:
cur_count = counts[col]

if (
Member

looks like this chunk isn't present in dedup_names?
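One way the dtype carry-over could be kept alongside a shared deduplication helper is to compare the names before and after renaming. This is a rough sketch, not the parser's actual code. The helper name `carry_over_dtypes` is hypothetical, and because the `if (` condition in the diff context above is truncated, the guard used here is an assumption.

```python
from typing import Hashable, Sequence


def carry_over_dtypes(
    original: Sequence[Hashable],
    deduped: Sequence[Hashable],
    dtype: dict,
) -> dict:
    """Return a copy of dtype where renamed columns inherit the old column's entry."""
    updated = dict(dtype)
    for old_col, col in zip(original, deduped):
        if col == old_col:
            continue  # name was not changed by deduplication
        old_dtype = dtype.get(old_col)
        # Assumed guard: never overwrite a dtype already given for the new name.
        if old_dtype is not None and col not in dtype:
            updated[col] = old_dtype
    return updated


print(carry_over_dtypes(["a", "a", "b"], ["a", "a.1", "b"], {"a": "int64"}))
```

Returning a new dict rather than mutating the input keeps the sketch free of side effects. The original lines update `self.dtype` in place.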

@Veneel77
Author

Veneel77 commented Jul 16, 2025 via email

Contributor

This pull request is stale because it has been open for thirty days with no activity. Please update and respond to this comment if you're still interested in working on this.

@github-actions github-actions bot added the Stale label Aug 16, 2025
@Veneel77
Author

Veneel77 commented Aug 21, 2025 via email

@jbrockmendel
Member

Closing to clear the queue. Can re-open when you're ready to continue here.

@Veneel77
Author

Veneel77 commented Oct 13, 2025 via email

@jbrockmendel jbrockmendel reopened this Oct 13, 2025

Development

Successfully merging this pull request may close these issues.

CLN: Use dedup_names in all instances where duplicate column names are renamed
