[Data] Expose datasources via package exports #60645

Open
400Ping wants to merge 6 commits into ray-project:master from 400Ping:data/export-datasource

400Ping (Contributor) commented Feb 1, 2026

Description

Goal:

Make ray.data._internal.datasource a package entry point with short imports and alphabetized __all__.

Changes:

Add/complete __all__ in datasource/datasink modules and re-export via __init__.py.
Update call sites to import from the package: from ray.data._internal.datasource import ...
Keep private helpers module-scoped to avoid cycles and accidental exposure.
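
The export pattern described above can be sketched as follows. This is a minimal, self-contained illustration and not the code in this PR: the package name demo_datasource, its submodules, the classes, the _parse_row helper, and the extension values are all hypothetical stand-ins, and "alphabetized" is assumed to mean plain sorted() order. The script writes a small package to a temporary directory, imports it through the short path, and checks that the package-level __all__ is sorted and covers every name the submodules export.

```python
import importlib
import sys
import tempfile
import textwrap
from pathlib import Path
from typing import List

# Hypothetical package layout; none of these names come from Ray itself.
FILES = {
    "demo_datasource/__init__.py": """
        from demo_datasource.csv_datasource import CSVDatasource
        from demo_datasource.json_datasource import (
            JSON_FILE_EXTENSIONS,
            JSONDatasource,
        )

        __all__ = [
            "CSVDatasource",
            "JSONDatasource",
            "JSON_FILE_EXTENSIONS",
        ]
    """,
    "demo_datasource/csv_datasource.py": """
        __all__ = ["CSVDatasource"]

        class CSVDatasource:
            pass

        def _parse_row(line):
            # Private helper: stays module-scoped and is not re-exported.
            return line.split(",")
    """,
    "demo_datasource/json_datasource.py": """
        __all__ = ["JSONDatasource", "JSON_FILE_EXTENSIONS"]

        JSON_FILE_EXTENSIONS = ["json", "jsonl"]  # illustrative values

        class JSONDatasource:
            pass
    """,
}


def build_demo_package(root: Path) -> None:
    """Write the hypothetical package files under root."""
    for rel_path, source in FILES.items():
        path = root / rel_path
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_text(textwrap.dedent(source).lstrip())


def check_exports(package_name: str, submodule_names: List[str]) -> List[str]:
    """Return a list of problems found in the package's export surface."""
    package = importlib.import_module(package_name)
    exported = list(package.__all__)
    problems = []
    if exported != sorted(exported):
        problems.append("__all__ is not alphabetized")
    for name in exported:
        if not hasattr(package, name):
            problems.append(f"{name} is listed in __all__ but not defined")
    for submodule_name in submodule_names:
        module = importlib.import_module(f"{package_name}.{submodule_name}")
        for name in getattr(module, "__all__", []):
            if name not in exported:
                problems.append(f"{submodule_name}.{name} is not re-exported")
    return problems


tmp_root = Path(tempfile.mkdtemp())
build_demo_package(tmp_root)
sys.path.insert(0, str(tmp_root))
importlib.invalidate_caches()

# Short import through the package entry point instead of the submodule path.
from demo_datasource import CSVDatasource, JSON_FILE_EXTENSIONS

problems = check_exports("demo_datasource", ["csv_datasource", "json_datasource"])
print(problems)
```

A check along these lines would flag a submodule constant that is listed in its own __all__ but missing from the package-level re-exports, while leaving underscore-prefixed helpers untouched because they never appear in any __all__.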

Related issues

Related to #60204

Additional information

Signed-off-by: 400Ping <jiekaichang@apache.org>
@400Ping 400Ping requested a review from a team as a code owner February 1, 2026 15:23
gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request is a nice refactoring that centralizes datasource imports, making the codebase cleaner and easier to maintain. The changes are consistent with the stated goal of creating a single entry point for datasources. I've identified one minor omission regarding the export of JSON_FILE_EXTENSIONS, which I've detailed in the review comments. Other than that, the changes look solid.

ray-gardener bot added the data (Ray Data-related issues) and community-contribution (Contributed by the community) labels Feb 1, 2026
Signed-off-by: 400Ping <jiekaichang@apache.org>

cursor bot left a comment

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Signed-off-by: 400Ping <jiekaichang@apache.org>

400Ping (Contributor, Author) commented Feb 4, 2026

@bveeramani PTAL

400Ping (Contributor, Author) commented Feb 5, 2026

cc @owenowenisme

Signed-off-by: Jie-Kai Chang <jiekaichang@apache.org>
Signed-off-by: 400Ping <jiekaichang@apache.org>
Labels

community-contribution: Contributed by the community
data: Ray Data-related issues

2 participants