Skip to content

Commit 1538910

Browse files
tomvdwThe TensorFlow Datasets Authors
authored andcommitted
Prioritize registered datasets over datasets in the datasets folder
If users have their own datasets, they are not in the datasets folder, but they will be part of the legacy dataset provider. PiperOrigin-RevId: 686488188
1 parent 379a55d commit 1538910

File tree

1 file changed

+7
-3
lines changed

1 file changed

+7
-3
lines changed

tensorflow_datasets/core/registered.py

Lines changed: 7 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -300,8 +300,12 @@ def get_builder_cls(self, name: str) -> Type[RegisteredDataset]:
300300
...
301301

302302

303-
class LegacyDatasetBuilderProvider(DatasetBuilderProvider):
304-
"""Provider of dataset builders that are defined in the legacy codebase."""
303+
class RegisteredDatasetBuilderProvider(DatasetBuilderProvider):
304+
"""Provider of dataset builders that are defined by builder classes.
305+
306+
If there's a builder class, then it will be registered because it's a subclass
307+
of RegisteredDataset.
308+
"""
305309

306310
def has_dataset(self, name: str) -> bool:
307311
if name not in _DATASET_REGISTRY:
@@ -373,8 +377,8 @@ def get_builder_cls(self, name: str) -> Type[RegisteredDataset]:
373377

374378
def _get_inital_providers() -> list[DatasetBuilderProvider]:
375379
return [
380+
RegisteredDatasetBuilderProvider(),
376381
SourceDirDatasetBuilderProvider(constants.DATASETS_TFDS_SRC_DIR),
377-
LegacyDatasetBuilderProvider(),
378382
]
379383

380384

0 commit comments

Comments
 (0)