Skip to content

Commit c35bab0

Browse files
committed
dask force use
Signed-off-by: George Zelenfroind <gzelenfroind@nvidia.com>
1 parent 79e8da0 commit c35bab0

File tree

2 files changed

+5
-5
lines changed

2 files changed

+5
-5
lines changed

sdp/processors/base_processor.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -82,7 +82,7 @@ def test(self):
8282
There are not tests by default.
8383
"""
8484

85-
class DaskParallelProcessor(BaseProcessor):
85+
class BaseParallelProcessor(BaseProcessor):
8686
"""
8787
Processor class which allows operations on each entry to be parallelized using Dask.
8888
@@ -251,7 +251,7 @@ def finalize(self, metrics: List[Any]):
251251

252252

253253

254-
class BaseParallelProcessor(BaseProcessor):
254+
class LegacyParallelProcessor(BaseProcessor):
255255
"""Processor class which allows operations on each utterance to be parallelized.
256256
257257
Parallelization is done using ``tqdm.contrib.concurrent.process_map`` inside

sdp/processors/modify_manifest/common.py

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@
2424
BaseParallelProcessor,
2525
BaseProcessor,
2626
DataEntry,
27-
DaskParallelProcessor,
27+
LegacyParallelProcessor,
2828
)
2929
from sdp.utils.common import load_manifest
3030

@@ -99,9 +99,9 @@ def process_dataset_entry(self, data_entry: Dict):
9999
return [DataEntry(data=data_entry)]
100100

101101

102-
class AddConstantFields(DaskParallelProcessor):
102+
class AddConstantFields(BaseParallelProcessor):
103103
"""
104-
This processor adds constant fields to all manifest entries using DaskParallelProcessor.
104+
This processor adds constant fields to all manifest entries using Dask BaseParallelProcessor.
105105
It is useful when you want to attach fixed information (e.g., a language label or metadata)
106106
to each entry for downstream tasks such as language identification model training.
107107

0 commit comments

Comments
 (0)