Skip to content

Commit 6547a3c

Browse files
committed
Remove unrelated changes from .gitignore, score_filter.py, and megatron_tokenizer.py
Signed-off-by: asolergi-nv <asolergibert@nvidia.com>
1 parent 9c89d09 commit 6547a3c

File tree

2 files changed

+0
-15
lines changed

2 files changed

+0
-15
lines changed

.gitignore

Lines changed: 0 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -158,17 +158,3 @@ data/
158158

159159
# InternVideo2 dependency (cloned by installation script)
160160
InternVideo/
161-
162-
# UV cache directory
163-
.uv_cache/
164-
165-
# Ray temp directory
166-
.ray_temp/
167-
168-
uv.lock
169-
pyproject.toml
170-
171-
token_test/
172-
*.parquet
173-
*.bin
174-
*.idx

nemo_curator/stages/text/modules/score_filter.py

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -322,7 +322,6 @@ def process(self, batch: DocumentBatch) -> DocumentBatch | None:
322322
DocumentBatch: A batch with the score and filter applied
323323
324324
"""
325-
logger.info(f"We have {batch.num_items} Documents")
326325
df = batch.to_pandas()
327326

328327
if df.empty:

0 commit comments

Comments
 (0)