Skip to content

Actions: NVIDIA-NeMo/Curator

Actions

Create PR to main with cherry-pick from release

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
405 workflow runs
405 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Shuffler modules restructure (#1003)
Create PR to main with cherry-pick from release #245: Commit 0558194 pushed by praateekmahajan
10s main
Wip ab/final metrics (#996)
Create PR to main with cherry-pick from release #244: Commit 76c3349 pushed by abhinavg4
13s main
Exact deduplication api (#965)
Create PR to main with cherry-pick from release #243: Commit 008060b pushed by sarahyurick
11s main
Fix minor bugs in fuzzy workflow (#999)
Create PR to main with cherry-pick from release #242: Commit f1067d4 pushed by sarahyurick
8s main
Adding ray port range to the free port checker (#997)
Create PR to main with cherry-pick from release #241: Commit 8ccd7d4 pushed by sarahyurick
9s main
Pin transformers (#998)
Create PR to main with cherry-pick from release #240: Commit 4264120 pushed by ayushdg
10s main
[Tutorials] Port the TinyStories and PEFT curation tutorials to Ray A…
Create PR to main with cherry-pick from release #239: Commit 4d79815 pushed by Maghoumi
12s main
Add SpeechBatch task, InferenceAsrNemoStage, GetPairwiseWerStage in F…
Create PR to main with cherry-pick from release #238: Commit 3a075ba pushed by sarahyurick
12s main
Bug fix in removal write kwargs + add input_task_limit in removal (#995)
Create PR to main with cherry-pick from release #237: Commit f60d1ed pushed by sarahyurick
16s main
feat: Add internvideo2 multi modality as a submodule (#992)
Create PR to main with cherry-pick from release #236: Commit f3b8e00 pushed by thomasdhc
9s main
Ray Data should set num_blocks=len(tasks) #994
Create PR to main with cherry-pick from release #235: Commit 4603256 pushed by praateekmahajan
9s main
Bug fix in removal workflow (#993)
Create PR to main with cherry-pick from release #234: Commit 22be54d pushed by praateekmahajan
12s main
Minor changes for Dedup / IO consistency (#983)
Create PR to main with cherry-pick from release #233: Commit ea95158 pushed by praateekmahajan
11s main
ci: Re-enable gpu testing (#990)
Create PR to main with cherry-pick from release #232: Commit 8ef8439 pushed by thomasdhc
12s main
replace -1 with None (#977)
Create PR to main with cherry-pick from release #231: Commit fca362e pushed by suiyoubi
13s main
Add TextDuplicateRemovalWorkflow (#974)
Create PR to main with cherry-pick from release #230: Commit cb19a7d pushed by praateekmahajan
8s main
Remove outdated Dask files and organize new Ray files (#971)
Create PR to main with cherry-pick from release #229: Commit 5830988 pushed by sarahyurick
11s main
Huvu/image dedup removal (#951)
Create PR to main with cherry-pick from release #228: Commit 306afe3 pushed by huvunvidia
12s main
Add FuzzyDeduplicationPipeline (#937)
Create PR to main with cherry-pick from release #227: Commit 0b22ceb pushed by ayushdg
11s main
Catch runtime error for PyNvcFrameExtractor (#975)
Create PR to main with cherry-pick from release #226: Commit b44ca25 pushed by suiyoubi
8s main
Pin cosmos xenna (#976)
Create PR to main with cherry-pick from release #225: Commit 653ce4f pushed by ayushdg
9s main
ci: Fix vllm version (#973)
Create PR to main with cherry-pick from release #224: Commit 9470fba pushed by thomasdhc
8s main
Add fsspec support to id_gen io (#972)
Create PR to main with cherry-pick from release #223: Commit 83d43fd pushed by ayushdg
12s main
Text Removal for Dedup Stage (#924)
Create PR to main with cherry-pick from release #222: Commit fae00de pushed by praateekmahajan
12s main
Update community-bot to add issues to shard project (#913)
Create PR to main with cherry-pick from release #221: Commit 4d84b0a pushed by chtruong814
11s main