Skip to content

Actions: NVIDIA-NeMo/Curator

Actions

Create PR to main with cherry-pick from release

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
388 workflow runs
388 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Bug fix in dockerfile ARG vs ENV var (#446)
Create PR to main with cherry-pick from release #38: Commit 35b5993 pushed by praateekmahajan
16s main
update test params to account for new minhash algo (#442)
Create PR to main with cherry-pick from release #37: Commit c929203 pushed by ayushdg
16s main
ci: Bump _build_container (#448)
Create PR to main with cherry-pick from release #36: Commit f73c1b8 pushed by ko3n1g
16s main
ci: Bump release workflow (better message formatting) (#444)
Create PR to main with cherry-pick from release #35: Commit 55fe227 pushed by sarahyurick
16s main
Prompt Task/Complexity Classifier (#364)
Create PR to main with cherry-pick from release #34: Commit 27dd211 pushed by sarahyurick
20s main
Add blocksize to DocumentDataset.read_* that uses `dask_cudf.read_*…
Create PR to main with cherry-pick from release #33: Commit e820b8b pushed by sarahyurick
19s main
Bump RAPIDS stable to 24.12 and RAPIDS nightly to 25.02 (#434)
Create PR to main with cherry-pick from release #32: Commit c54826a pushed by sarahyurick
14s main
Content Type Classifier (#361)
Create PR to main with cherry-pick from release #31: Commit 9df5d7b pushed by sarahyurick
16s main
Add documentation for Instruction-Data-Guard classifier (#398)
Create PR to main with cherry-pick from release #30: Commit 86830ab pushed by sarahyurick
15s main
Adding fuzzy and semantic dedupe (#428)
Create PR to main with cherry-pick from release #29: Commit 3c3cc98 pushed by sarahyurick
16s main
Allow users to write to single file (#383)
Create PR to main with cherry-pick from release #28: Commit 079d46f pushed by sarahyurick
16s main
ci: Bump release workflow (#422)
Create PR to main with cherry-pick from release #26: Commit 1c0382e pushed by pablo-garay
23s main
Use PyTorchModelHubMixin for InstructionDataGuardNet (#416)
Create PR to main with cherry-pick from release #25: Commit 87d0cc7 pushed by sarahyurick
18s main
Updating index.rst to match requested layout (#414)
Create PR to main with cherry-pick from release #24: Commit b4c67b5 pushed by ryantwolf
18s main
Multilingual Domain Classifier (#363)
Create PR to main with cherry-pick from release #23: Commit 7272ca0 pushed by sarahyurick
13s main
Update FineWebEduClassifier identifier (#403)
Create PR to main with cherry-pick from release #22: Commit edd6262 pushed by sarahyurick
14s main
Add READMEs to examples/ and nemo_curator/scripts directories (#332)
Create PR to main with cherry-pick from release #21: Commit d1f52f6 pushed by sarahyurick
20s main
fix gpu CI test failure (#401)
Create PR to main with cherry-pick from release #20: Commit bc724ec pushed by sarahyurick
23s main
Change FineTuneGuardClassifier to InstructionDataGuardClassifier
Create PR to main with cherry-pick from release #19: Commit d14ac42 pushed by sarahyurick
18s main
ci: Allow dry-run of release (#395)
Create PR to main with cherry-pick from release #18: Commit 110cede pushed by pablo-garay
14s main
Add support for parallel data curation (#193)
Create PR to main with cherry-pick from release #17: Commit 3d14b0d pushed by ryantwolf
16s main
Add support for FineTune-Guard classifier (#397)
Create PR to main with cherry-pick from release #16: Commit b15b08a pushed by sarahyurick
15s main
Add codepath for computing buckets without int conversion (#326)
Create PR to main with cherry-pick from release #15: Commit 3ebc807 pushed by ayushdg
13s main
pin crossfit to 0.0.7 (#394)
Create PR to main with cherry-pick from release #14: Commit a024652 pushed by sarahyurick
14s main
ProTip! You can narrow down the results and go further in time using created:<2024-11-27 or the other filters available.