Commit bfdc9e6

Add 0.8.0 changelog (#701) (#702)
* Add 0.8.0 changelog
* Trim trailing whitespace in changelog

---------

Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Co-authored-by: Charlie Truong <chtruong@nvidia.com>
1 parent 33db56e commit bfdc9e6

1 file changed (+9, -0)
CHANGELOG.md

Lines changed: 9 additions & 0 deletions
@@ -1,5 +1,14 @@
 # Changelog
 
+## NVIDIA NeMo Curator 0.8.0
+
+- Llama Based PII Redaction
+- Trafilatura Text Extractor
+- Chinese & Japanese Stopwords for Text Extractors
+- Writing gzip compressed jsonl datasets
+- Training dataset curation for retriever customization using hard-negative mining
+- Implemented a memory efficient pairwise similarity in Semantic Deduplication
+
 ## NVIDIA NeMo Curator 0.7.1
 
 - Fix Transformers + Cuda Context bug
