Skip to content

Commit 20fd237

Browse files
authored
Add 0.8.0 changelog (#701)
* Add 0.8.0 changelog Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Trim trailing whitespace in changelog Signed-off-by: Charlie Truong <chtruong@nvidia.com> --------- Signed-off-by: Charlie Truong <chtruong@nvidia.com>
1 parent 8b99196 commit 20fd237

File tree

1 file changed

+9
-0
lines changed

1 file changed

+9
-0
lines changed

CHANGELOG.md

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,14 @@
11
# Changelog
22

3+
## NVIDIA NeMo Curator 0.8.0
4+
5+
- Llama Based PII Redaction
6+
- Trafilatura Text Extractor
7+
- Chinese & Japanese Stopwords for Text Extractors
8+
- Writing gzip compressed jsonl datasets
9+
- Training dataset curation for retriever customization using hard-negative mining
10+
- Implemented a memory efficient pairwise similarity in Semantic Deduplication
11+
312
## NVIDIA NeMo Curator 0.7.1
413

514
- Fix Transformers + Cuda Context bug

0 commit comments

Comments
 (0)