You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,7 +4,7 @@ Reddit User Mimic Bot is an end-to-end Python project that fine‑tunes a BART m
4
4
5
5
## Features
6
6
-**LoRA Training** – fine-tune `facebook/bart-base` using Low-Rank Adaptation on scraped Reddit Q&A data.
7
-
-**Data Pipeline** – scripts to scrape Reddit posts with filters for gathering better quality data, preprocess them, split into train/validation/test sets, tokenize and reformat.
7
+
-**Data Pipeline** – scripts to scrape Reddit posts with filters for gathering better quality data, preprocess them, split into train/validation/test sets, tokenize and reformat; leverages data collator for dynamic batching and tensor‑core‑friendly padding.
8
8
-**Hugging Face Integration** – optional model upload to the Hugging Face Hub.
9
9
-**Logging and Experiment Tracking** – train/loss, val/loss tracked via Weights & Biases.
10
10
-**Model Efficiency** – utilize early stopping and Scaled Dot-Product Attention (SDPA) to optimize training efficiency and manage compute resources effectively.
0 commit comments