Commit e1edde3

Update phishing_email_detection_gpt2.py
try a low seq_len
1 parent ca7b34f commit e1edde3

1 file changed: +1 -1 lines changed

phishing_email_detection_gpt2.py

Lines changed: 1 addition & 1 deletion
@@ -34,7 +34,7 @@
 
 # Text encoding / embedding related constants
 
-MAX_SEQ_LENGTH = 1536 # (baseline from text classification model)
+MAX_SEQ_LENGTH = 500 # (baseline from text classification model)
 
 tokenizer_checkpoint = "HuggingFaceTB/SmolLM3-3B" # "HuggingFaceTB/SmolLM2-1.7B-Instruct"
 tokenizer = AutoTokenizer.from_pretrained(tokenizer_checkpoint)
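
The commit does not show how MAX_SEQ_LENGTH is consumed downstream, so the following is only a minimal sketch of what a fixed sequence length usually implies: inputs longer than the limit are clipped, and shorter ones are padded. The helper `pad_or_truncate` and the constant `PAD_TOKEN_ID` are hypothetical names, not taken from phishing_email_detection_gpt2.py.

```python
from typing import List

MAX_SEQ_LENGTH = 500  # value introduced by this commit (previously 1536)
PAD_TOKEN_ID = 0      # hypothetical pad id; the real one depends on the tokenizer


def pad_or_truncate(
    token_ids: List[int],
    max_len: int = MAX_SEQ_LENGTH,
    pad_id: int = PAD_TOKEN_ID,
) -> List[int]:
    """Return exactly max_len ids: clip long inputs, right-pad short ones."""
    clipped = token_ids[:max_len]
    return clipped + [pad_id] * (max_len - len(clipped))


# An email that tokenized to 1536 ids now keeps only its first 500.
long_email = list(range(1, 1537))
print(len(pad_or_truncate(long_email)))
```

With a Hugging Face tokenizer, the same effect is normally requested at call time with `tokenizer(text, max_length=MAX_SEQ_LENGTH, truncation=True, padding="max_length")`; whether the script does this is an assumption. The trade-off of the lower limit is that any content past the first 500 tokens of an email is dropped, in exchange for smaller inputs.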
