fix: LoRA Model Training Configuration and Data Balance #233

OneZero-Y · 2025-09-26T05:25:36Z

What type of PR is this?
Fix LoRA Model Training Configuration and Data Balance

What this PR does / why we need it:

Fixed critical configuration and data balance issues in LoRA training scripts that were causing model accuracy problems and training instability.

Key Issues Resolved

Enhanced Target Modules Configuration

Improved attention learning: Added attention.self.key to BERT/RoBERTa target modules
Better error handling: Added target modules verification and clear error messages

Advanced Dataset Balancing

Intent Classification: Enhanced category balance category validation
PII Detection: Implemented entity-type balanced sampling with negative examples
Security Detection: Added strict 1:1 jailbreak/benign balance with distribution verification

Files Modified

common_lora_utils.py - Core LoRA configuration and model validation
ft_linear_lora.py - Intent classification with enhanced category balancing
pii_bert_finetuning_lora.py - PII detection with entity-type balancing
jailbreak_bert_finetuning_lora.py - Security detection with strict 1:1 balance

netlify · 2025-09-26T05:25:42Z

✅ Deploy Preview for vllm-semantic-router ready!

Name	Link
🔨 Latest commit	`989b376`
🔍 Latest deploy log	https://app.netlify.com/projects/vllm-semantic-router/deploys/68d7fa656fad0800083d3119
😎 Deploy Preview	https://deploy-preview-233--vllm-semantic-router.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

github-actions · 2025-09-26T05:25:48Z

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 `src`

Owners: @rootfs, @Xunzhuo, @wangchen615
Files changed:

src/training/training_lora/OWNER
src/training/training_lora/classifier_model_fine_tuning_lora/ft_linear_lora.py
src/training/training_lora/classifier_model_fine_tuning_lora/ft_linear_lora_verifier.go
src/training/training_lora/classifier_model_fine_tuning_lora/train_cpu_optimized.sh
src/training/training_lora/common_lora_utils.py
src/training/training_lora/pii_model_fine_tuning_lora/pii_bert_finetuning_lora.py
src/training/training_lora/pii_model_fine_tuning_lora/pii_bert_finetuning_lora_verifier.go
src/training/training_lora/pii_model_fine_tuning_lora/train_cpu_optimized.sh
src/training/training_lora/prompt_guard_fine_tuning_lora/jailbreak_bert_finetuning_lora.py
src/training/training_lora/prompt_guard_fine_tuning_lora/jailbreak_bert_finetuning_lora_verifier.go
src/training/training_lora/prompt_guard_fine_tuning_lora/train_cpu_optimized.sh

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

rootfs · 2025-09-26T14:57:25Z

@OneZero-Y Thanks for the idea. For jailbreak 1:1 mapping, is it necessary? Would that bias towards jailbreak in evaluation?

rootfs · 2025-09-26T15:03:04Z

src/training/training_lora/classifier_model_fine_tuning_lora/ft_linear_lora.py

    validate_lora_config,
 )

 # Setup logging


nit: can you add OWNER file in the training_lora directory and add you there?

Okay, I added it.

Signed-off-by: OneZero-Y <[email protected]> Fix LoRA Model Training Configuration and Data Balance Signed-off-by: OneZero-Y <[email protected]>

OneZero-Y · 2025-09-27T07:48:22Z

@rootfs
According to latest LLM Guard and Guardrails documentation :

LLM Guard: 1:1 balanced datasets are recommended for binary security classifiers
- Source: github.com/protectai/llm-guard/blob/main/docs/tutorials/notebooks/langchain_rag.ipynb (multi-layer detection)
- Source: github.com/protectai/llm-guard/blob/main/docs/output_scanners/ban_topics.md (threshold tuning)
- Source: github.com/protectai/llm-guard/blob/main/docs/output_scanners/emotion_detection.md (binary classification thresholds)
Guardrails AI: Supports balanced training with threshold tuning for optimization
- Source: github.com/guardrails-ai/guardrails/tree/main/docs/how_to_guides/custom_validators.md (threshold configuration)
- Source: github.com/guardrails-ai/guardrails/tree/main/docs/examples/provenance.ipynb (validation_method="sentence")
- Source: github.com/guardrails-ai/guardrails/tree/main/docs/examples/toxic_language.ipynb (multi-validator strategy)
Prevent model bias toward majority class, ensure equal sensitivity to attack/benign patterns

Signed-off-by: OneZero-Y <[email protected]> fix:LoRA Model Training Configuration and Data Balance Signed-off-by: OneZero-Y <[email protected]>

rootfs · 2025-09-27T14:47:57Z

@OneZero-Y thank you for the references! They are very helpful!

rootfs · 2025-09-27T14:51:59Z

@OneZero-Y I am still catching up on the LoRA training. This is beyond my working experience :D

Merging the PR now

…#233) * Fix LoRA Model Training Configuration and Data Balance Signed-off-by: OneZero-Y <[email protected]> Fix LoRA Model Training Configuration and Data Balance Signed-off-by: OneZero-Y <[email protected]> * fix:LoRA Model Training Configuration and Data Balance Signed-off-by: OneZero-Y <[email protected]> fix:LoRA Model Training Configuration and Data Balance Signed-off-by: OneZero-Y <[email protected]> --------- Signed-off-by: OneZero-Y <[email protected]> Co-authored-by: Huamin Chen <[email protected]> Signed-off-by: liuhy <[email protected]>

…#233) * Fix LoRA Model Training Configuration and Data Balance Signed-off-by: OneZero-Y <[email protected]> Fix LoRA Model Training Configuration and Data Balance Signed-off-by: OneZero-Y <[email protected]> * fix:LoRA Model Training Configuration and Data Balance Signed-off-by: OneZero-Y <[email protected]> fix:LoRA Model Training Configuration and Data Balance Signed-off-by: OneZero-Y <[email protected]> --------- Signed-off-by: OneZero-Y <[email protected]> Co-authored-by: Huamin Chen <[email protected]>

OneZero-Y requested review from Xunzhuo, rootfs and wangchen615 as code owners September 26, 2025 05:25

github-actions bot assigned rootfs, wangchen615 and Xunzhuo Sep 26, 2025

OneZero-Y changed the title ~~Fix LoRA Model Training Configuration and Data Balance~~ fix: LoRA Model Training Configuration and Data Balance Sep 26, 2025

OneZero-Y force-pushed the fix/lora-model-training branch from 715b5c9 to 1262ea5 Compare September 26, 2025 13:15

rootfs reviewed Sep 26, 2025

View reviewed changes

rootfs added this to the v0.1 milestone Sep 26, 2025

Fix LoRA Model Training Configuration and Data Balance

cb3d2d5

Signed-off-by: OneZero-Y <[email protected]> Fix LoRA Model Training Configuration and Data Balance Signed-off-by: OneZero-Y <[email protected]>

OneZero-Y force-pushed the fix/lora-model-training branch from 1262ea5 to 17c38ae Compare September 27, 2025 07:49

fix:LoRA Model Training Configuration and Data Balance

36ec9ff

Signed-off-by: OneZero-Y <[email protected]> fix:LoRA Model Training Configuration and Data Balance Signed-off-by: OneZero-Y <[email protected]>

OneZero-Y force-pushed the fix/lora-model-training branch from 17c38ae to 36ec9ff Compare September 27, 2025 13:14

rootfs approved these changes Sep 27, 2025

View reviewed changes

rootfs added 2 commits September 27, 2025 09:52

Merge branch 'main' into fix/lora-model-training

04a6630

Merge branch 'main' into fix/lora-model-training

989b376

rootfs merged commit 56cdc08 into vllm-project:main Sep 27, 2025
4 checks passed

OneZero-Y deleted the fix/lora-model-training branch September 30, 2025 08:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: LoRA Model Training Configuration and Data Balance #233

fix: LoRA Model Training Configuration and Data Balance #233

Uh oh!

OneZero-Y commented Sep 26, 2025

Uh oh!

netlify bot commented Sep 26, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Sep 26, 2025 •

edited

Loading

Uh oh!

rootfs commented Sep 26, 2025

Uh oh!

rootfs Sep 26, 2025

Uh oh!

OneZero-Y Sep 27, 2025

Uh oh!

OneZero-Y commented Sep 27, 2025

Uh oh!

rootfs commented Sep 27, 2025

Uh oh!

rootfs commented Sep 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fix: LoRA Model Training Configuration and Data Balance #233

fix: LoRA Model Training Configuration and Data Balance #233

Uh oh!

Conversation

OneZero-Y commented Sep 26, 2025

Uh oh!

netlify bot commented Sep 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for vllm-semantic-router ready!

Uh oh!

github-actions bot commented Sep 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

👥 vLLM Semantic Team Notification

📁 src

🎉 Thanks for your contributions!

Uh oh!

rootfs commented Sep 26, 2025

Uh oh!

rootfs Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

OneZero-Y Sep 27, 2025

Choose a reason for hiding this comment

Uh oh!

OneZero-Y commented Sep 27, 2025

Uh oh!

rootfs commented Sep 27, 2025

Uh oh!

rootfs commented Sep 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

netlify bot commented Sep 26, 2025 •

edited

Loading

github-actions bot commented Sep 26, 2025 •

edited

Loading

📁 `src`