Skip to content

Commit 409668f

Browse files
authored
Feature Enhancement: Batch Inference Support in candle-binding (#71)
* Feature Enhancement: Batch Inference Support in candle-binding Feature Enhancement: Batch Inference Support in candle-binding Signed-off-by: OneZero-Y <[email protected]> fix: unified_classifier_test Signed-off-by: OneZero-Y <[email protected]> fix: unified_classifier_test Signed-off-by: OneZero-Y <[email protected]> fix: unit_test Signed-off-by: OneZero-Y <[email protected]> * feat: add LoRA training pipeline and intelligent model selection - Complete LoRA training scripts for 3 classification tasks - Smart model selection with architecture priority (BERT > RoBERTa > ModernBERT) - Official Candle BERT integration for Python-Go consistency - Enhanced unified classifier with high-confidence LoRA models Signed-off-by: OneZero-Y <[email protected]> * fix: unit test and model download from huggingface Signed-off-by: OneZero-Y <[email protected]> fix: unit test and model download from huggingface Signed-off-by: OneZero-Y <[email protected]> fix: unit test and model download from huggingface Signed-off-by: OneZero-Y <[email protected]> fix: unit test and model download from huggingface Signed-off-by: OneZero-Y <[email protected]> fix: unit test and model download from huggingface Signed-off-by: OneZero-Y <[email protected]> fix: unit test and model download from huggingface Signed-off-by: OneZero-Y <[email protected]> fix: unit test and model download from huggingface Signed-off-by: OneZero-Y <[email protected]> --------- Signed-off-by: OneZero-Y <[email protected]>
1 parent ee71357 commit 409668f

34 files changed

+11454
-791
lines changed

Makefile

Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -256,6 +256,42 @@ download-models:
256256
hf download LLM-Semantic-Router/pii_classifier_modernbert-base_presidio_token_model --local-dir models/pii_classifier_modernbert-base_presidio_token_model; \
257257
fi
258258

259+
@if [ ! -d "lora_intent_classifier_bert-base-uncased_model" ]; then \
260+
hf download LLM-Semantic-Router/lora_intent_classifier_bert-base-uncased_model --local-dir models/lora_intent_classifier_bert-base-uncased_model; \
261+
fi
262+
263+
@if [ ! -d "models/lora_intent_classifier_roberta-base_model" ]; then \
264+
hf download LLM-Semantic-Router/lora_intent_classifier_roberta-base_model --local-dir models/lora_intent_classifier_roberta-base_model; \
265+
fi
266+
267+
@if [ ! -d "models/lora_intent_classifier_modernbert-base_model" ]; then \
268+
hf download LLM-Semantic-Router/lora_intent_classifier_modernbert-base_model --local-dir models/lora_intent_classifier_modernbert-base_model; \
269+
fi
270+
271+
@if [ ! -d "models/lora_pii_detector_bert-base-uncased_model" ]; then \
272+
hf download LLM-Semantic-Router/lora_pii_detector_bert-base-uncased_model --local-dir models/lora_pii_detector_bert-base-uncased_model; \
273+
fi
274+
275+
@if [ ! -d "models/lora_pii_detector_roberta-base_model" ]; then \
276+
hf download LLM-Semantic-Router/lora_pii_detector_roberta-base_model --local-dir models/lora_pii_detector_roberta-base_model; \
277+
fi
278+
279+
@if [ ! -d "models/lora_pii_detector_modernbert-base_model" ]; then \
280+
hf download LLM-Semantic-Router/lora_pii_detector_modernbert-base_model --local-dir models/lora_pii_detector_modernbert-base_model; \
281+
fi
282+
283+
@if [ ! -d "models/lora_jailbreak_classifier_bert-base-uncased_model" ]; then \
284+
hf download LLM-Semantic-Router/lora_jailbreak_classifier_bert-base-uncased_model --local-dir models/lora_jailbreak_classifier_bert-base-uncased_model; \
285+
fi
286+
287+
@if [ ! -d "models/lora_jailbreak_classifier_roberta-base_model" ]; then \
288+
hf download LLM-Semantic-Router/lora_jailbreak_classifier_roberta-base_model --local-dir models/lora_jailbreak_classifier_roberta-base_model; \
289+
fi
290+
291+
@if [ ! -d "models/lora_jailbreak_classifier_modernbert-base_model" ]; then \
292+
hf download LLM-Semantic-Router/lora_jailbreak_classifier_modernbert-base_model --local-dir models/lora_jailbreak_classifier_modernbert-base_model; \
293+
fi
294+
259295
# Milvus container management
260296
start-milvus:
261297
@echo "Starting Milvus container for testing with $(CONTAINER_RUNTIME)..."

0 commit comments

Comments
 (0)