Skip to content

✨ feat: RAG Service vulnerable to Data Poisoning Attacks + new exploits and detection strategies#169

Open
LauraRandl wants to merge 83 commits intomainfrom
rag-service
Open

✨ feat: RAG Service vulnerable to Data Poisoning Attacks + new exploits and detection strategies#169
LauraRandl wants to merge 83 commits intomainfrom
rag-service

Conversation

@LauraRandl
Copy link
Contributor

@LauraRandl LauraRandl commented Mar 3, 2026

The following new functionality is added to Unguard in the course of the Master’s Thesis “Evaluating Embedding-based Detection Strategies for Data Poisoning Attacks on Knowledge Bases in RAG Systems”:

  • Three new services, adding spam predictions for posts in Unguard: rag-service, feedback-ingestion-service, ollama
  • New AI Vulnerability: The RAG service's Knowledge Base is vulnerable to Data Poisoning Attacks
  • Three types of Data Poisoning Attacks are implemented, which can be performed either using the exploit toolkit or manually through the frontend: general label flipping attack, targeted label flipping attack, keyword injection attack
  • Three Embeddings-based Detection Strategies which - if enabled - are applied before ingesting new data into the Knowledge Base to detect poisoned entries

The RAG service can be used with either OpenAI models via LangDock (API key required), or Ollama models running locally. However, when using local Ollama models please note that the performance is heavily dependent on the machine it is running on.

@LauraRandl LauraRandl requested a review from a team as a code owner March 3, 2026 12:08
@LauraRandl LauraRandl changed the title ✨ feat: ✨ feat: RAG Service Mar 3, 2026
@LauraRandl LauraRandl changed the title ✨ feat: RAG Service ✨ feat: RAG Service vulnerable to Data Poisoning Attacks + new exploits and detection strategies Mar 3, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant