Skip to content

Extend IT prompts with 59 attack pattern test cases across 10 categories#20

Merged
vobu merged 4 commits intomainfrom
copilot/fix-2443838-1139127023-b1311f80-14d3-4c82-b62e-3cbc3d9844cb
Apr 1, 2026
Merged

Extend IT prompts with 59 attack pattern test cases across 10 categories#20
vobu merged 4 commits intomainfrom
copilot/fix-2443838-1139127023-b1311f80-14d3-4c82-b62e-3cbc3d9844cb

Conversation

Copy link
Copy Markdown
Contributor

Copilot AI commented Apr 1, 2026

Adds comprehensive prompt injection test coverage for 10 attack categories and hardens the system prompt to explicitly detect these patterns.

New prompt test files (59 total, all block category)

Category Count Prefix
Direct Instruction Override 7 override-*
Role & Persona Hijacking 7 persona-*
Prompt Leaking / Extraction 7 leaking-*
Nested / Indirect Injection 6 nested-*
Delimiter / Context Confusion 6 delimiter-*
Encoding & Obfuscation 8 encoding-*
Hypothetical / Fictional Framing 6 hypothetical-*
Incremental / Gradual Escalation 4 escalation-*
Instruction Smuggling via Format 5 smuggling-*
Payload Splitting / Multi-step 3 payload-*

System prompt enhancements

Expanded the risk detection checklist and escalation rules in safeguard-systemprompt.txt:

  • injection/jailbreak — added explicit patterns for persona hijacking (developer mode, maintenance mode), prompt leaking (repeat everything above, translate your instructions), delimiter confusion (fake ###SYSTEM###, [INST], <|im_start|> tokens), nested injection (directives embedded in documents/JSON/HTML/emails), payload splitting (code-word tricks, deferred instructions), incremental escalation (false precedents)
  • policy_evasion — added fictional/hypothetical framing patterns and instruction smuggling via YAML/LaTeX/code comments/HTML
  • obfuscation — added leetspeak, reversed text, token splitting, URL-encoding, Unicode lookalikes
  • escalation rules — nested injection in structured content → always block; fictional framing around harmful content → block

All three prompt locations synced via mvn compile exec:java (canonical txt → FEEL → BPMN embedding).

Copilot AI assigned Copilot and vobu Apr 1, 2026
Copilot AI linked an issue Apr 1, 2026 that may be closed by this pull request
Copilot AI changed the title [WIP] Copilot Request Extend IT prompts with 59 attack pattern test cases across 10 categories Apr 1, 2026
Copilot finished work on behalf of vobu April 1, 2026 16:03
Copilot AI requested a review from vobu April 1, 2026 16:03
@github-actions
Copy link
Copy Markdown

github-actions bot commented Apr 1, 2026

Test Coverage Report (Camunda 8.8)

Docker image: camunda/camunda:8.8-SNAPSHOT

Process Test Coverage (safeguard-agent): 100.0%

✅ Meets minimum requirement (60%)

📊 Detailed reports available in build artifacts

@github-actions
Copy link
Copy Markdown

github-actions bot commented Apr 1, 2026

Test Coverage Report (Camunda 8.9)

Docker image: camunda/camunda:8.9-SNAPSHOT

Process Test Coverage (safeguard-agent): 100.0%

✅ Meets minimum requirement (60%)

📊 Detailed reports available in build artifacts

@github-actions
Copy link
Copy Markdown

github-actions bot commented Apr 1, 2026

Test Coverage Report (Camunda 8.8)

Docker image: camunda/camunda:8.8-SNAPSHOT

Process Test Coverage (safeguard-agent): 100.0%

✅ Meets minimum requirement (60%)

📊 Detailed reports available in build artifacts

@github-actions
Copy link
Copy Markdown

github-actions bot commented Apr 1, 2026

Test Coverage Report (Camunda 8.9)

Docker image: camunda/camunda:8.9-SNAPSHOT

Process Test Coverage (safeguard-agent): 100.0%

✅ Meets minimum requirement (60%)

📊 Detailed reports available in build artifacts

@vobu vobu marked this pull request as ready for review April 1, 2026 17:03
@vobu vobu merged commit 36c0bfc into main Apr 1, 2026
21 of 23 checks passed
@vobu vobu deleted the copilot/fix-2443838-1139127023-b1311f80-14d3-4c82-b62e-3cbc3d9844cb branch April 1, 2026 17:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

extend IT Prompts

2 participants