Extend IT prompts with 59 attack pattern test cases across 10 categories by Copilot · Pull Request #20 · camunda/BizSol_bb-ai-firewall-agent

Copilot · 2026-04-01T15:40:18Z

Adds comprehensive prompt injection test coverage for 10 attack categories and hardens the system prompt to explicitly detect these patterns.

New prompt test files (59 total, all `block` category)

Category	Count	Prefix
Direct Instruction Override	7	`override-*`
Role & Persona Hijacking	7	`persona-*`
Prompt Leaking / Extraction	7	`leaking-*`
Nested / Indirect Injection	6	`nested-*`
Delimiter / Context Confusion	6	`delimiter-*`
Encoding & Obfuscation	8	`encoding-*`
Hypothetical / Fictional Framing	6	`hypothetical-*`
Incremental / Gradual Escalation	4	`escalation-*`
Instruction Smuggling via Format	5	`smuggling-*`
Payload Splitting / Multi-step	3	`payload-*`

System prompt enhancements

Expanded the risk detection checklist and escalation rules in safeguard-systemprompt.txt:

injection/jailbreak — added explicit patterns for persona hijacking (developer mode, maintenance mode), prompt leaking (repeat everything above, translate your instructions), delimiter confusion (fake ###SYSTEM###, [INST], <|im_start|> tokens), nested injection (directives embedded in documents/JSON/HTML/emails), payload splitting (code-word tricks, deferred instructions), incremental escalation (false precedents)
policy_evasion — added fictional/hypothetical framing patterns and instruction smuggling via YAML/LaTeX/code comments/HTML
obfuscation — added leetspeak, reversed text, token splitting, URL-encoding, Unicode lookalikes
escalation rules — nested injection in structured content → always block; fictional framing around harmful content → block

All three prompt locations synced via mvn compile exec:java (canonical txt → FEEL → BPMN embedding).

Agent-Logs-Url: https://github.com/camunda/BizSol_bb-ai-firewall-agent/sessions/f86fa402-ff3f-4394-b37a-f40a955a3444 Co-authored-by: vobu <6573426+vobu@users.noreply.github.com>

github-actions · 2026-04-01T16:16:05Z

Test Coverage Report (Camunda 8.8)

Docker image: camunda/camunda:8.8-SNAPSHOT

Process Test Coverage (safeguard-agent): 100.0%

✅ Meets minimum requirement (60%)

📊 Detailed reports available in build artifacts

github-actions · 2026-04-01T16:16:18Z

Test Coverage Report (Camunda 8.9)

Docker image: camunda/camunda:8.9-SNAPSHOT

Process Test Coverage (safeguard-agent): 100.0%

✅ Meets minimum requirement (60%)

📊 Detailed reports available in build artifacts

github-actions · 2026-04-01T16:43:37Z

Test Coverage Report (Camunda 8.8)

Docker image: camunda/camunda:8.8-SNAPSHOT

Process Test Coverage (safeguard-agent): 100.0%

✅ Meets minimum requirement (60%)

📊 Detailed reports available in build artifacts

github-actions · 2026-04-01T16:43:55Z

Test Coverage Report (Camunda 8.9)

Docker image: camunda/camunda:8.9-SNAPSHOT

Process Test Coverage (safeguard-agent): 100.0%

✅ Meets minimum requirement (60%)

📊 Detailed reports available in build artifacts

Initial plan

e5343c7

Copilot AI assigned Copilot and vobu Apr 1, 2026

Copilot started work on behalf of vobu April 1, 2026 15:40 View session

Copilot AI linked an issue Apr 1, 2026 that may be closed by this pull request

extend IT Prompts #16

Closed

Copilot AI and others added 2 commits April 1, 2026 15:49

test: add 59 new block prompt test files for 10 attack categories

deff3a1

Agent-Logs-Url: https://github.com/camunda/BizSol_bb-ai-firewall-agent/sessions/f86fa402-ff3f-4394-b37a-f40a955a3444 Co-authored-by: vobu <6573426+vobu@users.noreply.github.com>

feat: enhance system prompt with extended attack pattern detection

e7b37ed

Agent-Logs-Url: https://github.com/camunda/BizSol_bb-ai-firewall-agent/sessions/f86fa402-ff3f-4394-b37a-f40a955a3444 Co-authored-by: vobu <6573426+vobu@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Copilot Request~~ Extend IT prompts with 59 attack pattern test cases across 10 categories Apr 1, 2026

Copilot finished work on behalf of vobu April 1, 2026 16:03

Copilot AI requested a review from vobu April 1, 2026 16:03

fix: failing prompt tests with refined prompt

32caf3b

vobu approved these changes Apr 1, 2026

View reviewed changes

vobu marked this pull request as ready for review April 1, 2026 17:03

vobu merged commit 36c0bfc into main Apr 1, 2026
21 of 23 checks passed

vobu deleted the copilot/fix-2443838-1139127023-b1311f80-14d3-4c82-b62e-3cbc3d9844cb branch April 1, 2026 17:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend IT prompts with 59 attack pattern test cases across 10 categories#20

Extend IT prompts with 59 attack pattern test cases across 10 categories#20
vobu merged 4 commits intomainfrom
copilot/fix-2443838-1139127023-b1311f80-14d3-4c82-b62e-3cbc3d9844cb

Copilot AI commented Apr 1, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Apr 1, 2026

Uh oh!

github-actions bot commented Apr 1, 2026

Uh oh!

github-actions bot commented Apr 1, 2026

Uh oh!

github-actions bot commented Apr 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

New prompt test files (59 total, all block category)

System prompt enhancements

Uh oh!

github-actions bot commented Apr 1, 2026

Test Coverage Report (Camunda 8.8)

Process Test Coverage (safeguard-agent): 100.0%

Uh oh!

github-actions bot commented Apr 1, 2026

Test Coverage Report (Camunda 8.9)

Process Test Coverage (safeguard-agent): 100.0%

Uh oh!

github-actions bot commented Apr 1, 2026

Test Coverage Report (Camunda 8.8)

Process Test Coverage (safeguard-agent): 100.0%

Uh oh!

github-actions bot commented Apr 1, 2026

Test Coverage Report (Camunda 8.9)

Process Test Coverage (safeguard-agent): 100.0%

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Apr 1, 2026 •

edited

Loading

New prompt test files (59 total, all `block` category)