Release v3.3.0 by RomiconEZ · Pull Request #157 · LLAMATOR-Core/llamator

RomiconEZ · 2025-07-15T16:45:28Z

Changelog v3.3.0

Redesigned the output of testing parameter presets. Added the following presets: all, owasp:llm01, owasp:llm07, owasp:llm09, llm, vlm, eng, rus.
Added a new Linguistic Sandwich attack. An adversarial prompt in a low-resource language is sandwiched between benign prompts in other languages.
In the System Prompt Leakage attack, the heuristiс evaluation has been replaced with LLM-as-a-judge. This checks the similarity between the system's output and the intended prompt based on the system description.
The static Past Tense attack has become the dynamic Time Machine attack. The attacking model now alters the temporal context of the adversarial prompt.
Add new tag - model: llm / vlm
README update - Enterprise Version announce
Other minor fixes and improvements.

add fitting datasets to `num_attempts`

Whatsapp example

* Set dependency - httpx == 0.27.2 * Release v1.1.0 * Delete deprecate img and and chroma-data to gitignore

Union history

rewrite all examples notebooks in english

fix attack model system prompt

* Implement class "MultiStageInteractionSession" for multistage attack. Add new functionality for ChatSession class. * Add multistage to the sycophancy and logical tests --------- Co-authored-by: Roman <roman.nieronov@mail.ru>

Refactor sycophancy and logical_inconsistencies and linguistic

…ultiStageInteractionSession.

…the attacker

Add refine_attack_prompt func to MultiStageInteractionSession.

enhance whatsapp example

…n chat_client

nizamovtimur

Changelog v3.2.0..v3.3.0

Redesigned the output of testing parameter presets. Added the following presets: all, owasp:llm01, owasp:llm07, owasp:llm09, llm, vlm, eng, rus.
Added a new Linguistic Sandwich attack. An adversarial prompt in a low-resource language is sandwiched between benign prompts in other languages.
In the System Prompt Leakage attack, the heuristiс evaluation has been replaced with LLM-as-a-judge. This checks the similarity between the system's output and the intended prompt based on the system description.
The static Past Tense attack has become the dynamic Time Machine attack. The attacking model now alters the temporal context of the adversarial prompt.
Other minor fixes and improvements.

NickoJo · 2025-07-15T17:38:03Z

Completely

это лучше убрать, смешное слово

Add NoneType checking for Judge Model responses fix AutoDAN-Turbo

nizamovtimur

требуется добавить в этот релиз фикс из #158
описание релиза можно оставить таким же, как и в моем прошлом ревью

Enhance evaluations

Update CONTRIBUTING.md

…ions.

Copilot

Pull Request Overview

This is a major version release (v3.3.0) that introduces significant improvements to attack preset handling, adds new attack methods, and enhances the overall framework functionality.

Replaced parameter-based configuration with a dynamic preset system supporting multiple categories and OWASP classifications
Added new attack modules including Time Machine, Linguistic Sandwich attacks
Enhanced existing attack modules with improved error handling and model compatibility tags

Reviewed Changes

Copilot reviewed 54 out of 56 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
`src/llamator/__version__.py`	Version bump to 3.3.0
`src/llamator/utils/test_presets.py`	Complete rewrite of preset system with dynamic generation based on attack tags
`src/llamator/utils/attack_params.py`	Refactored to support new preset system and improved parameter handling
`src/llamator/attacks/time_machine.py`	New attack module for temporal framing vulnerabilities
`src/llamator/attacks/linguistic_sandwich.py`	New attack exploiting attention blink in low-resource languages
`tests/print_test_preset_test.py`	New utility script for displaying preset configurations
Multiple attack files	Added model compatibility tags and improved descriptions

src/llamator/utils/test_presets.py

src/llamator/attacks/time_machine.py

src/llamator/client/chat_client.py

src/llamator/attacks/vlm_text_hallucination.py

src/llamator/attacks/vlm_lowres_docs.py

nizamovtimur and others added 30 commits December 10, 2024 15:14

Merge pull request #44 from RomiconEZ/add-data-fit-to-num_attempts

d8d2203

add fitting datasets to `num_attempts`

Add WhatsApp example in README

3f9238d

Add WhatsApp example in Doc

c000a1a

WhatsApp example

87b55f3

Add model_description to ClientWhatsAppSelenium init

ccb2159

Merge pull request #45 from RomiconEZ/whatsapp-example

32473ab

Whatsapp example

Main - release v1.1.0 (#46)

847805c

* Set dependency - httpx == 0.27.2 * Release v1.1.0 * Delete deprecate img and and chroma-data to gitignore

Release v1.1.1 (#47)

08d595b

Release v1.1.1

c8156be

Merge branch 'release'

5a59018

Union history

rewrite all examples notebooks in english

e50a277

Merge pull request #50 from RomiconEZ/translate-examples

b275bcc

rewrite all examples notebooks in english

fix attack model system prompt

b2511b9

Merge pull request #52 from RomiconEZ/small-fix-examples

c5c2ac5

fix attack model system prompt

Multi stage attack (#51)

da2b320

* Implement class "MultiStageInteractionSession" for multistage attack. Add new functionality for ChatSession class. * Add multistage to the sycophancy and logical tests --------- Co-authored-by: Roman <roman.nieronov@mail.ru>

move stop_criterion from loop

651999f

fix sycophancy and logical_inconsistencies naming

f7c7e69

rename translation.py to linguistic.py

d3bfe8c

Merge pull request #54 from RomiconEZ/refactor-multistages

28db862

Refactor sycophancy and logical_inconsistencies and linguistic

Add refine_attack_prompt func to MultiStageInteractionSession.

86541a6

Add refine_tested_client_prompt and refine_attacker_prompt funcs to M…

584d362

…ultiStageInteractionSession.

enhance whatsapp example

830b408

sync logic with handling tested_client_response before passing it to …

a10b227

…the attacker

pre-commit

0a3a55c

Merge pull request #55 from RomiconEZ/refine_attack_prompt

87e44cd

Add refine_attack_prompt func to MultiStageInteractionSession.

Merge pull request #56 from RomiconEZ/enhance-whatsapp-example

a854cda

enhance whatsapp example

added harmful_behavior_multistage.py

3adc800

corrected harmful_behavior_multistage.py according to the new logic i…

1c63774

…n chat_client

corrected attack

6bb723b

corrected attack

fed170e

RomiconEZ added 2 commits July 15, 2025 18:34

Enhance test cases and add default handling for num_attempts parameter

78e5346

Release v3.3.0

c898881

RomiconEZ requested review from NickoJo and nizamovtimur July 15, 2025 16:45

RomiconEZ self-assigned this Jul 15, 2025

RomiconEZ added the release label Jul 15, 2025

NickoJo approved these changes Jul 15, 2025

View reviewed changes

nizamovtimur approved these changes Jul 15, 2025

View reviewed changes

Fix NoneTypes and AutoDAN-Turbo (#158)

9a4122e

Add NoneType checking for Judge Model responses fix AutoDAN-Turbo

nizamovtimur self-requested a review July 21, 2025 11:03

nizamovtimur requested changes Jul 21, 2025

View reviewed changes

nizamovtimur and others added 12 commits July 22, 2025 12:24

Update CONTRIBUTING.md

d3fdca3

If response len < 3 do not eval

906b78c

Update linguistic_sandwich.py

2923266

add more stopwords

2eb9604

gg

e5ad473

abc

5ed37ba

fix

b893833

Merge pull request #162 from LLAMATOR-Core/nizamovtimur-patch-1

33fedb1

Enhance evaluations

Merge pull request #161 from LLAMATOR-Core/enhance-contributing

d0e0229

Update CONTRIBUTING.md

Merge remote-tracking branch 'origin/main' into main_last

3deca0f

Update Run Tests section in CONTRIBUTING.md for clearer test instruct…

efac9ff

…ions.

Merge branch 'main_last' into release_last

37298d8

RomiconEZ requested a review from Copilot July 27, 2025 12:48

Copilot AI reviewed Jul 27, 2025

View reviewed changes

nizamovtimur self-requested a review July 27, 2025 15:38

nizamovtimur approved these changes Jul 27, 2025

View reviewed changes

RomiconEZ merged commit 105e215 into release Jul 27, 2025
4 of 5 checks passed

RomiconEZ deleted the release-3-3-0 branch September 7, 2025 15:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release v3.3.0#157

Release v3.3.0#157
RomiconEZ merged 429 commits intoreleasefrom
release-3-3-0

RomiconEZ commented Jul 15, 2025 •

edited

Loading

Uh oh!

nizamovtimur left a comment •

edited

Loading

Uh oh!

NickoJo commented Jul 15, 2025

Uh oh!

nizamovtimur left a comment

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Comments

Conversation

RomiconEZ commented Jul 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changelog v3.3.0

Uh oh!

nizamovtimur left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Changelog v3.2.0..v3.3.0

Uh oh!

NickoJo commented Jul 15, 2025

Uh oh!

nizamovtimur left a comment

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Comments

RomiconEZ commented Jul 15, 2025 •

edited

Loading

nizamovtimur left a comment •

edited

Loading