You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Improved RedTeam coverage across risk sub-categories to ensure comprehensive security testing
Made RedTeam's AttackStrategy.Tense seed prompts dynamic to allow use of this strategy with additional risk categories
Refactors error handling and result semantics in the RedTeam evaluation system to improve clarity and align with Attack Success Rate (ASR) conventions (passed=False means attack success)
Bugs Fixed
Fixed RedTeam evaluation error related to context handling for context-dependent risk categories
Fixed RedTeam prompt application for model targets during Indirect Jailbreak XPIA (Cross-Platform Indirect Attack)