add initial pii doc

knielsen404 · knielsen404 · commit ae9c0bb1483d · 2025-04-11T13:34:55.000+02:00
diff --git a/docs/assets/invariant.css b/docs/assets/invariant.css
@@ -10,6 +10,12 @@
 }
 
 
+/* define primary blue */
+:root {
+    --primary-blue: #3d3affac;
+}
+
+
 body {
     font-family: system-ui, -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;
     margin: 0;
@@ -371,6 +377,57 @@ span.llm::before {
     border-radius: 4pt;
 }
 
+span.llm-badge::before {
+    content: "LLM-based";
+    color: white;
+    font-size: 8pt;
+    position: relative;
+    top: -3pt;
+    margin-left: 3pt;
+    background-color: rgb(199, 130, 199);
+    display: inline-block;
+    height: 16pt;
+    
+    padding: 2pt 4pt;
+    border-radius: 4pt;
+}
+
+span.detector-badge::before {
+    content: "Detector";
+    color: #eef2ff;
+    font-size: 10pt;
+    position: relative;
+    top: -3pt;
+    margin-left: 3pt;
+    background-color: var(--primary-blue);
+    display: inline-block;
+    height: 18pt;
+    
+    padding: 2pt 4pt;
+    border-radius: 4pt;
+}
+
+.detector-badge {
+    position: relative;
+  }
+  
+  .detector-badge:hover::after {
+    content: 'DETECTOR DESCRIPTION';
+    position: absolute;
+    left: 50%;
+    transform: translateX(-50%);
+    bottom: 100%;
+    margin-bottom: 5px;
+    background: rgba(0, 0, 0, 0.215);
+    color: white;
+    padding: 5px 10px;
+    border-radius: 4px;
+    font-size: 14px;
+    white-space: nowrap;
+    z-index: 99;
+    pointer-events: none;
+  }
+
 .jupyter-wrapper {
     margin-top: -20pt;
 }
@@ -704,6 +761,57 @@ ul.md-nav__list {
     margin-top: -5pt;
 }
 
+.md-typeset__table {
+    width: 100%;
+}
+
+.md-typeset__table table {
+    width: 100%;
+    table-layout: auto;
+}
+
+/* Set minimum widths for the first two columns */
+.md-typeset__table th:nth-child(1), 
+.md-typeset__table td:nth-child(1) {
+    width: 15%;
+    min-width: 100px;
+}
+
+.md-typeset__table th:nth-child(2), 
+.md-typeset__table td:nth-child(2) {
+    width: 25%;
+    min-width: 250px;
+}
+
+/* Let the description column take up remaining space */
+.md-typeset__table th:nth-child(3), 
+.md-typeset__table td:nth-child(3) {
+    width: 60%;
+}
+
+.function-type {
+    display: inline-block;
+    background: #eef2ff;
+    color: var(--primary-blue);
+    padding-left: 6px;
+    padding-right: 6px;
+    border-radius: 4px;
+    font-size: 0.85em;
+    margin-left: 8px;
+    font-weight: 500;
+    font-family: monospace;
+}
+
+
+.code-caption {
+    font-size: 0.65rem;
+    color: #666;
+    margin-top: -0.9rem;
+    padding-left: 4px;
+    font-style: italic;
+  }
+
+  
 .box.secondary {
     position: relative;
 }
diff --git a/docs/guardrails/pii.md b/docs/guardrails/pii.md
@@ -0,0 +1,80 @@
+# PII Detection
+<div class='subtitle'>
+Detect and manage PII in traces.
+</div>
+
+Personally Identifiable Information (PII) refers to sensitive information — like names, emails, or credit card numbers — whether intentionally or not. If not properly handled, this data can be exposed in logs, traces, or external communications, leading to privacy violations, regulatory risks, or user harm.
+
+<div class='risks'/> 
+> **PII Risks**<br/> 
+> Without safeguards, agents may: 
+
+> * Log PII in traces or internal tools 
+> * Share PII in responses or external tool calls
+
+The `pii` function helps prevent these issues by scanning messages for PII, thus acting as a safeguard that lets you detect and block sensitive data before it’s stored, surfaced, or shared.
+
+## pii <span class="detector-badge"/>
+```python
+def pii(
+    data: Union[str, List[str]],
+    entities: Optional[List[str]] = None
+) -> List[str]
+```
+Detector to find personally indentifaible information in text.
+
+**Parameters**
+
+| Name        | Type   | Description                            |
+|-------------|--------|----------------------------------------|
+| `data`      | `Union[str, List[str]]` | A single message or a list of messages to detect PII in |
+| `entities`  | `Optional[List[str]]`   | A list of [PII entity types](https://microsoft.github.io/presidio/supported_entities/) to detect. Defaults to detecting all types. |
+
+**Returns**
+
+| Type   | Description                            |
+|--------|----------------------------------------|
+| `List[str]` | A list of all the detected PII in `data` |
+
+### Detecting PII
+The simplest usage of the `pii` function is to check against any message. The following example will raise an error if any message in the trace contains PII.
+
+**Example:** Detecting any PII in any message.
+``` py
+from invariant.detectors import pii
+
+raise "Found PII in message" if:
+    (msg: Message)
+    any(pii(msg))
+```
+<div class="code-caption"> Any PII in the text of the trace will raise an error. </div>
+
+
+### Detecting Specific PII Types
+You can also specify specific types of PII that you would like to detect, such as phone numbers, emails, or credit card information. The example below demonstrates how to detect credit card numbers in Messages.
+
+**Example:** Detecting Credit Card Numbers.
+```guardrail
+from invariant.detectors import pii
+
+raise "Found PII in message" if:
+    (msg: Message)
+    any(pii(msg, ["CREDIT_CARD"]))
+```
+<div class="code-caption"> Only messages containing credit card numbers will raise an error. </div>
+
+
+### Preventing PII leakage
+It is also possible to use the `pii` function in combination with other filters to get more complex behaviour. The example below shows how you can detect when an agent attempts to send emails outside of your organisation. 
+
+**Example:** Detecting PII Leakage in External Communications.
+```python
+from invariant.detectors import pii
+
+raise "Attempted to send PII in an email" if:
+    (out: ToolOutput) -> (call: ToolCall)
+    any(pii(out.content))
+    call is tool:send_email({ to: "^(?!.*@ourcompany.com$).*$" }) 
+```
+<div class="code-caption"> Explicitly prevent sending emails with PII to non-company email domains. </div>
+