Detector Algorithms for the FMS Guardrails Orchestrator

FMS Guardrails Orchestrator is an open source project led by IBM which provides a server for invocation of detectors on text generation input and output, and standalone detections.

This repository is intended to provide a collection of detector algorithms and microservices that are supported by the TrustyAI team.

Detectors

At the moment, the following detectors are supported:

huggingface -- a generic detector class that is intended to be compatible with any AutoModelForSequenceClassification or a specific kind of AutoModelForCausalLM, namely GraniteForCausalLM; this detector exposes /api/v1/text/contents and thus, could be configured to be a detector of type: text_contents within the FMS Guardrails Orchestrator framework. This detector is also intended to be deployed as a KServe inference service.
llm_judge -- Integrates the vLLM Judge library to use LLM-as-a-judge based guardrailing architecture
builtIn -- Small, lightweight detection functions that are deployed out-of-the-box alongside the Guardrails Orchestrator. The built-in detectors provide a number of heuristic or algorithmic detection functions, such as:
- Regex-based detections, with pre-written regexes for flagging various Personally Identifiable Information items like emails or phone numbers, as well as the ability to provide custom regexes
- File-type validations, for verifying if model input/output is valid JSON, XML, or YAML

Building

huggingface: podman build -f detectors/Dockerfile.hf detectors
llm_judge: podman build -f detectors/Dockerfile.llm_judge detectors
builtIn: podman build -f detectors/Dockerfile.builtIn detectors

Running locally

builtIn: podman run -p 8080:8080 $BUILT_IN_IMAGE

File Type Validation Example

curl -X POST http://localhost:8080/api/v1/text/contents \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [
      "{\"hello\": \"message\"}",
      "not valid json"
    ],
    "detector_params": {
      "file_type": [
        "json"
      ]
    }
  }'

Response:

[
  [],
  [
    {
      "start": 0,
      "end": 14,
      "text": "not valid json",
      "detection": "invalid_json",
      "detection_type": "file_type",
      "score": 1.0,
      "evidences": null
    }
  ]
]

PII Validation Example

curl -X POST http://localhost:8080/api/v1/text/contents \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [
      "Hi my email is [email protected]",
      "There is a party@my house and you can reach me at 123-456-7890"
    ],
    "detector_params": {
      "regex": [
        "email", "us-phone-number"
      ]
    }
  }' | jq

Response:

[
  [
    {
      "start": 15,
      "end": 26,
      "text": "[email protected]",
      "detection": "email_address",
      "detection_type": "pii",
      "score": 1.0,
      "evidences": null
    }
  ],
  [
    {
      "start": 50,
      "end": 62,
      "text": "123-456-7890",
      "detection": "us-phone-number",
      "detection_type": "pii",
      "score": 1.0,
      "evidences": null
    }
  ]
]

Get list of built-in detection algorithms:

curl http://localhost:8080/registry | jq

Response:

{
  "regex": {
    "credit-card": "Detect credit cards in the text contents (Visa, MasterCard, Amex, Discover, Diners Club, JCB) with Luhn check",
    "email": "Detect email addresses in the text contents",
    "ipv4": "Detect IPv4 addresses in the text contents",
    "ipv6": "Detect IPv6 addresses in the text contents",
    "us-phone-number": "Detect US phone numbers in the text contents",
    "us-social-security-number": "Detect social security numbers in the text contents",
    "uk-post-code": "Detect UK post codes in the text contents",
    "$CUSTOM_REGEX": "Replace $CUSTOM_REGEX with a custom regex to define your own regex detector"
  },
  "file_type": {
    "json": "Detect if the text contents is not valid JSON",
    "xml": "Detect if the text contents is not valid XML",
    "yaml": "Detect if the text contents is not valid YAML",
    "json-with-schema:$SCHEMA": "Detect if the text contents does not satisfy a provided JSON schema. To specify a schema, replace $SCHEMA with a JSON schema.",
    "xml-with-schema:$SCHEMA": "Detect if the text contents does not satisfy a provided XML schema. To specify a schema, replace $SCHEMA with an XML Schema Definition (XSD)",
    "yaml-with-schema:$SCHEMA": "Detect if the text contents does not satisfy a provided schema. To specify a schema, replace $SCHEMA with a JSON schema. That's not a typo, you validate YAML with a JSON schema!"
  }
}

API

See IBM Detector API

License

This project is licensed under the Apache License Version 2.0 - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 77 Commits
.github/workflows		.github/workflows
detectors		detectors
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Detector Algorithms for the FMS Guardrails Orchestrator

Detectors

Building

Running locally

File Type Validation Example

PII Validation Example

Get list of built-in detection algorithms:

API

License

About

Uh oh!

Releases

Packages

Languages

License

RobGeada/guardrails-detectors

Folders and files

Latest commit

History

Repository files navigation

Detector Algorithms for the FMS Guardrails Orchestrator

Detectors

Building

Running locally

File Type Validation Example

PII Validation Example

Get list of built-in detection algorithms:

API

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages