panw-api-ollama

Enhance your Ollama deployment with enterprise-grade AI security using Palo Alto Networks Prisma AIRS AI Runtime API Intercept.

What is this?

panw-api-ollama is a security proxy that sits between your OpenWebUI interface and Ollama instance. It works by intercepting all prompts and responses, analyzing them with Palo Alto Networks' Prisma AIRS AI Runtime security technology, and protecting your system from:

  • Prompt injection attacks
  • Data exfiltration attempts
  • Harmful or toxic content
  • Personally identifiable information (PII) leakage
  • Other AI-specific security threats

The best part? It's completely transparent to your existing setup: Ollama will still work just as before, but with an added layer of security.
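
Because the proxy speaks the same HTTP API as Ollama, pointing a client at it is just a matter of changing the port. As a quick illustration (assuming the default ports used throughout this README: Ollama on 11434, the proxy on 11435):

# Direct to Ollama (no security scanning)
curl http://localhost:11434/api/generate -d '{"model": "llama2-uncensored:latest", "prompt": "Hello", "stream": false}'

# Same request through panw-api-ollama (scanned by Prisma AIRS)
curl http://localhost:11435/api/generate -d '{"model": "llama2-uncensored:latest", "prompt": "Hello", "stream": false}'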

Why use this?

  • Prevent Security Incidents: Detect and block malicious prompts before they reach your LLM
  • Protect Sensitive Data: Ensure responses don't contain unauthorized information
  • Maintain Compliance: Implement guardrails for safe AI usage in enterprise environments
  • Visibility: Gain insights into usage patterns and potential threats

Use Cases

  • Secure AI models in production: Validate prompt requests and responses to protect deployed AI models.
  • Detect data poisoning: Identify contaminated training data before fine-tuning.
  • Protect against adversarial input: Safeguard AI agents from malicious inputs and outputs while maintaining workflow flexibility.
  • Prevent sensitive data leakage: Use API-based threat detection to block sensitive data leaks during AI interactions.

Installation Options

There are two ways to install and run panw-api-ollama:

Option 1: Docker Setup (Recommended)

Docker is the recommended installation method as it:

  • Handles all permissions automatically
  • Provides a pre-configured environment
  • Eliminates system-level dependency issues
  • Makes updates and maintenance easier

For Docker-based deployment, please refer to the instructions in the Docker Setup README.
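
As a mental model of what the compose setup wires together, the sketch below is illustrative only: service names, images, and the build context are assumptions, and the authoritative file is the one in the Docker Setup README.

services:
  ollama:
    image: ollama/ollama                          # upstream LLM runtime
    ports:
      - "11434:11434"
  panw-api-ollama:
    build: .                                      # the security proxy, listening on 11435
    ports:
      - "11435:11435"
    depends_on:
      - ollama
  open-webui:
    image: ghcr.io/open-webui/open-webui:main     # chat UI, pointed at the proxy
    environment:
      - OLLAMA_BASE_URL=http://panw-api-ollama:11435
    ports:
      - "3000:8080"
    depends_on:
      - panw-api-ollama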

Option 2: Build from Source

If you prefer to build from source or can't use Docker, you can build the Rust application directly. Note that this method may require additional system configuration and could face permission issues depending on your setup.

Step 1: Build
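
Building requires a Rust toolchain. If cargo is not already installed, the standard rustup installer sets it up:

# Install the Rust toolchain (provides cargo)
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh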

git clone https://github.com/PaloAltoNetworks/panw-api-ollama.git
cd panw-api-ollama
cargo build --release

Step 2: Get a Palo Alto Networks Prisma AIRS AI Runtime API Intercept Key

Follow this tutorial, specifically step 13, to obtain your API key.

Step 3: Configure

Copy config.yaml.example to config.yaml and update it with your API key:

cp config.yaml.example config.yaml

Then edit the file to add your Palo Alto Networks Prisma AIRS AI Runtime API Intercept key:

pan_api:
  key: "your-pan-api-key-here"
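
The key is the only field you must change, but the proxy also needs to know where to listen and where to reach Ollama. The sketch below is a rough illustration only; field names other than pan_api.key are assumptions, so treat config.yaml.example as the authoritative schema:

# Illustrative sketch, not the actual schema
server:
  port: 11435                        # where panw-api-ollama listens; OpenWebUI points here
ollama:
  url: "http://localhost:11434"      # the upstream Ollama instance being protected
pan_api:
  key: "your-pan-api-key-here"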

Step 4: Update OpenWebUI

For non-Docker installations, point OpenWebUI at the proxy's port 11435 instead of Ollama's default port 11434:

  1. Go to Settings > Server Management in the OpenWebUI interface
  2. Add a new Ollama server with URL: http://localhost:11435
  3. Save your configuration

Alternatively, update your OpenWebUI environment settings: OpenWebUI Environment Configuration
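
For example, if OpenWebUI itself runs in Docker, its documented OLLAMA_BASE_URL environment variable can point it at the proxy on the host instead of directly at Ollama:

# Run OpenWebUI against the security proxy (port 11435) rather than Ollama (11434)
docker run -d -p 3000:8080 \
  -e OLLAMA_BASE_URL=http://host.docker.internal:11435 \
  --name open-webui ghcr.io/open-webui/open-webui:main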

Step 5: Download a model

Before using the service, make sure you have a model available:

ollama pull llama2-uncensored:latest
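
You can confirm the model is available before starting the proxy:

# List locally available models
ollama list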

Step 6: Run

./target/release/panw-api-ollama

You're all set! You can now use OpenWebUI as normal, but with enterprise security scanning all interactions.
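
A quick way to verify the proxy is in the path is to call a standard Ollama endpoint through it; the response should match what Ollama returns directly on port 11434:

# List models through the security proxy
curl http://localhost:11435/api/tags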

Configuration Examples

The project includes example configuration files in the config-examples directory that demonstrate different setup options:

OpenWebUI Global Configuration

The config-1747909231428.json file shows how to set up OpenWebUI with both secured and unsecured Ollama connections (the // comments below are annotations for this README, not valid JSON, and must be removed before importing):

{
    "ollama": {
        "enable": true,
        "base_urls": [
            "http://panw-api-ollama:11435",  // Secure connection through panw-api-ollama
            "http://host.docker.internal:11434"  // Direct connection to Ollama
        ],
        "api_configs": {
            "0": {
                "enable": true,
                "tags": [],
                "prefix_id": "PANW",  // Models with this prefix use the security proxy
                "model_ids": [
                    "llama2-uncensored:latest"
                ],
                "key": ""
            },
            "1": {
                "enable": true,
                "tags": [],
                "prefix_id": "NOPAWN",  // Models with this prefix bypass the security proxy
                "model_ids": [
                    "nomic-embed-text:latest",
                    "llama2-uncensored:latest"
                ],
                "key": ""
            }
        }
    }
}

Model Configurations

Two example model configurations are included to demonstrate before/after comparisons:

  1. PANW.llama2-uncensored_latest-1747909321539.json - A model using the security proxy
  2. NOPAWN.llama2-uncensored_latest-1747909327080.json - The same model bypassing the security proxy

These configurations let you run side-by-side comparisons and demonstrations of how Palo Alto Networks Prisma AIRS AI Runtime security scanning affects model responses.

Support

For issues related to this integration, please file an issue on GitHub. For questions about Palo Alto Networks Prisma AIRS AI Runtime API Intercept, please refer to official support channels.
