### [Promptfoo](https://github.com/promptfoo/promptfoo)

> Handle: `promptfoo`<br/>
> URL: [http://localhost:34233](http://localhost:34233)<br/>


`promptfoo` is a tool for testing, evaluating, and red-teaming LLM apps.

With promptfoo, you can:

- **Build reliable prompts, models, and RAGs** with benchmarks specific to your use-case
- **Secure your apps** with automated [red teaming](https://www.promptfoo.dev/docs/red-team/) and pentesting
- **Speed up evaluations** with caching, concurrency, and live reloading
- **Score outputs automatically** by defining [metrics](https://www.promptfoo.dev/docs/configuration/expected-outputs)
- Use as a [CLI](https://www.promptfoo.dev/docs/usage/command-line), [library](https://www.promptfoo.dev/docs/usage/node-package), or in [CI/CD](https://www.promptfoo.dev/docs/integrations/github-action)
- Use OpenAI, Anthropic, Azure, Google, HuggingFace, open-source models like Llama, or integrate custom API providers for [any LLM API](https://www.promptfoo.dev/docs/providers)

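As a rough illustration of how these pieces fit together, a minimal `promptfooconfig.yaml` might look like the following sketch (the model, prompt, and assertion values here are illustrative, not Harbor defaults):

```yaml
# promptfooconfig.yaml - a minimal, illustrative eval
prompts:
  - "Summarize this in one sentence: {{text}}"

providers:
  - ollama:chat:llama3.1:8b

tests:
  - vars:
      text: "Harbor is a toolkit for running LLM services in containers."
    assert:
      # Deterministic check on the output text
      - type: contains
        value: "Harbor"
      # Custom scoring via an inline JavaScript expression
      - type: javascript
        value: output.length < 200
```

Each entry under `tests` is run against every prompt/provider combination, and the `assert` block is what turns raw completions into pass/fail scores.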
#### Starting

```bash
# [Optional] Pre-pull the image
harbor pull promptfoo
```

You'll be running the Promptfoo CLI most of the time. It's available as:

```bash
# Full name
harbor promptfoo --help

# Alias
harbor pf --help
```

Whenever the CLI is called, it'll also automatically start the local Promptfoo backend.

```bash
# Run a CLI command
harbor pf --help

# The Promptfoo backend is now started
harbor ps # harbor.promptfoo
```

The Promptfoo backend serves all recorded results in the web UI:

```bash
# Open the web UI
harbor open promptfoo
harbor promptfoo view
harbor pf o
```

#### Usage

Most of the time, your workflow will center around creating prompts and assets, writing an eval config, running it, and viewing the results.

Harbor runs the `pf` CLI from the directory where you invoke the Harbor CLI, so you can use it from any folder on your machine.

```bash
# Ensure a dedicated folder for the eval
cd /path/to/your/eval

# Init the eval (here)
harbor pf init

# Edit the configuration and prompts as needed
# Run the eval
harbor pf eval

# View the results
harbor pf view
```

> [!NOTE]
> If you're seeing any kind of file system permission errors, you'll need to ensure that files written from within a container are [accessible to your user](../docs/1.-Harbor-User-Guide#file-system-permissions).

#### Configuration

Harbor pre-configures `promptfoo` to run against `ollama` out of the box (`ollama` must be started before `pf eval`). Any other provider can be configured via:

- env vars (see [`harbor env`](../docs/3.-Harbor-CLI-Reference#harbor-env))
- directly in your `promptfooconfig` files (see the [Providers reference](https://www.promptfoo.dev/docs/providers/) in the official documentation)

```bash
# For example, use the vLLM API
harbor env promptfoo OPENAI_BASE_URL $(harbor url -i vllm)
```
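
Providers can also be declared directly in `promptfooconfig.yaml`. A sketch using promptfoo's OpenAI-compatible provider (the endpoint, model tag, and key below are assumptions for a local vLLM instance, not Harbor defaults):

```yaml
providers:
  - id: openai:chat:llama3.1:8b
    config:
      # Point the OpenAI-compatible client at a local server
      apiBaseUrl: http://localhost:8000/v1
      apiKey: sk-local # many local servers accept any placeholder key
```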

Promptfoo is a rich and extensive tool; we recommend reading through the excellent [official documentation](https://www.promptfoo.dev/docs/intro) to get the most out of it.

Harbor comes with two basic built-in examples.

##### Promptfoo hello-world

```bash
# Navigate to the eval folder
cd $(harbor home)/services/promptfoo/examples/hello-promptfoo

# Start ollama and pull the target model
harbor up ollama
harbor ollama pull llama3.1:8b

# Run the eval
harbor pf eval

# View the results
harbor pf view
```

##### Promptfoo temp-test

Evaluate a model across a range of temperatures to see if there's a sweet spot for a given prompt.
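
One way such a sweep can be expressed in promptfoo is to list the same model several times with different `temperature` values in the provider `config` (a sketch of the idea; the actual config shipped in the example folder may differ):

```yaml
providers:
  - id: ollama:chat:llama3.1:8b
    label: temp-0.1
    config:
      temperature: 0.1
  - id: ollama:chat:llama3.1:8b
    label: temp-0.9
    config:
      temperature: 0.9
```

The `label` field keeps the results for each temperature distinguishable in the web UI.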

```bash
# Navigate to the eval folder
cd $(harbor home)/services/promptfoo/examples/temp-test

# Start ollama and pull the target model
harbor up ollama
harbor ollama pull llama3.1:8b

# Run the eval
harbor pf eval

# View the results
harbor pf view
```