jbulger82
diff --git a/‎.gitignore‎
Lines changed: 27 additions & 0 deletions b/‎.gitignore‎
Lines changed: 27 additions & 0 deletions
diff --git a/‎AGENT.md‎
Lines changed: 131 additions & 0 deletions b/‎AGENT.md‎
Lines changed: 131 additions & 0 deletions
diff --git a/‎README.md‎
Lines changed: 153 additions & 0 deletions b/‎README.md‎
Lines changed: 153 additions & 0 deletions
diff --git a/‎data/rag/profile_template/profile_public.md‎
Lines changed: 30 additions & 0 deletions b/‎data/rag/profile_template/profile_public.md‎
Lines changed: 30 additions & 0 deletions
diff --git a/‎gateway/index.js‎
Lines changed: 2 additions & 0 deletions b/‎gateway/index.js‎
Lines changed: 2 additions & 0 deletions
@@ -0,0 +1,27 @@
+# Node and build outputs
+node_modules/
+gateway/node_modules/
+gateway/ui/node_modules/
+gateway/ui/dist/
+
+# Logs
+*.log
+gateway.log
+
+# Python caches/venvs
+__pycache__/
+*.pyc
+servers/venv_*/
+
+# RAG data (user content)
+data/rag/uploads/
+data/rag/saved_chats/
+data/rag/images/
+data/rag/indexes.pkl
+
+# Local indexes/cache
+data/rag/indexes/
+
+# OS cruft
+.DS_Store
+Thumbs.db
@@ -0,0 +1,131 @@
+# AGENT.MD: FullStack MCP Hub snapshot (2025-12-15)
+
+## 1) Progress summary
+- Gateway + UI stable on :3333 (tools, blocklist, RAG browser, presets, editable descriptions).
+- Tool stack ~53 tools across 12 servers: filesystem, shell, Playwright, sqlite, local_rag, websearch, websearch_adv (deep search + advanced extract), scrape, research (wiki/arXiv/Wikimedia images), python_repl (venv), pollinations, coingecko.
+- local_rag scoped to `data/rag`; `save_chat` writes raw+summary without overwriting; `list_indexes` added.
+- Python REPL owns its venv and `pip_install`; research adds Wikimedia Commons image search.
+- Scraper server for fast HTML→text; blocklist persisted (`tool-blocklist.json`) and managed via UI.
+- Allowlists: pollinations (generateImageUrl/listImageModels), coingecko (4 tools), websearch_adv (deep search + single-page extract).
+
+---
+
+## 2. Project Vision
+
+To create a **universal, model-agnostic, and extensible tool-use architecture**. This system will allow any AI model (Gemini, OpenAI's GPT, Anthropic's Claude, Grok, etc.) and any custom interface (like the Francine GUI) to seamlessly access a single, powerful, and ever-growing stack of tools.
+
+The core principle is **"Write a tool once, use it from any model."**
+
+## 3. Core Architecture
+
+The system is composed of three primary layers, ensuring maximum separation of concerns and scalability.
+
+```
++----------------+      +----------------+      +----------------+
+|   Gemini API   |      |   OpenAI API   |      |  Francine GUI  |
++----------------+      +----------------+      +----------------+
+        |                       |                       |
+        +-----------------------+-----------------------+
+                                |
+                 +------------------------------+
+                 |   Universal API Gateway      |  <-- THE KEY TO UNIVERSALITY
+                 | (Google Cloud Run/Function)  |
+                 +------------------------------+
+                 | - /gemini/v1/execute         |  (Gemini Tool Spec)
+                 | - /openai/v1/openapi.json    |  (OpenAI Plugin Spec)
+                 | - /anthropic/v1/execute      |  (Anthropic Tool Spec)
+                 | - /francine/v1/mcp           |  (Custom MCP Spec)
+                 +------------------------------+
+                                |
+                                | (Standardized Internal Request)
+                                v
+                   +--------------------------+
+                   |         MCP Hub          |  (Borrowed from Fran1)
+                   +--------------------------+
+                   | - Tool Registry          |
+                   | - Connection Manager     |
+                   | - Execution Router       |
+                   +--------------------------+
+                      |          |         |
+    (stdio/sse)       |          |         |
++-----------------+ +----------------+ +-----------------+
+| RAG MCP Server  | | Finch MCP Serv | | Playwright MCP  |
+| (Python)        | | (Node.js)      | | Server (OSS)    |
++-----------------+ +----------------+ +-----------------+
+
+```
+
+### 3.1. Layer 1: Tool Servers (The "Hands")
+
+-   **Standard:** Each tool is an independent process that communicates using the **Model Context Protocol (MCP)** over `stdio` (for local tools) or `sse` (for networked tools), just like in `Fran1`.
+-   **Responsibilities:** A tool server is responsible for one thing only: exposing its capabilities (`tools/list`) and executing them (`tools/call`).
+-   **Examples:** A server for browsing the web with Playwright, a server for reading local files (our RAG tool), a server for interacting with APIs.
+
+### 3.2. Layer 2: The MCP Hub (The "Brainstem")
+
+-   **Logic:** We will adopt the robust logic from your `Fran1` project.
+-   **Responsibilities:**
+    1.  **Registry:** Reads a `master.json` file to discover all available tool servers.
+    2.  **Connection:** Manages the lifecycle of connections to these tool servers.
+    3.  **Routing:** Provides a single internal endpoint to execute any registered tool by name.
+
+### 3.3. Layer 3: The Universal API Gateway (The "Translator")
+
+-   **This is the most critical new component for universal compatibility.** It is a public-facing API that acts as a multi-headed adaptor.
+-   **Responsibilities:**
+    1.  **Expose Model-Specific Endpoints:** It will have different endpoints that conform to the *exact* tool-use specifications of different AI providers.
+    2.  **Translate and Delegate:** When a request comes in from a specific model (e.g., Gemini), the gateway translates the model-specific request into a standardized call to the **MCP Hub**.
+    3.  **Format and Return:** It receives the result from the MCP Hub and formats it back into the response structure the original AI model expects.
+-   **Deployment:** This gateway is the perfect candidate for deployment as a serverless **Google Cloud Run** service, integrating it directly with the environment we've set up.
+
+## 4. Development Roadmap
+
+This plan is designed for parallel work. Different agents can tackle different MCP servers simultaneously once the core is in place.
+
+1.  **Setup Core Infrastructure:**
+    -   `[✅]` Initialize the `MASTER_MCP` project structure (e.g., `/servers`, `/hub`, `/gateway`).
+    -   `[✅]` Port the `McpHub` logic from `Fran1` into this new project.
+    -   `[✅]` Port the `master.json` registry and create a directory for tool server configurations.
+
+2.  **Integrate First Tool (Proof of Concept):**
+    -   `[✅]` Find and integrate an existing open-source MCP-compatible tool (e.g., for Playwright or a similar web browser tool).
+    -   `[✅]` Register it in `master.json` and confirm the `McpHub` can connect to and list its tools.
+
+3.  **Build the Local RAG Tool:**
+    -   `[✅]` Create the new `rag_mcp_server.py`.
+    -   `[✅]` Implement `list_directory`, `read_file`, and `search_files` functions.
+    -   `[✅]` Implement `create_index` and `search_index` with **named collection support**.
+    -   `[✅]` Register it with the `McpHub`.
+
+4.  **Build the Universal Gateway (Gemini First):**
+    -   `[✅]` Create the initial API Gateway project (e.g., using Node.js/Express or Python/FastAPI).
+    -   `[✅]` Implement the `/gemini/v1/execute` endpoint. This endpoint will accept a request body matching the Gemini API's `FunctionCall` format.
+    -   **Status:** Fully functional and tested.
+
+5.  **Expand and Integrate:**
+    -   `[✅]` Integrate **Shell MCP** (`mcp-server-commands`) for command-line access.
+    -   `[✅]` Integrate **Playwright MCP** (`mcp-server-playwright`) for browser automation.
+    -   `[✅]` Integrate **SQLite MCP** (`mcp-server-sqlite`) for persistent memory.
+    -   `[✅]` Initialize `tool_runs` logging table in SQLite.
+    -   `[✅]` Organize documentation into `~/.master_mcp/data/raw/mcp_docs` and create RAG index.
+
+---
+
+## 5. System Status (Live)
+
+Approx tools: 53
+
+| Server | Transport | Tools | Notes |
+| :--- | :--- | :--- | :--- |
+| filesystem | stdio (npx) | 14 | sandbox `/home/jeff` |
+| local_rag | stdio (python) | 6 | save_chat, list_indexes; sandbox `data/rag` |
+| shell | stdio (node) | 1 | commands |
+| playwright | stdio (npx) | 10 | browser automation |
+| sqlite | stdio (python venv) | 6 | db ops |
+| websearch | stdio (node) | 1 | fast DDG |
+| websearch_adv | stdio (node) | 2 | deep search + single-page extract (allowlisted) |
+| scrape | stdio (node) | 1 | fast HTML→text |
+| research | stdio (python) | 3 | wiki, arxiv, Commons images |
+| python_repl | stdio (python) | 3 | exec/reset/pip (venv) |
+| pollinations | stdio (npx) | 2 | image URL + list models |
+| coingecko | sse | 4 | allowlisted to search/price/markets/range |
@@ -0,0 +1,153 @@
+# FullStack MCP Hub
+
+This repo bundles the MCP hub/gateway plus a browser UI for discovering and running MCP tools.
+
+## What’s here
+- `hub/`: connects to configured MCP servers, lists tools, executes calls.
+- `gateway/`: HTTP/SSE surface for the hub (`/tools`, `/gemini/v1/execute`, etc.) and serves the built UI.
+- `gateway/ui/`: React UI (Vite) for browsing tools, editing descriptions, adding servers, running calls, saving presets, and managing blocked tools.
+- `tool-registry/master.json`: MCP server registry (stdio/SSE).  
+  `tool-registry/tool-overrides.json`: description overrides.  
+  `tool-registry/tool-blocklist.json`: persisted blocklist (managed via UI Blocked tab).
+- `servers/`: bundled servers (local_rag, sqlite, python_repl with venv, research incl. Wikipedia/ArXiv/Wikimedia images, scrape, pollinations, coingecko, advanced web search clone).
+  - Key shipped servers:
+    - `local_rag`: chunked search, fuzzy/filters, `save_chat`, `save_image`.
+    - `sqlite`
+    - `python_repl` (with its own venv; use `pip_install` to add packages like pandas/numpy without touching system Python)
+    - `research` (Wikipedia, ArXiv, Wikimedia Commons images)
+    - `scrape` (HTML→text)
+    - `pollinations` (image URL + models)
+    - `coingecko` (SSE, curated 4-tool allowlist: search, price, markets, range chart)
+    - `websearch` (fast DDG)
+    - `websearch_adv` (deep multi-engine search + single-page extract, local clone)
+    - `playwright`, `shell`, `filesystem`
+
+## Why (intent)
+- Make MCP approachable: start the gateway, open the UI, add servers with a guided form, test, and go.
+- Serve ops/dev workflows (LLM ops / “llmOPS” vibes) with a single pane to discover, run, and tune tools.
+- No-required-API-key defaults: ships with stdio-friendly examples like Playwright for browser/search.
+
+## Contact / collab
+- Built by Jeff Bulger — https://jeffbulger.dev | [email protected] | GitHub: https://github.com/jbulger82
+- Looking for collaborators who want to build/extend open LLM OPS tooling (MCP servers, local-first flows, RAG, search, automation).
+
+## Prereqs
+- Node 18+ (gateway/UI), npm.
+- The MCP servers you want to run (stdio commands or SSE endpoints).
+
+## Start everything
+```bash
+export MCP_ROOT=/path/to/Fullstack_MCP_hub   # set to your clone path
+cd gateway
+npm start
+# UI served at http://localhost:3333
+```
+
+First-time setup (deps + UI build):
+```bash
+cd gateway && npm install
+cd ui && npm install && npm run build
+```
+
+One-shot setup helper (does the installs/builds/playwright browsers):
+```bash
+export MCP_ROOT=/path/to/Fullstack_MCP_hub   # set to your clone path
+./setup.sh
+```
+
+Quick start after setup:
+```bash
+export MCP_ROOT=/path/to/Fullstack_MCP_hub
+./start.sh
+```
+
+If port 3333 is busy: `lsof -i :3333` then `kill <pid>` and retry.
+
+## Using the UI
+Open `http://localhost:3333`.
+
+- **Add MCP server (guided)**  
+  - Click “Open form” under Servers (left pane).  
+  - Choose transport:
+    - `stdio`: fill Command (e.g., `npx`), Args (e.g., `-y @automatalabs/mcp-server-playwright`), optional CWD.
+    - `sse`: fill full SSE URL (e.g., `http://localhost:4000/sse`).  
+  - Click **Test connection** (runs a lightweight connect + tools/list).  
+  - On success, click **Add server** to persist to `tool-registry/master.json` and connect live. Tools appear in the list.
+
+- **Block/restore tools**  
+  - Tools tab: “Block tool” hides the selected tool (persisted to `tool-blocklist.json`).  
+  - Blocked tab: view/restore blocked tools.
+
+- **Browse & run tools**  
+  - Select a tool in the left list; right pane shows description + input schema.  
+  - Enter JSON payload (or keep `{}`) and click **Run tool**.  
+  - Responses show in “Result”; status chip shows timing.
+
+- **Presets**  
+  - Save current payload (“Save current”); apply or delete per tool. Stored locally in browser storage.
+
+- **Edit descriptions**  
+  - Tool detail pane has an editable description. **Save** writes to `tool-registry/tool-overrides.json`. **Restore default** removes the override.
+
+- **RAG tab**  
+  - Browse `data/rag/uploads`, `saved_chats`, `indexes`; drag/drop “ADD FILE”; search filenames/paths; delete with confirm. Uploads auto-refresh the `uploads` index.
+  - Indexing: text files are chunked (~500 words + overlap); indexes persist to disk (`indexes.pkl`) so they survive restarts. `search_index` supports `fuzzy`, `path_contains`, `tag` (from a `#tags:` line), and mtime filters. Results return matching chunks (with file/chunk info).
+  - Save tools: `save_chat` writes raw+summary to `data/rag/saved_chats` (no overwrites); `save_image` writes base64 images to `data/rag/images`.
+  - Profiles: a starter template lives at `data/rag/profile_template/profile_public.md`. Copy/rename to your own folder (e.g., `data/rag/profile_me/profile_public.md`) and edit with your details; use `#tags:` if you want tag filtering.
+
+## Common stdio examples (no API key)
+- Playwright (browser automation/search/screenshot):  
+  - Command: `npx`  
+  - Args: `-y @automatalabs/mcp-server-playwright`  
+  - CWD: `servers`
+- DuckDuckGo Websearch (included):  
+  - Command: `node`  
+  - Args: `servers/websearch/server.js`  
+  - CWD: repo root (`MASTER_MCP`)
+- Python REPL (persistent session + its own venv):  
+  - Command: `python3`  
+  - Args: `python_repl_mcp.py`  
+  - CWD: `servers`
+- Research (Wikipedia, ArXiv, Wikimedia Commons images):  
+  - Command: `python3`  
+  - Args: `research_mcp_server.py`  
+  - CWD: `servers`
+- Scraper (fast HTML fetch/clean):  
+  - Command: `node`  
+  - Args: `scrape_mcp_server.js`  
+  - CWD: `servers`
+- Advanced web search (multi-engine + extraction; cloned locally):  
+  - Command: `node`  
+  - Args: `dist/index.js`  
+  - CWD: `servers/web-search-mcp`
+- Coingecko (SSE):  
+  - Transport: `sse`  
+  - URL: `https://mcp.api.coingecko.com/sse`
+
+## Project scripts
+Gateway (`/gateway`):
+- `npm start` – start gateway + serve built UI.
+- `npm run build:ui` – build UI assets into `gateway/ui/dist`.
+- `npm run dev:ui` – UI dev server with proxy to gateway.
+
+UI (`/gateway/ui`):
+- `npm run dev` – Vite dev server (proxies to 3333).
+- `npm run build` – production build.
+
+## Paths of interest
+- Gateway entry: `gateway/server.js`
+- Hub logic: `hub/McpHub.js`
+- UI entry: `gateway/ui/src/App.jsx`
+- Registry: `tool-registry/master.json`
+- Description overrides: `tool-registry/tool-overrides.json`
+- Blocklist: `tool-registry/tool-blocklist.json`
+- RAG data: `data/rag/` (uploads, saved_chats, indexes)
+  - Includes `data/rag/images` for saved screenshots via `local_rag__save_image`.
+- Advanced search clone: `servers/web-search-mcp/`
+
+## Troubleshooting
+- Port in use: `lsof -i :3333` → `kill <pid>` → restart.
+- Test fails when adding server: check command/args or SSE URL; rerun **Test connection** to see error.
+- No tools after add: refresh tools (left pane Refresh) or restart gateway after fixing registry.
+- If a server keeps failing: check paths/CWD, block noisy tools via Blocked tab, then reconnect.
+- Connecting from hosted UIs (e.g., ChatGPT dev mode): if the local URL is marked unsafe, tunnel with ngrok (e.g., ngrok v3/stable 3.34.1) to expose `http://localhost:3333` over HTTPS, then use the ngrok URL.
@@ -0,0 +1,30 @@
+# Profile (Public) – Starter Template
+
+Name: <your name>
+Handle: <optional handle>
+Location: <city/state or region>
+
+Personality & vibe:
+- <how you like answers> (e.g., direct/concise or detailed/friendly)
+- <comfort with tone> (e.g., casual/professional)
+- <boundaries> (e.g., no NSFW, no long-winded filler)
+
+Preferences:
+- <tools you prefer to use> (e.g., local LLMs, certain search tools)
+- <formatting you like> (e.g., bullet summaries, include timestamps)
+- <other preferences> (e.g., keep responses short/long)
+
+Current focus/projects:
+- <project 1 + short note>
+- <project 2 + short note>
+
+System basics (optional):
+- OS / environment:
+- Notable paths:
+- Anything an agent should know:
+
+Usage notes for agents:
+- Treat this file as stable profile context for any model.
+- If the user says “remember this” or “add to profile,” append here or to a notes file under `data/rag/`.
+
+#tags: profile, user-context
@@ -0,0 +1,2 @@
+// Convenience entrypoint: keeps `node .` or `npm start` aligned with the real server.
+import './server.js';
Original file line number	Diff line number	Diff line change
`@@ -0,0 +1,2 @@`
	`1`	+// Convenience entrypoint: keeps `node .` or `npm start` aligned with the real server.
	`2`	`+import './server.js';`