BNLNPPS
diff --git a/.gitignore b/.gitignore
Lines changed: 10 additions & 1 deletion
diff --git a/.vscode/mcp.json b/.vscode/mcp.json
Lines changed: 10 additions & 0 deletions
diff --git a/CLAUDE-toplevel.md b/CLAUDE-toplevel.md
Lines changed: 15 additions & 8 deletions
diff --git a/CLAUDE.md b/CLAUDE.md
Lines changed: 83 additions & 1 deletion
diff --git a/GEMINI.md b/GEMINI.md
Lines changed: 61 additions & 0 deletions
diff --git a/README.md b/README.md
Lines changed: 21 additions & 5 deletions
diff --git a/current_issue.md b/current_issue.md
Lines changed: 47 additions & 0 deletions
diff --git a/docker-compose.yml b/docker-compose.yml
Lines changed: 2 additions & 1 deletion
@@ -196,4 +196,13 @@ cython_debug/
 # Supervisord logs and runtime files
 logs/
 *.pid
-supervisord.pid
+supervisord.pid
+
+# macOS
+.DS_Store
+.DS_Store?
+._*
+.Spotlight-V100
+.Trashes
+ehthumbs.db
+Thumbs.db
@@ -0,0 +1,10 @@
+{
+  "servers": {
+    "hf-mcp-server": {
+      "url": "https://huggingface.co/mcp",
+      "headers": {
+        "Authorization": "Bearer $HUGGINGFACE_API_KEY"
+      }
+    }
+  }
+}
@@ -57,29 +57,33 @@ cd swf-testbed && ./run_all_tests.sh
 
 ### System Initialization
 ```bash
-cd swf-testbed
+cd $SWF_PARENT_DIR/swf-testbed
 source .venv/bin/activate
-pip install -e ../swf-common-lib ../swf-monitor .
-swf-testbed init
+pip install -e $SWF_PARENT_DIR/swf-common-lib $SWF_PARENT_DIR/swf-monitor .
+# CRITICAL: Set up Django environment
+cp $SWF_PARENT_DIR/swf-monitor/.env.example $SWF_PARENT_DIR/swf-monitor/.env
+# Edit .env to set DB_PASSWORD='your_db_password' and SECRET_KEY
+cd $SWF_PARENT_DIR/swf-monitor/src && python manage.py migrate
+cd $SWF_PARENT_DIR/swf-testbed && swf-testbed init
 ```
 
 ### Infrastructure Services
 ```bash
 # Start with Docker (recommended)
-cd swf-testbed && swf-testbed start
+cd $SWF_PARENT_DIR/swf-testbed && swf-testbed start
 
 # Or start locally (requires PostgreSQL/ActiveMQ installed)
-cd swf-testbed && swf-testbed start-local
+cd $SWF_PARENT_DIR/swf-testbed && swf-testbed start-local
 ```
 
 ### Testing
 ```bash
 # Test entire ecosystem
-cd swf-testbed && ./run_all_tests.sh
+cd $SWF_PARENT_DIR/swf-testbed && ./run_all_tests.sh
 
 # Test individual components
-cd swf-monitor && ./run_tests.sh
-cd swf-common-lib && ./run_tests.sh
+cd $SWF_PARENT_DIR/swf-monitor && ./run_tests.sh
+cd $SWF_PARENT_DIR/swf-common-lib && ./run_tests.sh
 ```
 
 ## Repository-Specific Guidance
@@ -151,6 +155,9 @@ Each repository contains its own CLAUDE.md with detailed, repository-specific gu
 ## Troubleshooting
 
 ### Common Issues
+- **Virtual Environment Persistence**: The shell environment, including the activated virtual environment, does **not** persist between `run_shell_command` calls. You **MUST** chain environment setup and the command that requires it in a single call.
+  - **Correct**: `cd swf-testbed && source ./install.sh && cd ../swf-monitor && python3 src/manage.py migrate`
+  - **Incorrect**: Running `source ./install.sh` in one call and `python3 src/manage.py migrate` in another.
 - **Core repository structure**: Ensure swf-testbed, swf-monitor, and swf-common-lib are siblings
 - **Environment variables**: Check SWF_HOME is set correctly (auto-configured by CLI)
 - **Database connections**: Verify PostgreSQL is running and accessible
 
@@ -2,6 +2,47 @@
 
 This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
 
+## Critical Thinking Requirements
+
+Before implementing ANY solution, Claude must explain:
+
+1. **Complete Data Flow Analysis**
+   - Where does data come from?
+   - Where does it get stored?
+   - Where does it get used?
+   - What persists between runs?
+   - What gets cached or reused?
+
+2. **Problem Definition**
+   - What is the actual problem vs what I think it is?
+   - What assumptions am I making?
+   - What evidence do I have that my understanding is correct?
+
+3. **Solution Validation**
+   - Why will this solution work?
+   - What could go wrong?
+   - How can I verify it worked?
+   - What side effects might occur?
+
+## DO NOT CODE UNTIL:
+- You can trace the complete data flow
+- You can explain why the current behavior is happening
+- You can explain exactly what needs to change
+- You have stated all assumptions explicitly
+
+## Common Failure Patterns to Avoid:
+- Jumping to implementation without understanding the system
+- Assuming data behaves as expected without verification
+- Ignoring data persistence between script runs
+- Making changes without understanding their scope
+- Failing to clear cached/persistent data
+
+## When Stuck:
+1. Stop coding
+2. Explain what you think is happening
+3. Ask for verification of your understanding
+4. Only proceed when understanding is confirmed
+
 ## Development Environment
 
 ### Claude Code Setup
@@ -15,6 +56,9 @@ This file provides guidance to Claude Code (claude.ai/code) when working with co
 - `./run_tests.sh` - Run tests for swf-testbed only (uses pytest)
 - `./run_all_tests.sh` - Run tests across all swf-* repositories in parent directory
 - Tests are located in `tests/` directory and use pytest framework
+- **Auto-activation**: Test scripts activate the virtual environment automatically if it is not already active
+  - After the one-time `source install.sh` setup, run `./run_all_tests.sh` directly; no manual activation is needed
+  - Scripts set up their own environment variables internally
 
 ### Testbed Management
 - `swf-testbed init` - Initialize environment (creates logs/ directory and supervisord.conf)
@@ -32,6 +76,18 @@ This file provides guidance to Claude Code (claude.ai/code) when working with co
 - `source .venv/bin/activate && pip install .[test]` - Install test dependencies
 - Virtual environment located at `.venv/` - ALWAYS activate before any Python commands
 
+**Initial Setup**
+- Run `source install.sh` once when setting up the development environment
+- This installs all dependencies and creates the virtual environment
+- After initial setup, test scripts handle their own environment activation
+
+**CRITICAL: Django .env Configuration Required**
+- Copy `.env.example` to `.env` in the swf-monitor directory: `cp $SWF_PARENT_DIR/swf-monitor/.env.example $SWF_PARENT_DIR/swf-monitor/.env`
+- Update the database password in `.env` to match Docker: `DB_PASSWORD='your_db_password'`
+- Set the Django secret key: `SECRET_KEY='django-insecure-dev-key-for-testing-only-change-for-production-12345678901234567890'`
+- Run Django migrations: `cd $SWF_PARENT_DIR/swf-monitor/src && python manage.py migrate`
+- Without a proper `.env` setup, Django tests will fail with database authentication errors
+
 ## Architecture Overview
 
 ### Multi-Repository Structure
@@ -67,6 +123,10 @@ The system implements loosely coupled agents that communicate via ActiveMQ messa
 ### Multi-Repository Development
 - **Always use infrastructure branches**: `infra/baseline-v1`, `infra/baseline-v2`, etc. for all development
 - Create coordinated branches with same name across all affected repositories
+- **CRITICAL: Always push with `-u` flag on first push**: `git push -u origin branch-name`
+  - This sets up branch tracking which is essential for VS Code and git status
+  - Without `-u`, branches appear "unpublished" even after pushing
+  - Example: `git push -u origin infra/baseline-v10`
 - Document specific features and changes through descriptive commit messages
 - Never push directly to main - always use branches and pull requests
 - Run `./run_all_tests.sh` before merging infrastructure changes
@@ -110,4 +170,26 @@ This maintenance should be part of any commit that involves adding, removing, or
 - **Rucio**: Distributed data management system
 - **ActiveMQ**: Message broker for agent communication
 - **PostgreSQL**: Database for monitoring and metadata storage
-- **supervisord**: Process management for Python agents
+- **supervisord**: Process management for Python agents
+
+## AI Development Guidelines
+
+### Directory Awareness (Critical for Claude)
+- **ALWAYS use $SWF_PARENT_DIR for navigation** - Never use relative paths like `../swf-monitor`
+- **ALWAYS run `pwd` before any file operations** - Claude frequently loses track of current directory
+- **NEVER assume your location** - explicitly verify with `pwd` at start of file access attempts
+- **Use absolute paths**: `cd $SWF_PARENT_DIR/swf-testbed` not `cd swf-testbed`
+- **For file operations**: Use `$SWF_PARENT_DIR/swf-monitor/.env` not `../swf-monitor/.env`
+- This is a recurring Claude issue that causes confusion and wasted time
+
+### Git Branch Management
+- **ALWAYS use `git push -u origin branch-name` on first push** - this is non-negotiable
+- After pushing, verify tracking with `git branch -vv` - should show `[origin/branch-name]`
+- If tracking is missing, fix immediately with: `git branch --set-upstream-to=origin/branch-name branch-name`
+- VS Code "Publish branch" button indicates missing tracking - this must be resolved
+
+### Commit and Push Workflow
+1. Create commits with descriptive messages including Claude Code attribution
+2. First push: `git push -u origin branch-name` (sets up tracking)
+3. Subsequent pushes: `git push` (tracking already established)
+4. Always verify tracking is set up correctly before proceeding
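The Directory Awareness guidance in the `CLAUDE.md` hunk above can be enforced from Python helper scripts as well as from the shell. The following is a minimal sketch, assuming `SWF_PARENT_DIR` is exported as the guidance describes; the helper name `repo_path` is hypothetical and is not part of the testbed.

```python
import os
from pathlib import Path


def repo_path(repo, *parts, env=None):
    """Build an absolute path under $SWF_PARENT_DIR (hypothetical helper).

    Raises instead of silently falling back to the current directory,
    so a missing variable is noticed immediately.
    """
    env = os.environ if env is None else env
    parent = env.get("SWF_PARENT_DIR")
    if not parent:
        raise RuntimeError("SWF_PARENT_DIR is not set")
    return Path(parent).joinpath(repo, *parts)
```

For example, `repo_path("swf-monitor", ".env")` resolves to `$SWF_PARENT_DIR/swf-monitor/.env` regardless of the current working directory.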
@@ -0,0 +1,61 @@
+# Gemini Guidance
+
+This file provides critical operational guidance for the Gemini agent working within the SWF testbed ecosystem.
+
+## **CRITICAL: Command Execution in the Virtual Environment**
+
+**1. Virtual Environment Directory:**
+The virtual environment for this project is named `.venv` (a hidden directory), not `venv`. Always use this correct path.
+
+**2. Execution Method:**
+To ensure commands run reliably, you **MUST** use the full, absolute path to the python executable within the `.venv` directory. This is the most robust method and avoids issues with shell environment persistence.
+
+   - **Python Executable Path:** `/Users/wenaus/github/swf-testbed/.venv/bin/python3`
+
+First, ensure all dependencies are installed by running the `install.sh` script once.
+```bash
+# Run this once to set up or update dependencies
+cd /Users/wenaus/github/swf-testbed && source ./install.sh
+```
+
+### Correct Procedure for Subsequent Commands:
+
+Directly execute commands using the venv's python.
+
+**Example: Running a Django migration in `swf-monitor`**
+```bash
+/Users/wenaus/github/swf-testbed/.venv/bin/python3 /Users/wenaus/github/swf-monitor/src/manage.py migrate
+```
+
+**Example: Running the Django development server**
+```bash
+/Users/wenaus/github/swf-testbed/.venv/bin/python3 /Users/wenaus/github/swf-monitor/src/manage.py runserver 8001 &
+```
+
+---
+
+## **CRITICAL: Checklist for Renaming Components**
+
+Renaming components has far-reaching side effects. A simple rename requires a systematic, multi-step check to ensure the application remains stable. The following checklist is based on recent failures and must be followed for any renaming task.
+
+**Example Scenario:** Renaming a view from `old_name` to `new_name`.
+
+1.  **Rename the View Function:**
+    *   In `views.py`, change `def old_name(request):` to `def new_name(request):`.
+
+2.  **Update URL Configuration (`urls.py`):**
+    *   **Update Import:** Change `from .views import old_name` to `from .views import new_name`.
+    *   **Update `path()`:** Change `path('...', old_name, ...)` to `path('...', new_name, ...)`.
+    *   **Update URL Name:** Change `name='old_name'` to `name='new_name'`. This is critical for template tags.
+    *   **Check URL Parameters:** Ensure any captured URL parameters (e.g., `<str:table_name>`) match the arguments in the new view function's signature.
+
+3.  **Update Templates (`*.html`):**
+    *   **Find and Replace `{% url %}` tags:** Search all templates for `{% url 'monitor_app:old_name' %}` and replace it with `{% url 'monitor_app:new_name' %}`.
+    *   **Rename Template File:** If the view renders a template with a corresponding name (e.g., `old_name.html`), rename the file to `new_name.html`.
+
+4.  **Global Code Search:**
+    *   Perform a project-wide search for the string `"old_name"` to find any other references in Python code, JavaScript, or comments.
+
+5.  **Verification:**
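+    *   **Run `manage.py check`:** Run this first. It loads the URL configuration, so it catches most `ImportError` and `NameError` issues without needing to run the server. It does not render templates, so a stale `{% url %}` name still raises `NoReverseMatch` only when the affected page is requested.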
+    *   **Restart and Test:** Only after the check passes, restart the server and manually test the affected pages.
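Step 4 of the renaming checklist in the `GEMINI.md` hunk above (the project-wide search) could be scripted along the following lines. This is a sketch under stated assumptions: the function name `find_references`, the set of file suffixes, and the decision to skip `.venv` are illustrative choices, not something the PR defines.

```python
import re
from pathlib import Path


def find_references(root, name, suffixes=(".py", ".html", ".js")):
    """Return (path, line number, line) for every whole-word match of `name`."""
    root = Path(root)
    # \b keeps `old_name` from matching inside `old_name_v2` or `my_old_name`.
    pattern = re.compile(rf"\b{re.escape(name)}\b")
    hits = []
    for path in sorted(root.rglob("*")):
        if not path.is_file() or path.suffix not in suffixes:
            continue
        # Assumption: installed packages under .venv are not of interest.
        if ".venv" in path.relative_to(root).parts:
            continue
        text = path.read_text(encoding="utf-8", errors="replace")
        for lineno, line in enumerate(text.splitlines(), start=1):
            if pattern.search(line):
                hits.append((str(path), lineno, line.strip()))
    return hits
```

An empty result from `find_references("src", "old_name")` suggests the rename is complete in the scanned file types. It does not replace `manage.py check` or manual testing.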
@@ -612,6 +612,11 @@ cd swf-testbed && git checkout -b infra/baseline-v1
 cd ../swf-monitor && git checkout -b infra/baseline-v1
 cd ../swf-common-lib && git checkout -b infra/baseline-v1
 
+# CRITICAL: Push branches to origin immediately (with -u to set up tracking) to make them available remotely
+cd swf-testbed && git push -u origin infra/baseline-v1
+cd ../swf-monitor && git push -u origin infra/baseline-v1
+cd ../swf-common-lib && git push -u origin infra/baseline-v1
+
 # Work freely across repositories
 # Commit frequently with descriptive messages
 # Let commit messages document the nature and progression of changes
@@ -631,18 +636,22 @@ For features that primarily affect a single repository:
 # Create feature branch in the primary repository
 git checkout -b feature/your-feature-name
 
+# CRITICAL: Push branch to origin immediately (with -u to set up tracking) to make it available remotely
+git push -u origin feature/your-feature-name
+
 # Work, commit, and create pull request as normal
 # If cross-repo changes are needed, coordinate with infrastructure approach
 ```
 
 #### Development Guidelines
 
 1. **Never push directly to main** - Always use branches and pull requests
-2. **Coordinate cross-repo changes** - Use matching branch names for related work
-3. **Test system integration** - Run `./run_all_tests.sh` before merging infrastructure changes
-4. **Maintain test coverage** - As you add functionality, extend the tests to ensure `./run_all_tests.sh` reliably evaluates system integrity
-5. **Document through commits** - Use descriptive commit messages to explain the progression of work
-6. **Maintain sibling structure** - Keep all `swf-*` repositories as siblings in the same parent directory
+2. **Push branches to origin immediately** - Always run `git push -u origin branch-name` right after creating a branch so that it is available on all development machines and tracks its remote
+3. **Coordinate cross-repo changes** - Use matching branch names for related work
+4. **Test system integration** - Run `./run_all_tests.sh` before merging infrastructure changes
+5. **Maintain test coverage** - As you add functionality, extend the tests to ensure `./run_all_tests.sh` reliably evaluates system integrity
+6. **Document through commits** - Use descriptive commit messages to explain the progression of work
+7. **Maintain sibling structure** - Keep all `swf-*` repositories as siblings in the same parent directory
 
 #### Pull Request Process
 
@@ -655,6 +664,13 @@ git checkout -b feature/your-feature-name
 This workflow ensures that the testbed remains stable and integrated while
 allowing for rapid infrastructure development and feature additions.
 
+### Example Agent Implementations
+
+For developers looking to create new agents or understand how to interact with
+the testbed's messaging and API services, standalone examples are provided in
+the `example_agents/` directory. These provide a clear, modern blueprint for
+agent development.
+
 ## Participants
 
 At present the testbed is a project of the Nuclear and Particle Physics
 
@@ -0,0 +1,47 @@
+# Summary of Current Issue: `example_daqsim_agent.py` Hangs on Connection
+
+## 1. High-Level Goal
+
+The objective is to create a set of standalone, example agents in the `swf-testbed/example_agents/` directory. These agents should serve as a blueprint for real agents, communicating with the `swf-monitor` application via its REST API for logging/heartbeats and with ActiveMQ for messaging.
+
+We began by implementing the `example_daqsim_agent.py` as the first example.
+
+## 2. The Problem
+
+The `example_daqsim_agent.py` script hangs indefinitely when executed. The script successfully starts, but it never proceeds past the ActiveMQ connection logic, and no errors are printed to the console.
+
+## 3. Current State of the Code
+
+### `swf-monitor` Repository (`infra/baseline-v10` branch)
+- A REST API has been established at `/api/v1/`.
+- Endpoints exist for `/logs/` and `/systemagents/`.
+- The `/systemagents/heartbeat/` endpoint was created to allow agents to register/update their status.
+- The API requires token authentication. A user `gemini` with token `39a564f5d3a2952813affa2146b9f4f6587e5273` was created for testing.
+- The API and its authentication have been successfully tested with `curl`.
+
+### `swf-testbed` Repository (`infra/baseline-v10` branch)
+- A new directory `example_agents/` has been created.
+- `base_agent.py`: Contains a reusable `ExampleAgent` class to handle common logic.
+- `example_daqsim_agent.py`: A simple agent that inherits from `ExampleAgent`. Its purpose is to connect to ActiveMQ and (eventually) produce messages.
+- `requirements.txt`: Contains `requests` and `stomp.py`.
+
+## 4. Debugging Steps Taken & Results
+
+The following steps were taken to diagnose the hanging issue:
+
+1.  **Initial Run (Background):** The script was run in the background. **Result:** The agent never appeared in the API's list of system agents. No logs were visible.
+2.  **Foreground Run:** The script was run in the foreground to observe errors. **Result:** The script hangs silently with no output and must be manually interrupted.
+3.  **Verify ActiveMQ Service:** Checked if the ActiveMQ Docker container was running using `docker ps`. **Result:** The container is running correctly.
+4.  **Verify Port Accessibility:** Used `nc -zv localhost 61616` to check if the broker port was open. **Result:** The port is open and the TCP connection succeeds, ruling out firewall or port mapping issues for that port.
+5.  **Add STOMP Heartbeats:** Modified `base_agent.py` to include `heartbeats=(10000, 10000)` in the `stomp.Connection` constructor, as a lack of heartbeating is a common cause of hangs. **Result:** The script still hangs.
+
+## 5. Current Hypothesis
+
+- The issue is not with the `swf-monitor` API or basic network connectivity.
+- The problem lies specifically within the `stomp.py` connection logic in `base_agent.py`.
+- The hang occurs at the `self.conn.connect(self.mq_user, self.mq_password, wait=True)` line.
+- Since heartbeats did not solve it, the issue is likely a more subtle STOMP protocol-level problem (e.g., a version mismatch, an issue with vhost, or another parameter disagreement between the client and the Artemis broker) that is causing the handshake to never complete.
+
+## 6. Last Proposed Action
+
+My last action before being stopped was to propose enabling the `stomp.py` library's internal debug logging to print the raw STOMP frames being sent and received. This would provide a low-level view of the handshake and reveal exactly where it is failing.
@@ -4,7 +4,8 @@ services:
     image: apache/activemq-artemis:latest-alpine
     ports:
       - "8161:8161" # Web console
-      - "61616:61616" # Broker port
+      - "61616:61616" # Core protocol port
+      - "61613:61613" # STOMP protocol port
     environment:
       - ARTEMIS_USER=admin
       - ARTEMIS_PASSWORD=admin
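
One way to test the handshake hypothesis from `current_issue.md` without `stomp.py` is to send a raw STOMP `CONNECT` frame and see what the broker answers. The sketch below uses only the standard library. The function names are hypothetical, and the port (`61613`) and credentials (`admin`/`admin`) in the usage note are taken from the `docker-compose.yml` hunk above. It assumes a broker that replies in a single read, and it is a diagnostic aid rather than a replacement for the agent's connection logic.

```python
import socket


def build_connect_frame(login, passcode, host="localhost"):
    """Build a STOMP CONNECT frame: command, headers, blank line, NUL."""
    lines = [
        "CONNECT",
        "accept-version:1.0,1.1,1.2",
        f"host:{host}",
        f"login:{login}",
        f"passcode:{passcode}",
        "",  # ends the last header line
        "",  # blank line separating headers from the (empty) body
    ]
    return "\n".join(lines).encode("utf-8") + b"\x00"


def parse_command(raw):
    """Return the command line of a STOMP frame, e.g. CONNECTED or ERROR."""
    text = raw.decode("utf-8", errors="replace").lstrip("\r\n")
    return text.split("\n", 1)[0].strip()


def probe_stomp(host, port, login, passcode, timeout=5.0):
    """Send CONNECT and report the reply command, TIMEOUT, or CLOSED."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(build_connect_frame(login, passcode, host))
        try:
            reply = sock.recv(4096)
        except socket.timeout:
            return "TIMEOUT"  # port is open but nothing answered the frame
    return parse_command(reply) or "CLOSED"
```

Calling `probe_stomp("localhost", 61613, "admin", "admin")` and then the same with port `61616` gives a quick comparison. A `CONNECTED` reply means the STOMP handshake completes on that port, while `TIMEOUT` would be consistent with the silent hang described in the issue summary.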