markmhendrickson
diff --git a/.claude/rules/neotoma_cli.mdc b/.claude/rules/neotoma_cli.mdc.backup.20260302-114019
rename from .claude/rules/neotoma_cli.mdc
rename to .claude/rules/neotoma_cli.mdc.backup.20260302-114019
diff --git a/.codex/neotoma_cli.md b/.codex/neotoma_cli.md.backup.20260302-114019
rename from .codex/neotoma_cli.md
rename to .codex/neotoma_cli.md.backup.20260302-114019
diff --git a/.cursor/rules/neotoma_cli.mdc b/.cursor/rules/neotoma_cli.mdc.backup.20260302-114019
rename from .cursor/rules/neotoma_cli.mdc
rename to .cursor/rules/neotoma_cli.mdc.backup.20260302-114019
diff --git a/.env.example b/.env.example
Lines changed: 3 additions & 3 deletions
diff --git a/Dockerfile b/Dockerfile
Lines changed: 2 additions & 2 deletions
diff --git a/README.md b/README.md
Lines changed: 116 additions & 10 deletions
diff --git a/docs/api/rest_api.md b/docs/api/rest_api.md
Lines changed: 2 additions & 2 deletions
diff --git a/docs/developer/cli_agent_instructions.md b/docs/developer/cli_agent_instructions.md
Lines changed: 1 addition & 1 deletion
@@ -13,7 +13,7 @@
 PORT=3000
 
 # HTTP Actions server port (or set NEOTOMA_HTTP_PORT at runtime)
-HTTP_PORT=8080
+HTTP_PORT=3080
 
 # Frontend URL for CORS
 # NEOTOMA_FRONTEND_URL=http://localhost:5195
@@ -33,7 +33,7 @@ MCP_TOKEN_ENCRYPTION_KEY=your-encryption-key-here
 # Host URL settings (optional)
 # NEOTOMA_HOST_URL=https://your-tunnel.example.com
 # HOST_URL=https://your-tunnel.example.com
-# API_BASE_URL=http://localhost:8080
+# API_BASE_URL=http://localhost:3080
 
 # Optional shared bearer token
 # NEOTOMA_BEARER_TOKEN=your-shared-secret
@@ -55,7 +55,7 @@ OPENAI_API_KEY=sk-your-api-key-here
 # =============================================================================
 # Frontend Configuration (Vite)
 # =============================================================================
-VITE_API_BASE_URL=http://localhost:8080
+VITE_API_BASE_URL=http://localhost:3080
 # VITE_WS_PORT=8280
 # VITE_LOCAL_MCP_URL=ws://127.0.0.1:8280/mcp
 # VITE_MCP_URL=ws://your-mcp-server.com/mcp
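
Reviewer note: the `.env.example` comment says the HTTP port can also be set through `NEOTOMA_HTTP_PORT` at runtime. A minimal sketch of how a wrapper script might resolve the effective port after this change is shown below. The function name `resolve_http_port` is hypothetical, and the precedence (runtime override first, then `HTTP_PORT`, then the new default `3080`) is an assumption; the diff does not show how the server itself resolves it.

```bash
#!/usr/bin/env bash
# Hypothetical helper, not part of the Neotoma codebase.
# Assumed precedence: NEOTOMA_HTTP_PORT, then HTTP_PORT, then the default 3080.
resolve_http_port() {
  echo "${NEOTOMA_HTTP_PORT:-${HTTP_PORT:-3080}}"
}
```

For example, `curl "http://localhost:$(resolve_http_port)/..."` would follow whichever variable is set instead of hard-coding the old `8080`.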
 
@@ -10,8 +10,8 @@ RUN npm run build:server
 FROM node:20-alpine AS runtime
 WORKDIR /app
 ENV NODE_ENV=production
-ENV HTTP_PORT=8080
-EXPOSE 8080
+ENV HTTP_PORT=3080
+EXPOSE 3080
 
 COPY package*.json ./
 RUN npm ci --omit=dev
 
@@ -2,21 +2,27 @@
 
 ![Neotoma banner](https://raw.githubusercontent.com/markmhendrickson/neotoma/main/docs/assets/neotoma_banner.png)
 
-[Neotoma](https://neotoma.io) is a **truth layer**: an explicit, inspectable, replayable substrate for personal data that AI agents read and write. When agents act, personal data becomes state. Neotoma treats that state the way production systems do: contract-first, deterministic, immutable, and queryable.
+_Give your agents memory you can inspect, replay, and trust._
 
-**Why it exists:** The thing that keeps breaking in agentic systems is not intelligence but trust. Memory changes implicitly, context drifts, and you cannot see what changed or replay it. Neotoma provides the missing primitive: user-controlled, deterministic, inspectable memory with full provenance, so you can trust agents with real, ongoing state.
+For a guided overview, see [neotoma.io](https://neotoma.io).
+
+Agent memory is forgetful. What keeps breaking automation is trust, not intelligence: memory changes implicitly, context drifts, and you cannot see what changed or replay it. When agents act, personal data becomes state. The missing primitive is a layer of explicit, inspectable, replayable state.
+
+[Neotoma](https://neotoma.io) is that layer: open-source, privacy-protective, and user-controlled. It is contract-first and deterministic (same input, same output), with immutable, queryable state in one graph for documents you upload and data agents write.
+
+It works with Cursor, Claude, Codex, and other MCP-capable tools, with CLI fallback when MCP is unavailable. Install with npm, then configure MCP for your editor or use the CLI directly.
 
 For the full rationale, see [Building a truth layer for persistent agent memory](https://markmhendrickson.com/posts/truth-layer-agent-memory).
 
 ---
 
 ## What Neotoma Is
 
-Neotoma is a **Truth Layer**, not an app, agent, or workflow engine. It is the lowest-level canonical source of truth for personal data (documents and agent-created data), exposed to AI tools via Model Context Protocol (MCP).
+Neotoma is a _truth layer_, not an app, agent, or workflow engine. It is the lowest-level canonical source of truth for personal data (documents and agent-created data), exposed to AI tools via Model Context Protocol (MCP).
 
-**In practice:** You upload documents (PDFs, images, receipts, contracts) or share information during agent conversations. You don't have to structure it yourself: agents structure and store it via Neotoma when you provide unstructured or semi-structured content. Neotoma resolves entities across all sources, builds timelines from date fields, and keeps every fact traceable to its source. ChatGPT, Claude, and Cursor can read this memory, write new structured data, correct mistakes, and trigger reinterpretation. One graph connects people, companies, events, and relationships across all your data.
+You upload documents (PDFs, images, receipts, contracts) or share information during agent conversations. You don't have to structure it yourself: agents structure and store it via Neotoma when you provide unstructured or semi-structured content. Neotoma resolves entities across all sources, builds timelines from date fields, and keeps every fact traceable to its source. ChatGPT, Claude, and Cursor can read this memory, write new structured data, correct mistakes, and trigger reinterpretation. One graph connects people, companies, events, and relationships across all your data.
 
-**What it is not:** Not a note-taking app or "second brain." Not provider-controlled ChatGPT Memory or Claude Projects (those are conversation-only and platform-locked; Neotoma is structured personal data memory with entity resolution and timelines, cross-platform via MCP). Not a vector store or RAG layer. Not an autonomous agent. It is the memory layer agents read and write; you control what goes in and what stays.
+It's not a note-taking app or "second brain." Not provider-controlled ChatGPT Memory or Claude Projects (those are conversation-only and platform-locked; Neotoma is structured personal data memory with entity resolution and timelines, cross-platform via MCP). Not a vector store or RAG layer. Not an autonomous agent. It is the memory layer agents read and write; you control what goes in and what stays.
 
 ---
 
@@ -63,12 +69,29 @@ graph LR
 
 ---
 
+## Core Terminology
+
+| Term            | Definition                                                                                    |
+| --------------- | --------------------------------------------------------------------------------------------- |
+| Truth Layer     | Deterministic, immutable structured memory substrate that tools and agents read and write.    |
+| Source          | Raw data (file, text, URL, or structured JSON) stored with content-addressed deduplication.   |
+| Observation     | Granular fact extracted from a source; reducers merge observations into entity snapshots.     |
+| Entity          | Canonical representation of a person, company, task, or other object with a deterministic ID. |
+| Entity snapshot | Current truth for an entity computed from all related observations.                           |
+| Provenance      | Origin tracking (source, timestamp, operation) so each value is traceable.                    |
+| Memory graph    | Graph of sources, observations, entities, events, and typed relationships.                    |
+
+For the full glossary, see [Core terminology](https://neotoma.io/#terminology).
+
+---
+
 ## Who It's For
 
-- **AI-native operators** who rely on ChatGPT, Claude, or Cursor and need persistent memory across sessions.
-- **Knowledge workers** (researchers, analysts, consultants, legal) who need cross-data reasoning and entity unification across contracts, invoices, and agent-created data.
-- **Small teams (2–20)** who want a shared truth layer with row-level security.
-- **Builders of agentic systems** who need a deterministic memory and provenance layer for agents and toolchains (e.g. agent frameworks, orchestration pipelines, observability stacks).
+| Who                                   | What they need                                                           | Example data to remember                                                |
+| ------------------------------------- | ------------------------------------------------------------------------ | ----------------------------------------------------------------------- |
+| AI-native individual operators        | Memory that follows across daily tools and sessions                      | Tasks, preferences, notes, recurring reminders, contacts, deadlines     |
+| Knowledge workers with scattered data | Durable context across documents and sessions, with evidence and lineage | Source documents, extracted entities, citations, key quotes, timelines  |
+| Builders of agentic systems           | Structured memory agents can read and write with provenance              | Session histories, accumulated facts, decisions, runbooks, tool configs |
 
 **Why Neotoma:** One memory graph across documents and agent-created data; agents remember context without re-explanation; full provenance and audit trail; works with any MCP-compatible tool; privacy-first and user-controlled. The same substrate serves both human-in-the-loop use and agent frameworks or toolchains that need deterministic memory and provenance.
 
@@ -161,7 +184,7 @@ Full release index: [docs/releases/](docs/releases/).
 
 The **primary entrypoint** for all documentation is the index and navigation guide. All contributors and AI assistants working on the repo should load it first.
 
-- **[Documentation index and navigation](docs/context/index_rules.mdc)** – Map of the docs system, reading order by change type, dependency graph, and quick-reference answers. Start here when contributing or navigating the repo.
+- **[Documentation index and navigation](#documentation-index)** – Map of the docs system, reading order by change type, dependency graph, and quick-reference answers. Start here when contributing or navigating the repo.
 
 **Foundational (load first):**
 
@@ -213,6 +236,16 @@ neotoma init
 neotoma api start
 ```
 
+The CLI defaults to API transport. Use `--offline` only when you explicitly want in-process local transport:
+
+```bash
+# Default (API transport; no implicit local fallback)
+neotoma entities list --type task
+
+# Explicit local inline mode (no API server required)
+neotoma --offline entities list --type task
+```
+
 After installation, configure MCP for your AI tool:
 
 ```bash
@@ -232,6 +265,56 @@ npm test
 
 **Prerequisites:** Node.js v18.x or v20.x (LTS), npm v9+. Developer preview uses **local storage only**. For local storage, **no `.env` is required**; the app uses defaults (`./data`, `./data/neotoma.db`, `./data/sources`). Optional overrides: [Getting started](docs/developer/getting_started.md).
 
+### Option 3: Run with Docker
+
+```bash
+git clone https://github.com/markmhendrickson/neotoma.git
+cd neotoma
+docker build -t neotoma .
+
+docker run -d \
+  --name neotoma \
+  -p 3080:3080 \
+  -v neotoma-data:/app/data \
+  neotoma
+
+docker exec neotoma neotoma init --yes --data-dir /app/data
+```
+
+Connect MCP from Docker:
+
+```json
+{
+  "mcpServers": {
+    "neotoma": {
+      "command": "docker",
+      "args": ["exec", "-i", "neotoma", "node", "dist/index.js"]
+    }
+  }
+}
+```
+
+Use the CLI from Docker:
+
+```bash
+docker exec neotoma neotoma store \
+  --json='[{"entity_type":"task","title":"Submit expense report","status":"open"}]'
+
+docker exec neotoma neotoma entities list --type task
+```
+
+---
+
+## Get started
+
+After installation:
+
+1. Run `neotoma init` and configure your MCP client.
+2. In a conversation, tell your assistant: "Remind me to review my subscription Friday."
+3. In the same conversation, ask it to list your open tasks.
+
+This is a quick end-to-end check that memory is being stored and can be retrieved. For full setup steps, see [Getting started](docs/developer/getting_started.md).
+
 ---
 
 ## Development
@@ -287,6 +370,20 @@ To use the Neotoma MCP server from another workspace, see [Cursor MCP setup](doc
 
 ---
 
+## Agent Instructions (Behavior Summary)
+
+Neotoma-compatible agents follow a consistent behavior contract across MCP and CLI:
+
+- **Store first:** Persist the conversation turn before responding.
+- **Bounded retrieval:** Retrieve likely related entities before storing new ones.
+- **Entity extraction:** Extract and store relevant people, tasks, events, places, and relationships from user input.
+- **Task creation:** Create tasks when users express intent, obligations, deadlines, or reminders.
+- **External data safety:** Store relevant entities from external tool results before responding.
+
+Full instructions: [MCP instructions](docs/developer/mcp/instructions.md) and [CLI agent instructions](docs/developer/cli_agent_instructions.md).
+
+---
+
 ## Core Principles
 
 1. **Deterministic** – Same input → same output. Hash-based IDs, no randomness in core components.
@@ -310,6 +407,15 @@ If remote backend support is needed later, recover it from git history.
 
 ---
 
+## Related posts
+
+- [Neotoma developer release](https://markmhendrickson.com/posts/neotoma-developer-release)
+- [Building a truth layer for persistent agent memory](https://markmhendrickson.com/posts/truth-layer-agent-memory)
+- [Agent memory has a truth problem](https://markmhendrickson.com/posts/agent-memory-truth-problem)
+- [Why agent memory needs more than RAG](https://markmhendrickson.com/posts/why-agent-memory-needs-more-than-rag)
+
+---
+
 ## Contributing
 
 Neotoma is in active development. For questions or collaboration, open an issue or discussion. The work is in the open: [github.com/markmhendrickson/neotoma](https://github.com/markmhendrickson/neotoma). See [CONTRIBUTING.md](CONTRIBUTING.md) and [SECURITY.md](SECURITY.md).
 
@@ -21,7 +21,7 @@ This document covers:
 **Development:**
 
 ```
-http://localhost:8080
+http://localhost:3080
 ```
 
 **Production:**
@@ -265,7 +265,7 @@ Upload a file and optionally create/update a record.
   **Request Example:**
 
 ```bash
-curl -X POST http://localhost:8080/upload_file \
+curl -X POST http://localhost:3080/upload_file \
   -H "Authorization: Bearer $TOKEN" \
   -F "file=@document.pdf" \
   -F "properties={\"title\":\"My Document\"}"
 
@@ -24,7 +24,7 @@ When a Neotoma CLI session starts (dev or prod), the applied rule files (e.g. `.
 
 ## CLI startup protocol (use-existing)
 
-CLI uses connect-only startup for interactive sessions. It does not auto-start servers on session start. On no-args startup, it discovers running local API instances from session ports, defaults (`8080`, `8180`), remembered ports, and optional configured ports. If multiple instances are healthy, it applies `--env` as a preference and then prompts for explicit selection.
+CLI uses connect-only startup for interactive sessions. It does not auto-start servers on session start. On no-args startup, it discovers running local API instances from session ports, defaults (`3080`, `3180`), remembered ports, and optional configured ports. If multiple instances are healthy, it applies `--env` as a preference and then prompts for explicit selection.
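
Reviewer note: the discovery step described above draws candidate ports from several sources. The sketch below shows one way such a candidate list could be assembled, de-duplicated, and ordered. The function name `candidate_ports` is hypothetical, and treating the sources as a strict priority order (session, defaults, remembered, configured) is an assumption taken from the order of the prose, not from the CLI source. Health probing of each candidate is left out because the diff does not show which endpoint the CLI checks.

```bash
#!/usr/bin/env bash
# Hypothetical sketch, not the Neotoma CLI implementation.
# Arguments are space-separated port lists: session, remembered, configured.
candidate_ports() {
  local session="$1" remembered="$2" configured="$3"
  local defaults="3080 3180"   # new defaults from this change
  local seen=" " port
  for port in $session $defaults $remembered $configured; do
    case "$seen" in
      *" $port "*) ;;                          # already listed, skip
      *) seen="$seen$port "; echo "$port" ;;   # first occurrence, emit
    esac
  done
}
```

Calling `candidate_ports "3180 4000" "3080" ""` lists each port once, in the assumed priority order, one per line.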
 
 ```bash
 neotoma store --json='[...]'