fixes

sarahxsanders · sarahxsanders · commit c4c22172a1c3 · 2026-03-03T13:53:17.000-05:00
diff --git a/docs/monorepo-skill.md b/docs/monorepo-skill.md
@@ -0,0 +1,166 @@
+# Wizard monorepo: the final boss of skills
+
+The wizard now detects and instruments monorepo projects. It's probably the hardest skill I've worked on to date, because no two monorepos are alike. Here's what I built, what broke, and what still needs eyes.
+
+## What makes monorepos hard
+
+Three things:
+
+1. Detection is multi-dimensional
+2. Not everything detected is an app
+3. Concurrency is hard
+
+Let's take them one at a time.
+
+## Detection
+
+Single-project detection is simple: scan files, match framework, go. Monorepos add two layers.
+
+**Layer 1: What workspace manager is this thing using?** pnpm workspaces? npm? Yarn? Nx? Lerna? Each one declares member packages differently. pnpm uses `pnpm-workspace.yaml`. npm/yarn use a `workspaces` field in `package.json`. Nx uses `project.json` files scattered across subdirectories. And some monorepos don't use any formal config at all — just a bunch of directories with stuff in them.
+
+The detection order in `workspace-detection.ts` is: pnpm → npm/yarn → Lerna → Nx → heuristic fallback. Turborepo isn't its own detector — it's a label upgrade. When `turbo.json` exists alongside a pnpm or npm/yarn workspace, we relabel the type as `'turbo'` so downstream code (like context-mill commandments) can act on it.
+
+```ts
+const formal =
+  (await detectPnpmWorkspace(rootDir)) ??
+  (await detectNpmOrYarnWorkspace(rootDir)) ??
+  (await detectLernaWorkspace(rootDir)) ??
+  (await detectNxWorkspace(rootDir));
+
+if (formal) {
+  return supplementWithPolyglotProjects(formal);
+}
+return detectHeuristic(rootDir);
+```
+
+**Layer 2: Monorepos are polyglot.** JS workspace configs only know about JS packages. Real monorepos have Python services, PHP backends, mobile apps sitting right alongside. These live outside the workspace config entirely.
+
+So I built a two-pass system. First, detect any formal workspace config. Then scan for non-JS project indicators the workspace config missed — `pyproject.toml`, `manage.py`, `composer.json`, `build.gradle`, `Package.swift`, `.xcodeproj`. Merge them into the member list. The supplement only scans depth 1-2 and deduplicates against existing members.
+
+## Filtering non-apps
+
+In a single project, if we detect Python, it's almost certainly an app the user wants instrumented. In monorepos, half the detected projects are libraries, build tools, or Playwright test suites.
+
+`app-detection.ts` handles this. Framework-specific integrations (Django, FastAPI, Next.js, etc.) always pass — their detection signals are inherently app signals. But language-level fallbacks (generic Python, generic JS) need to prove they're actually apps:
+
+- **Python**: needs entry point files (`main.py`, `app.py`, `wsgi.py`, etc.) OR script definitions in `pyproject.toml`
+- **JS**: needs HTML entry points, server entry points, OR `start`/`dev`/`serve` scripts with 8+ dependencies (tiny packages with a start script are usually build watchers)
+
+Before any of that, there's a dev-tool blocklist. If your dependencies include `@storybook/`, `@playwright/`, or `cypress`, you're not an app. Sorry.
+
+## Concurrency
+
+Single projects: one agent, linear flow, direct terminal output. Monorepos need N agents running simultaneously.
+
+Two problems surfaced immediately.
+
+**Terminal output becomes garbage.** Four agents writing to stdout at once means interleaved log messages and fighting spinners. The fix: a Proxy-based output silencer in `clack.ts`. The entire clack module is wrapped in a `Proxy` that checks an `AsyncLocalStorage` context. In silent mode: output functions become no-ops, interactive prompts throw errors (safety net — they should never run in concurrent context).
+
+```ts
+export function withSilentOutput<T>(fn: () => Promise<T>): Promise<T> {
+  return silentStore.run({ silent: true }, fn);
+}
+```
+
+But users still need to see progress. The progress spinner is captured *before* entering silent mode. Its `.message()` calls still write to real stdout because the spinner object references real clack, not the proxy. Live updates: `2/4 done (latest: apps/web)`.
+
+**Log files are useless.** All agents writing to the same `/tmp/posthog-wizard.log` means you can't debug individual failures. Fix: another `AsyncLocalStorage` in `debug.ts` for per-project log scoping. Each agent runs inside `withLogFile('/tmp/posthog-wizard-{slug}.log', ...)`.
+
+## Architecture
+
+### Three-phase execution
+
+**Phase 1 — Preparation (sequential).** For each project: version check, package detection, `gatherContext()`. Sequential because some frameworks prompt the user interactively.
+
+**Phase 2 — Instrumentation (concurrent).** All agents launched via `Promise.allSettled`. Each wrapped in `withLogFile()` + `withSilentOutput()`. Prompt fencing tells each agent to stay within its project directory. `Promise.allSettled` gives us two things: deterministic result ordering AND error isolation — one project failing doesn't kill the others.
+
+**Phase 3 — Post-flight (sequential).** MCP client installation (once). Env var upload to hosting (once, merged across all successful projects). Combined summary.
+
+### Shared setup
+
+OAuth, region selection, git checks, AI consent — all run once via `runSharedSetup()`. Each agent receives the result via `mode.sharedSetup`.
+
+### Prompt fencing
+
+In concurrent mode, each agent gets a scope-fencing instruction:
+
+```ts
+if (mode?.concurrentFence) {
+  integrationPrompt += `IMPORTANT: This project is being set up as part of a monorepo...
+  You MUST only modify files within ${options.installDir}...`;
+}
+```
+
+The actual prompt is longer — it also includes guidance about `set_env_values`/`check_env_keys` relative paths and not touching sibling packages.
+
+### The agent runner decomposition
+
+The original agent runner was one big function. To support both single-project and monorepo runs without duplicating code, I broke it into composable phases:
+
+- **`runSharedSetup(options, docsUrl)`** — AI consent, Anthropic status check, region, git check, OAuth. Returns `Promise<SharedSetupData>`.
+- **`runPreflight(config, options)`** — Version check, TS detection, package.json detection, framework version, `gatherContext()`, analytics tags. Returns `Promise<PreflightData | null>`.
+- **`runAgentWizard(config, options, mode?)`** — The main agent runner. `mode` is typed as `AgentWizardMode & { preflight?: PreflightData }`. When called without `mode` (single-project), it calls `runSharedSetup` and `runPreflight` inline — same flow as main. When called with `mode` (monorepo), it skips the phases already run externally.
+- **`handleAgentErrors()`** — Extracted error handling. Changed from `process.exit(1)` to `throw DisplayedError`, so the monorepo orchestrator can continue with other projects when one fails. `run.ts` checks `instanceof DisplayedError` to avoid double-printing.
+
+**When `runAgentWizard(config, options)` is called with no third argument, behavior is functionally identical to main. The mode parameter is additive.**
+
+### The monorepo flow
+
+`monorepo-flow.ts` is purely new orchestration code. It does NOT contain any code extracted from the original wizard. It imports and composes the exported functions:
+
+```ts
+const sharedSetup = await runSharedSetup(options);
+const preflight = await runPreflight(config, projectOptions);
+await runAgentWizard(config, projectOptions, {
+  sharedSetup,
+  preflight,
+  skipPostAgent: true,
+  concurrentFence: true,
+  // also: onStatus callback, additionalContext based on workspaceType
+});
+```
+
+One area of intentional duplication: the post-flight env upload. The monorepo version merges env vars across all successful sub-projects into a single batch. Fundamentally different from the per-project version, so it can't just delegate.
+
+## Changes from main
+
+**Error handling.** Agent errors now `throw DisplayedError` instead of `process.exit(1)`. The `run.ts` catch block checks `instanceof DisplayedError` to skip double-printing.
+
+**Execution order in single-project mode.** OAuth moved earlier (before framework detection). This is because `runSharedSetup()` groups all "shared" steps together. No step depends on the old ordering. Harmless.
+
+**Abort message.** `runSharedSetup()` says "set up PostHog manually" instead of the framework-specific name. Shared setup runs before any framework config is known. Minor UX regression in single-project mode.
+
+**Detection logic changes:**
+- `javascript_web` now requires a browser signal (HTML entry point or `"browser"` field in `package.json`). Intentional — `posthog-js` crashes without `window`/`document`.
+- `javascript_node` now excludes projects with known framework packages (Next.js, Nuxt, etc.). Prevents the generic catch-all from claiming framework-specific projects.
+- `Integration` enum reordered: `javascript_web` before `javascriptNode` (more-specific before catch-all).
+
+**`resolveEnvPath` prefix stripping.** New logic strips redundant path prefixes when the agent passes a relative `filePath` that duplicates the tail of `workingDirectory`. E.g., if `workingDirectory="/ws/services/mcp"` and the agent passes `filePath="services/mcp/.env"`, it strips to `.env`. The traversal guard still runs after.
+
+## What works well
+
+1. **Workspace detection is solid.** Tests cover all workspace types, polyglot supplement, deduplication, and exclusion patterns. The two-pass approach handles real-world "pnpm for JS + Python services alongside."
+
+2. **App filtering catches real problems.** In PostHog's own monorepo, without filtering we'd try to instrument Storybook, Playwright tests, and internal build tools.
+
+3. **Concurrent execution with proper isolation.** `Promise.allSettled` gives deterministic results AND error isolation. Per-project logs let you debug individual failures. The silent output proxy prevents terminal garbage.
+
+4. **The `FrameworkConfig` abstraction.** Monorepo support "just works" for all 20+ frameworks with zero per-framework monorepo code. Each framework's existing `detect()`, `gatherContext()`, and agent prompt carry over unchanged.
+
+5. **Original wizard code is untouched.** No single-project logic was moved into monorepo-only files. The decomposition is purely additive.
+
+## What needs eyes
+
+1. **Heuristic detection.** When there's no formal workspace config, we look for 2+ unique directories with project indicators at depth 1-2. One directory with both `package.json` and `pyproject.toml` counts as one, not two. Could false-positive on repos with `frontend/` and `backend/` that aren't really a monorepo. The 2-dir threshold is conservative but may need tuning.
+
+2. **App filtering thresholds.** `MIN_JS_APP_DEPENDENCY_COUNT = 8` and the dev-tool blocklist are heuristic. We should monitor `monorepo_projects_detected` vs `monorepo_projects_selected` analytics to see if we're filtering too aggressively or not enough.
+
+3. **Benchmark mode fallback.** Concurrent execution is disabled in benchmark mode (the benchmark middleware mutates global log paths). Sequential fallback works, but it currently skips `runPostFlight()` — no env upload or MCP install happens. This might be intentional for benchmarking but worth confirming.
+
+4. **Detection timeout.** Each framework's `detect()` gets 5 seconds (`DETECTION_TIMEOUT_MS`). Per-member-per-framework. A monorepo with 10 members x 20 frameworks = up to 200 detection calls. Most resolve in <100ms, but worth watching.
+
+5. **Polyglot supplement depth.** Currently scans depth 1-2 only. Deeply nested non-JS projects (depth 3+) won't be discovered. Intentional to avoid scanning the whole tree, but worth documenting.
+
+6. **`resolveEnvPath` prefix stripping.** This is heuristic path manipulation — it matches the longest suffix of `workingDirectory` against the prefix of the relative path. Edge cases could surprise us.
+
+7. **Agent adherence verification.** The `1.2-revise.md` verification checklist from our audit plan hasn't been implemented yet in context-mill. Currently it only has basic "check for errors, run linters." The full 5-point checklist (version pinning, server component conversion, redundant pageviews, env vars) still needs to be added.
diff --git a/src/javascript-web/javascript-web-wizard-agent.ts b/src/javascript-web/javascript-web-wizard-agent.ts
@@ -67,7 +67,13 @@ export const JAVASCRIPT_WEB_AGENT_CONFIG: FrameworkConfig<JavaScriptContext> = {
 
       const hasBrowserField = 'browser' in packageJson;
 
-      return hasHtmlEntry || hasBrowserField;
+      // Known browser frameworks without dedicated integrations
+      const BROWSER_FRAMEWORK_PACKAGES = ['gatsby'];
+      const hasBrowserFramework = BROWSER_FRAMEWORK_PACKAGES.some((pkg) =>
+        hasPackageInstalled(pkg, packageJson),
+      );
+
+      return hasHtmlEntry || hasBrowserField || hasBrowserFramework;
     },
   },
 
diff --git a/src/javascript-web/utils.ts b/src/javascript-web/utils.ts
@@ -21,6 +21,8 @@ export const FRAMEWORK_PACKAGES = [
   'nuxt',
   'vue',
   'react-router',
+  '@remix-run/react',
+  '@remix-run/node',
   '@tanstack/react-start',
   '@tanstack/react-router',
   'react-native',
diff --git a/src/lib/framework-config.ts b/src/lib/framework-config.ts
@@ -74,7 +74,9 @@ export interface FrameworkDetection {
   getInstalledVersion?: (options: WizardOptions) => Promise<string | undefined>;
 
   /** Detect whether this framework is present in the project. */
-  detect: (options: Pick<WizardOptions, 'installDir'>) => Promise<boolean>;
+  detect: (
+    options: Pick<WizardOptions, 'installDir' | 'workspaceRootDir'>,
+  ) => Promise<boolean>;
 
   /** Detect the project's package manager(s). Used by the in-process MCP tool. */
   detectPackageManager: PackageManagerDetector;
diff --git a/src/monorepo/monorepo-flow.ts b/src/monorepo/monorepo-flow.ts
@@ -71,7 +71,10 @@ export async function detectWorkspaceProjects(
 
   const results = await Promise.all(
     workspace.memberDirs.map(async (memberDir) => {
-      const integration = await detectIntegration({ installDir: memberDir });
+      const integration = await detectIntegration({
+        installDir: memberDir,
+        workspaceRootDir: workspace.rootDir,
+      });
       if (!integration) return null;
 
       // Filter out library packages for generic language-level integrations
@@ -161,6 +164,7 @@ export async function runMonorepoFlow(
     const projectOptions: WizardOptions = {
       ...options,
       installDir: project.dir,
+      workspaceRootDir: options.installDir,
       cloudRegion: sharedSetup.cloudRegion,
     };
 
@@ -344,6 +348,7 @@ async function runMonorepoSequential(
     const projectOptions: WizardOptions = {
       ...options,
       installDir: project.dir,
+      workspaceRootDir: options.installDir,
       cloudRegion: sharedSetup.cloudRegion,
     };
 
diff --git a/src/react-router/react-router-wizard-agent.ts b/src/react-router/react-router-wizard-agent.ts
@@ -45,9 +45,12 @@ export const REACT_ROUTER_AGENT_CONFIG: FrameworkConfig<ReactRouterContext> = {
     },
     detect: async (options) => {
       const packageJson = await tryGetPackageJson(options);
-      return packageJson
-        ? hasPackageInstalled('react-router', packageJson)
-        : false;
+      if (!packageJson) return false;
+      return (
+        hasPackageInstalled('react-router', packageJson) ||
+        hasPackageInstalled('@remix-run/react', packageJson) ||
+        hasPackageInstalled('@remix-run/node', packageJson)
+      );
     },
     detectPackageManager: detectNodePackageManagers,
   },
diff --git a/src/run.ts b/src/run.ts
@@ -145,7 +145,7 @@ export async function runWizard(argv: Args) {
 const DETECTION_TIMEOUT_MS = 5000;
 
 export async function detectIntegration(
-  options: Pick<WizardOptions, 'installDir'>,
+  options: Pick<WizardOptions, 'installDir' | 'workspaceRootDir'>,
 ): Promise<Integration | undefined> {
   for (const integration of Object.values(Integration)) {
     const config = FRAMEWORK_REGISTRY[integration];
diff --git a/src/utils/clack-utils.ts b/src/utils/clack-utils.ts
@@ -477,13 +477,45 @@ export async function getPackageDotJson({
  */
 export async function tryGetPackageJson({
   installDir,
-}: Pick<WizardOptions, 'installDir'>): Promise<PackageDotJson | null> {
+  workspaceRootDir,
+}: Pick<
+  WizardOptions,
+  'installDir' | 'workspaceRootDir'
+>): Promise<PackageDotJson | null> {
   try {
     const packageJsonFileContents = await fs.promises.readFile(
       join(installDir, 'package.json'),
       'utf8',
     );
-    return JSON.parse(packageJsonFileContents) as PackageDotJson;
+    const localPkg = JSON.parse(packageJsonFileContents) as PackageDotJson;
+
+    // In Nx monorepos, all deps are hoisted to the root package.json.
+    // Per-project package.json files are stubs with zero deps. When a
+    // workspace root is available and the local file has no deps, merge
+    // the root's dependencies so framework detectors can match.
+    if (workspaceRootDir && workspaceRootDir !== installDir) {
+      const localDepCount =
+        Object.keys(localPkg.dependencies ?? {}).length +
+        Object.keys(localPkg.devDependencies ?? {}).length;
+      if (localDepCount === 0) {
+        try {
+          const rootRaw = await fs.promises.readFile(
+            join(workspaceRootDir, 'package.json'),
+            'utf8',
+          );
+          const rootPkg = JSON.parse(rootRaw) as PackageDotJson;
+          return {
+            ...localPkg,
+            dependencies: { ...rootPkg.dependencies },
+            devDependencies: { ...rootPkg.devDependencies },
+          };
+        } catch {
+          // Root package.json missing or invalid — fall through
+        }
+      }
+    }
+
+    return localPkg;
   } catch {
     return null;
   }
diff --git a/src/utils/types.ts b/src/utils/types.ts
@@ -67,6 +67,12 @@ export type WizardOptions = {
    */
   menu: boolean;
 
+  /**
+   * Root directory of the monorepo workspace. When set, detection merges
+   * root package.json deps for projects with hoisted dependencies (e.g. Nx).
+   */
+  workspaceRootDir?: string;
+
   /**
    * Whether to run in benchmark mode with per-phase token tracking.
    * When enabled, the wizard runs each workflow phase as a separate agent call