# Slurm Allocation Strategies

When submitting a workflow with many jobs to Slurm, you must decide how to split work across
allocations. The `torc slurm plan-allocations` command (or the `plan_allocations` MCP tool for AI
assistants) analyzes your workflow and cluster state to recommend a strategy.

## The Core Tradeoff: Single Large vs Many Small

Given N nodes' worth of work, there are two extremes:

| Strategy                 | Description                           | Pros                                                                                                | Cons                                                                                    |
| ------------------------ | ------------------------------------- | --------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------- |
| **1 x N** (single large) | One allocation requesting all N nodes | Slurm prioritizes larger jobs; all work completes in one walltime window; no fair-share degradation | Must wait for N nodes to be available simultaneously                                    |
| **N x 1** (many small)   | N separate single-node allocations    | First jobs start as soon as any node is free                                                        | Fair-share degrades as allocations start; last jobs may wait much longer than the first |
### When Single Large Wins

- **Slurm backfill priority**: Slurm's scheduler reserves nodes for large pending jobs. A 167-node
  request gets a reserved slot in the queue, while 167 individual jobs compete with every other job
  in the queue.
- **Fair-share preservation**: A single allocation consumes your fair-share budget once. Many small
  allocations drain it progressively, causing later jobs to lose priority.
- **Deterministic completion**: All jobs start processing simultaneously and finish within one
  walltime window.
- **Busy clusters**: Counter-intuitively, a fully loaded cluster often favors large allocations
  because Slurm will schedule the large job as a block when enough nodes free up, rather than
  letting small jobs trickle through.
### When Many Small Wins

- **Extremely long queues**: If the cluster is oversubscribed for weeks, small jobs may fit into
  backfill gaps that a large allocation cannot.
- **Partial results needed**: If you need some results quickly rather than waiting for all of them.
- **Near partition limits**: If your ideal node count exceeds `max_nodes_per_user`, you cannot
  request a single allocation that large.
## Using `sbatch --test-only`

The `plan-allocations` command runs `sbatch --test-only` to ask Slurm's scheduler when each strategy
would start, without actually submitting jobs. For a plan with K nodes per allocation and N total
allocations:

```bash
# Single large: when would all K*N nodes start together?
sbatch --test-only --nodes=<K*N> --time=04:00:00 --account=myproject --wrap="hostname"

# Many small: when would one K-node allocation start?
sbatch --test-only --nodes=<K> --time=04:00:00 --account=myproject --wrap="hostname"
```

When no partition is explicitly configured, the `--partition` flag is omitted so Slurm uses its
default partition.

The single-large estimated start + walltime gives the completion time directly. The many-small
estimate is **optimistic** — it only predicts when the _first_ allocation would start. Later
allocations will be delayed by fair-share degradation.

### Fair-Share Degradation Estimate

The tool estimates the last small allocation's completion as:

```
last_completion ≈ first_wait × min(N, 10) + walltime
```

This is a rough approximation. The actual degradation depends on your account's fair-share balance,
other users' activity, and the scheduler's configuration.

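To make the heuristic concrete, a small sketch that applies the formula above with illustrative
numbers (the inputs are made up for the example; the `min(N, 10)` cap comes straight from the
formula):

```python
from datetime import timedelta

def last_small_completion(first_wait: timedelta, n_allocations: int,
                          walltime: timedelta) -> timedelta:
    """Heuristic: last_completion ≈ first_wait × min(N, 10) + walltime."""
    return first_wait * min(n_allocations, 10) + walltime

def single_large_completion(large_wait: timedelta, walltime: timedelta) -> timedelta:
    """The large allocation's completion is simply its start estimate plus walltime."""
    return large_wait + walltime

# Illustrative inputs: first small job starts in 5 min, the large block in 30 min.
small = last_small_completion(timedelta(minutes=5), 167, timedelta(hours=4))
large = single_large_completion(timedelta(minutes=30), timedelta(hours=4))
print(small)  # 4:50:00  (5 min × min(167, 10) + 4 h)
print(large)  # 4:30:00
```

Note how the cap keeps the estimate bounded: beyond 10 allocations, the heuristic stops adding
wait time, which is one reason to treat it as a floor rather than a prediction.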
## Interpreting Results

Example output:

```
Recommendations
===============
  "work_resources": 1 allocation(s) x 167 node(s) [single]
    sbatch --test-only: large (167 nodes) completes in ~4h 30min,
    faster than 167 small allocations (~6h 30min).
    Slurm prioritizes larger allocations

  Scheduler Estimate (sbatch --test-only):
    Single large (167 nodes): start in ~30min, complete in ~4h 30min
    Many small (1 node): start in ~5min, complete in ~4h 5min
    Note: estimate is for first job only; later jobs delayed by fair-share
```

Key things to check:

- **Large completion vs small completion**: The tool accounts for fair-share degradation in its
  recommendation, but review the raw estimates yourself.
- **Wait time for large**: If the large allocation won't start for hours while small jobs start
  immediately, small may still be better for partial results.
- **Dependency depth**: A DAG with deep dependency chains cannot exploit N-node parallelism fully.
  Check `max_parallelism` in the workflow analysis — if it's much less than `ideal_nodes`, you may
  need fewer nodes than calculated.

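The dependency-depth check can be reasoned about as the widest topological level of the job DAG: no
matter how many nodes you request, jobs on the same dependency chain run one at a time. A minimal
sketch of that bound (an illustration, not torc's actual analysis):

```python
from collections import defaultdict

def max_parallelism(deps: dict[str, list[str]]) -> int:
    """Widest topological level of a DAG: an upper bound on how many
    jobs can ever run at once. deps maps each job to its prerequisites."""
    level_of: dict[str, int] = {}

    def level(job: str) -> int:
        if job not in level_of:
            # A job's level is one past its deepest prerequisite.
            level_of[job] = 1 + max((level(d) for d in deps.get(job, [])), default=-1)
        return level_of[job]

    counts: dict[int, int] = defaultdict(int)
    for job in deps:
        counts[level(job)] += 1
    return max(counts.values())

# 167 independent jobs: every node you request can be put to work.
flat = {f"job{i}": [] for i in range(167)}
print(max_parallelism(flat))  # 167

# A chain a -> b -> c: only one job is ever runnable, so extra nodes idle.
chain = {"a": [], "b": ["a"], "c": ["b"]}
print(max_parallelism(chain))  # 1
```

If this bound is far below `ideal_nodes`, requesting fewer nodes wastes less allocation time on
idle nodes.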
## Skipping the Probes

Use `--skip-test-only` to rely on heuristics only (faster, no `sbatch` calls):

```bash
torc slurm plan-allocations --account myproject --skip-test-only workflow.yaml
```

Use `--offline` to skip all cluster queries (`sinfo`, `squeue`, `sbatch`):

```bash
torc slurm plan-allocations --account myproject --offline workflow.yaml
```
## Applying the Recommendation

After reviewing the recommendation:

```bash
# Single large allocation (1 x N nodes)
torc slurm generate --account myproject --single-allocation workflow.yaml

# Many small allocations (N x 1 node, the default)
torc slurm generate --account myproject workflow.yaml
```
## Using with AI Assistants

The `plan_allocations` MCP tool provides the same analysis through AI assistants (Claude Code,
GitHub Copilot). The assistant can analyze your workflow spec and present the recommendation
conversationally:

```
> I have a workflow with 167 parallel jobs. Should I use one big Slurm
> allocation or many small ones? My account is "myproject".

⏺ torc - plan_allocations (MCP)(spec_json: {...}, account: "myproject")
  ⎿ { "recommendations": [...], "cluster_state": [...], ... }

⏺ Based on the current cluster state and sbatch --test-only probes:

  Recommendation: Single large allocation (1 x 167 nodes)

  | Strategy     | Estimated Start | Estimated Completion |
  |--------------|-----------------|----------------------|
  | Single large | ~30 min         | ~4h 30min            |
  | Many small   | ~5 min (first)  | ~6h 30min (last)     |

  The single large allocation completes ~2 hours faster because Slurm
  prioritizes it in the backfill queue, and you avoid fair-share
  degradation from 167 separate allocations.

  Shall I generate the workflow with a single allocation?
```

The tool accepts a workflow spec as a JSON object, the Slurm account, and optional partition and HPC
profile overrides. Use `skip_test_only: true` to skip the sbatch probes for faster results based on
heuristics only.

See [Configuring AI Assistants](../tools/ai-assistants.md) for setup instructions.