Skip to content

docs: consolidate documentation, add examples, and mdBook site#40

Merged
miguelgila merged 5 commits intomainfrom
docs-improvement
Mar 19, 2026
Merged

docs: consolidate documentation, add examples, and mdBook site#40
miguelgila merged 5 commits intomainfrom
docs-improvement

Conversation

@miguelgila
Copy link
Owner

Summary

  • Remove deprecated example 10 (wren-job-api) and all references
  • Add example 10-slurm-hpc: Mixed runtimes — containerized Slurm scheduler + Reaper worker DaemonSet with direct host access for CPU pinning and device management
  • Add example 11-node-monitoring: Prometheus node_exporter as Reaper DaemonSet (accurate host metrics) + containerized Prometheus server
  • Implement --release mode in setup-playground.sh — pulls pre-built GHCR images instead of building locally (--release for latest, --release v0.2.14 for specific version)
  • Fix broken CI badges in README (test.yml/build.ymlci.yml)
  • Restructure README from 373 → 205 lines: extract Pod Compatibility table to docs/COMPATIBILITY.md, annotations/config to docs/CONFIGURATION.md
  • Set up mdBook documentation site (docs/book/) with {{#include}} directives to avoid content duplication
  • Add GitHub Actions workflow (.github/workflows/docs.yml) for building docs on PR and deploying to GitHub Pages on push to main

Test plan

  • cargo clippy --workspace --all-targets -- -D warnings — clean
  • cargo test --workspace — 152 tests pass
  • scripts/test-examples.sh --skip-cluster — 32 YAML validations pass (including 5 new)
  • bash -n scripts/setup-playground.sh — syntax OK
  • setup-playground.sh --help shows --release option
  • All mdBook {{#include}} paths verified to resolve correctly
  • mdbook build docs/book succeeds in CI (mdbook not installed locally)
  • Badge URLs render correctly on GitHub after merge

🤖 Generated with Claude Code

miguelgila and others added 4 commits March 16, 2026 23:10
…ease mode

- Remove deprecated example 10 (wren-job-api)
- Add example 10-slurm-hpc (mixed runtimes: containerized scheduler + Reaper workers)
- Add example 11-node-monitoring (Prometheus node_exporter via Reaper + Prometheus server)
- Implement --release mode in setup-playground.sh (pull pre-built GHCR images)
- Fix broken CI badges in README (test.yml/build.yml → ci.yml)
- Restructure README (~200 lines): extract Pod Compatibility and Annotations to docs/
- Create docs/CONFIGURATION.md and docs/COMPATIBILITY.md
- Set up mdBook documentation site (docs/book/) with include directives
- Add GitHub Actions workflow for docs build and GitHub Pages deployment
- Update docs/DEVELOPMENT.md CI section to reflect unified ci.yml

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Move raw Pod + runtimeClassName examples to the quick-start guide
in the docs site. README now leads with ReaperPod CRD for all usage
examples (run command, volumes, node selector).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
No local mdbook install needed — uses peaceiris/mdbook container.
- `make docs` builds the site to docs/book/book/
- `make docs-serve` serves with live-reload on http://localhost:3000
- `make clean` now also removes built docs

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Examples 10 and 11 now accept --release [VERSION] which is passed
through to setup-playground.sh to use pre-built GHCR images instead
of building from source.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@miguelgila miguelgila marked this pull request as draft March 16, 2026 22:53
@codecov
Copy link

codecov bot commented Mar 16, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 85.04%. Comparing base (f5291d6) to head (12b1639).
⚠️ Report is 2 commits behind head on main.

Additional details and impacted files
@@           Coverage Diff           @@
##             main      #40   +/-   ##
=======================================
  Coverage   85.04%   85.04%           
=======================================
  Files           6        6           
  Lines         301      301           
=======================================
  Hits          256      256           
  Misses         45       45           

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

- Use debian:12 for slurmctld and test-job (matches Kind node Slurm 22.05)
- Run all Slurm daemons as root (overlay filters /etc/gshadow, see #41)
- Add cgroup.conf with CgroupPlugin=cgroup/v1 (Kind lacks dbus/systemd)
- Auto-detect node names and CPU count in setup.sh ConfigMap generation
- Use --no-install-recommends and dpkg --fix-broken for robust apt installs
- Pass -N $(hostname) to slurmd for dynamic node registration
- Switch test job from srun to sbatch (avoids reverse connection issue)

Tested: slurmctld running, 2 slurmd nodes registered and executing jobs.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@miguelgila miguelgila marked this pull request as ready for review March 19, 2026 15:27
@miguelgila miguelgila merged commit 9fde935 into main Mar 19, 2026
12 checks passed
@miguelgila miguelgila deleted the docs-improvement branch March 19, 2026 15:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant