Update documentation and Gemfile.lock for consistency and clarity (#7)

paulz · tkersey · Copilot · web-flow · commit 6d790826cdb5 · 2025-03-04T09:45:16.000-08:00
* Update documentation and Gemfile.lock for consistency and clarity

* Apply suggestions from code review

Co-authored-by: Copilot &lt;175728472+Copilot@users.noreply.github.com&gt;

* Fix some typos

Co-authored-by: Copilot &lt;175728472+Copilot@users.noreply.github.com&gt;

---------

Co-authored-by: Paul Zabelin &lt;paulz@users.noreply.github.com&gt;
Co-authored-by: Tim Kersey &lt;tkersey@users.noreply.github.com&gt;
Co-authored-by: Copilot &lt;175728472+Copilot@users.noreply.github.com&gt;
diff --git a/docs/Gemfile.lock b/docs/Gemfile.lock
@@ -13,15 +13,15 @@ GEM
       http_parser.rb (~> 0)
     eventmachine (1.2.7)
     ffi (1.17.1)
-    ffi (1.17.1-aarch64-linux-gnu)
+    ffi (1.17.1-aarch64-linux)
     ffi (1.17.1-aarch64-linux-musl)
-    ffi (1.17.1-arm-linux-gnu)
+    ffi (1.17.1-arm-linux)
     ffi (1.17.1-arm-linux-musl)
     ffi (1.17.1-arm64-darwin)
-    ffi (1.17.1-x86-linux-gnu)
+    ffi (1.17.1-x86-linux)
     ffi (1.17.1-x86-linux-musl)
     ffi (1.17.1-x86_64-darwin)
-    ffi (1.17.1-x86_64-linux-gnu)
+    ffi (1.17.1-x86_64-linux)
     ffi (1.17.1-x86_64-linux-musl)
     forwardable-extended (2.6.0)
     google-protobuf (4.29.3)
@@ -99,35 +99,35 @@ GEM
     sass-embedded (1.83.4)
       google-protobuf (~> 4.29)
       rake (>= 13)
-    sass-embedded (1.83.4-aarch64-linux-android)
+    sass-embedded (1.83.4-aarch64-linux)
       google-protobuf (~> 4.29)
-    sass-embedded (1.83.4-aarch64-linux-gnu)
+    sass-embedded (1.83.4-aarch64-linux-android)
       google-protobuf (~> 4.29)
     sass-embedded (1.83.4-aarch64-linux-musl)
       google-protobuf (~> 4.29)
     sass-embedded (1.83.4-aarch64-mingw-ucrt)
       google-protobuf (~> 4.29)
-    sass-embedded (1.83.4-arm-linux-androideabi)
+    sass-embedded (1.83.4-arm-linux)
       google-protobuf (~> 4.29)
-    sass-embedded (1.83.4-arm-linux-gnueabihf)
+    sass-embedded (1.83.4-arm-linux-androideabi)
       google-protobuf (~> 4.29)
     sass-embedded (1.83.4-arm-linux-musleabihf)
       google-protobuf (~> 4.29)
     sass-embedded (1.83.4-arm64-darwin)
       google-protobuf (~> 4.29)
-    sass-embedded (1.83.4-riscv64-linux-android)
+    sass-embedded (1.83.4-riscv64-linux)
       google-protobuf (~> 4.29)
-    sass-embedded (1.83.4-riscv64-linux-gnu)
+    sass-embedded (1.83.4-riscv64-linux-android)
       google-protobuf (~> 4.29)
     sass-embedded (1.83.4-riscv64-linux-musl)
       google-protobuf (~> 4.29)
     sass-embedded (1.83.4-x86_64-cygwin)
       google-protobuf (~> 4.29)
     sass-embedded (1.83.4-x86_64-darwin)
       google-protobuf (~> 4.29)
-    sass-embedded (1.83.4-x86_64-linux-android)
+    sass-embedded (1.83.4-x86_64-linux)
       google-protobuf (~> 4.29)
-    sass-embedded (1.83.4-x86_64-linux-gnu)
+    sass-embedded (1.83.4-x86_64-linux-android)
       google-protobuf (~> 4.29)
     sass-embedded (1.83.4-x86_64-linux-musl)
       google-protobuf (~> 4.29)
@@ -137,29 +137,29 @@ GEM
     webrick (1.9.1)
 
 PLATFORMS
+  aarch64-linux
   aarch64-linux
   aarch64-linux-android
-  aarch64-linux-gnu
   aarch64-linux-musl
   aarch64-mingw-ucrt
+  arm-linux
+  arm-linux
   arm-linux-androideabi
-  arm-linux-gnu
-  arm-linux-gnueabihf
   arm-linux-musl
   arm-linux-musleabihf
   arm64-darwin
+  riscv64-linux
   riscv64-linux-android
-  riscv64-linux-gnu
   riscv64-linux-musl
   ruby
   x86-linux
-  x86-linux-gnu
+  x86-linux
   x86-linux-musl
   x86_64-cygwin
   x86_64-darwin
   x86_64-linux
+  x86_64-linux
   x86_64-linux-android
-  x86_64-linux-gnu
   x86_64-linux-musl
 
 DEPENDENCIES
diff --git a/docs/_config.yml b/docs/_config.yml
@@ -25,7 +25,7 @@ description: >- # this means to ignore newlines until "baseurl:"
 baseurl: "/continuous-alignment-testing" # the subpath of your site, e.g. /blog
 url: "https://thisisartium.github.io/continuous-alignment-testing" # the base hostname & protocol for your site, e.g. http://example.com
 # twitter_username: jekyllrb
-# github_username:  jekyll
+github_username:  thisisartium
 show_excerpts: false 
 # Build settings
 theme: minima
diff --git a/docs/about.markdown b/docs/about.markdown
@@ -1,18 +1,16 @@
 ---
 layout: page
 title: About
-permalink: /about/
 ---
 
-This is the base Jekyll theme. You can find out more info about customizing your Jekyll theme, as well as basic Jekyll usage documentation at [jekyllrb.com](https://jekyllrb.com/)
+## Overview
 
-You can find the source code for Minima at GitHub:
-[jekyll][jekyll-organization] /
-[minima](https://github.com/jekyll/minima)
+CAT Harness provides the infrastructure needed to:
 
-You can find the source code for Jekyll at GitHub:
-[jekyll][jekyll-organization] /
-[jekyll](https://github.com/jekyll/jekyll)
+- Run and track CAT tests against LLM outputs
+- Store and analyze test results over time
+- Monitor changes in LLM behavior as prompts/models/data evolve
+- Integrate validation into CI/CD pipelines
 
-
-[jekyll-organization]: https://github.com/jekyll
+[Getting Started](getting-started.html)
+[Reference](api/index.html)
diff --git a/docs/getting-started.md b/docs/getting-started.md
@@ -7,13 +7,14 @@ title: Getting Started
 
 ## Poetry
 ```sh
-poetry install cat-ai
-
+poetry add cat-ai
+```
 ## UV
 
+```sh
 uv add cat-ai
 ```
 
 # Driving out non-deterministic projects with CAT
 
-Let's do a step by step journey through the lifecycle of a project to show how and why to use CAT. We will use an example of a project using an LLM and prompt to give recommendations of software teams for a project. The first step will be working with the prompt and LLM in [local development](local-development.md)
+Let's do a step by step journey through the lifecycle of a project to show how and why to use CAT. We will use an example of a project using an LLM and prompt to give recommendations of software teams for a project. The first step will be working with the prompt and LLM in [local development](local-development.html)
diff --git a/docs/index.markdown b/docs/index.markdown
@@ -5,14 +5,7 @@
 layout: home
 ---
 
-## Overview
+## Index
 
-CAT Harness provides the infrastructure needed to:
-
-- Run and track CAT tests against LLM outputs
-- Store and analyze test results over time
-- Monitor changes in LLM behavior as prompts/models/data evolve
-- Integrate validation into CI/CD pipelines
-
-[Getting Started](getting-started.html)
-[Refernece](api/index.html)
+- [Getting Started](getting-started.html)
+- [API Reference](api/index.html)
diff --git a/docs/local-development.md b/docs/local-development.md
@@ -8,20 +8,25 @@ The first step will be just to be able to run the first version of your prompt a
 Imagine we have a python project called `team_recommender` where we recommend teams of developers to be used on a given project. The basic structure looks like this:
 
 ```
-team_recommender/
-├── README.md
-├── requirements.txt
-├── src/
-│   ├── __init__.py
-│   ├── main.py
-│   └── utils.py
-└── tests/
-    ├── fixtures/
-    |   ├── example_output.json
-    |   └── skills.json
-    ├── __init__.py
-    ├── test_allocations.py
+examples/team_recommender
+├── conftest.py
+├── readme.md
+└── tests
+    ├── example_0_text_output
+    ├── example_1_unit
+    │   └── test_allocations_unit.py
+    ├── example_2_loop
+    │   └── test_allocations_loop.py
+    ├── example_3_loop_no_hallucinating
+    │   └── test_allocations_hallucinating.py
+    ├── example_4_gate_on_success_threshold
+    │   └── test_allocations_threshold.py
+    ├── fixtures
+    │   ├── example_output.json
+    │   ├── output_schema.json
+    │   └── skills.json
     └── settings.py
+
 ```
 
 ## Single Test
@@ -457,4 +462,4 @@ O.k! Great! Lets look at our second failure:
     }
 }
 ```
-WOW! We didn't get any developers at all. Great! We can work with this! From here we can update our prompt to be more reslient. Once we make our updates, we will want to make sure these promblems are decreasing and not not regressing over time. Obviously, that isn't something you would try to control on your local machine, and the amount of test runs to get statisticle confidence about the rates of failure/hallucination are staying low. The best surface to gate and monitor this is going to be in your [Continous Integration](running-in-ci.md).
+WOW! We didn't get any developers at all. Great! We can work with this! From here we can update our prompt to be more resilient. Once we make our updates, we will want to make sure these problems are decreasing and not not regressing over time. Obviously, that isn't something you would try to control on your local machine, and the amount of test runs to get statisticle confidence about the rates of failure/hallucination are staying low. The best surface to gate and monitor this is going to be in your [Continous Integration](running-in-ci.html).