# StringSight
### *Extract, cluster, and analyze behavioral properties from Large Language Models*

**Annoyed at having to look through your long model conversations or agentic traces? Fear not: StringSight has come to ease your woes. Understand and compare model behavior by automatically extracting behavioral properties from model responses, grouping similar behaviors together, and quantifying how important those behaviors are.**

## Installation

```bash
# (Optional) create and activate a dedicated environment
conda create -n stringsight python=3.11
conda activate stringsight

# Install the core library from PyPI
pip install stringsight

# Install with all optional extras (recommended for notebooks and advanced workflows)
pip install "stringsight[full]"
```

For local development or contributing, you can install from source in editable mode (clone the repository and run `pip install -e .` from the repository root).

For a comprehensive tutorial with detailed explanations, see [starter_notebook.ipynb](starter_notebook.ipynb) or open it directly in [Google Colab](https://colab.research.google.com/drive/1XBQqDqTK6-9wopqRB51j8cPfnTS5Wjqh?usp=drive_link).
### 1. Extract and Cluster Properties with `explain()`
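The full walkthrough for this step is in the starter notebook; the snippet below is only a minimal sketch of what a call might look like, assuming `explain` is importable from the top-level `stringsight` package, accepts a pandas DataFrame of prompts and responses, and returns an object summarizing the extracted properties and their clusters. The column names `prompt`, `model`, and `model_response` are assumptions for illustration, not a documented schema.

```python
# Hypothetical sketch -- import path, column names, and return value are assumptions.
import pandas as pd
from stringsight import explain  # assumed import location

df = pd.DataFrame({
    "prompt": ["What is machine learning?", "Explain overfitting."],
    "model": ["gpt-4", "gpt-4"],
    "model_response": ["Machine learning is...", "Overfitting happens when..."],
})

results = explain(df)  # extract behavioral properties and cluster similar ones
print(results)         # inspect the clustered properties and their importance
```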
Use the React frontend or other visualization tools to explore your results.
### Side-by-Side Comparisons
**Option 1: Tidy Data (Auto-pairing)**

If your data is in tidy single-model format with multiple models, StringSight can pair them automatically: the pipeline matches rows where both models answered the same prompt.
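As an illustration only, a tidy two-model dataframe might look like the sketch below. The column names (`prompt`, `model`, `model_response`) are assumed for this example and are not specified in this section.

```python
import pandas as pd

# Assumed tidy schema: one row per (prompt, model) pair.
tidy_df = pd.DataFrame({
    "prompt": [
        "What is machine learning?", "What is machine learning?",
        "Explain overfitting.", "Explain overfitting.",
    ],
    "model": ["gpt-4", "claude-3", "gpt-4", "claude-3"],
    "model_response": [
        "Machine learning is...", "ML involves...",
        "Overfitting happens when...", "Overfitting means...",
    ],
})
# Rows that share a prompt are paired automatically, so no model_a/model_b columns are needed.
```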
**Option 2: Pre-paired Data**

**Required Columns:**

| Column | Description | Example |
|--------|-------------|---------|
| `prompt` | Question given to both models | `"What is machine learning?"` |
| `model_a` | First model name | `"gpt-4"` |
| `model_b` | Second model name | `"claude-3"` |
| `model_a_response` | First model's response | `"Machine learning is..."` |
| `model_b_response` | Second model's response | `"ML involves..."` |

**Optional Columns:**

| Column | Description | Example |
|--------|-------------|---------|
| `score` | Winner and metrics | `{"winner": "model_a", "helpfulness_a": 4.2, "helpfulness_b": 3.8}` |
| `score_columns` | Alternative: separate columns for each metric with `_a` and `_b` suffixes (e.g., `accuracy_a`, `accuracy_b`) | `score_columns=["accuracy_a", "accuracy_b", "helpfulness_a", "helpfulness_b"]` |
| `prompt_column` | Name of the prompt column in your dataframe (default: `"prompt"`) | `prompt_column="query"` |
| `model_a_column` | Name of the model_a column (default: `"model_a"`) | `model_a_column="model_1"` |
| `model_b_column` | Name of the model_b column (default: `"model_b"`) | `model_b_column="model_2"` |
| `model_a_response_column` | Name of the model_a_response column (default: `"model_a_response"`) | `model_a_response_column="response_1"` |
| `model_b_response_column` | Name of the model_b_response column (default: `"model_b_response"`) | `model_b_response_column="response_2"` |
| `question_id_column` | Name of the question_id column (default: `"question_id"` if the column exists) | `question_id_column="qid"` |

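For completeness, here is a sketch of passing a pre-paired dataframe with renamed columns to `explain()`. The import path, and the assumption that the column-name overrides listed above are keyword arguments to `explain()`, are inferences from the option names, not a documented signature.

```python
import pandas as pd
from stringsight import explain  # assumed import location

# Pre-paired data with custom column names, mapped via the options above.
paired_df = pd.DataFrame({
    "query": ["What is machine learning?"],
    "model_1": ["gpt-4"],
    "model_2": ["claude-3"],
    "response_1": ["Machine learning is..."],
    "response_2": ["ML involves..."],
    "score": [{"winner": "model_a", "helpfulness_a": 4.2, "helpfulness_b": 3.8}],
})

results = explain(
    paired_df,
    prompt_column="query",                 # default: "prompt"
    model_a_column="model_1",              # default: "model_a"
    model_b_column="model_2",              # default: "model_b"
    model_a_response_column="response_1",  # default: "model_a_response"
    model_b_response_column="response_2",  # default: "model_b_response"
)
```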