# 'Sentiment Bias': 0.0009947145187601957}}}
```
##### Bias and Fairness Red-Teaming
To assess worst-case toxicity and counterfactual generations for a given use case, LangFair also offers off-the-shelf red-teaming evaluations. The following code can be used:
```python
from langfair.generator import AdversarialGenerator

ag = AdversarialGenerator(langchain_llm=llm)

# Generate responses to adversarial prompts (toxicity)
toxicity_generations = await ag.toxicity()

# Generate responses to adversarial prompts (counterfactual fairness)
```

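The `await` calls above assume an already-running event loop (e.g., a Jupyter notebook). In a plain Python script, the same pattern is driven with `asyncio.run`. A minimal sketch of that wiring, using a hypothetical stub in place of `AdversarialGenerator` so it runs standalone:

```python
import asyncio

# Hypothetical stand-in for AdversarialGenerator: any object exposing an
# async toxicity() method fits the awaited-call pattern shown above.
class StubAdversarialGenerator:
    async def toxicity(self):
        # A real generator would call the wrapped LLM here.
        return ["response to adversarial prompt 1",
                "response to adversarial prompt 2"]

async def main():
    ag = StubAdversarialGenerator()
    # In a script, awaited calls must live inside a coroutine...
    return await ag.toxicity()

# ...which is then handed to asyncio.run at the top level.
toxicity_generations = asyncio.run(main())
print(len(toxicity_generations))  # 2
```

The stub class and its return values are illustrative only; only the `asyncio.run`-around-`await` structure carries over to real usage.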
Explore the following demo notebooks to see how to use LangFair for various bias and fairness evaluation metrics:
- [AutoEval for Text Generation / Summarization (Toxicity, Stereotypes, Counterfactual)](https://github.com/cvs-health/langfair/blob/main/examples/evaluations/text_generation/auto_eval_demo.ipynb): A notebook illustrating how to use LangFair's `AutoEval` class for a comprehensive fairness assessment of text generation / summarization use cases. This assessment includes toxicity, stereotype, and counterfactual metrics.
- [Classification Fairness Evaluation](https://github.com/cvs-health/langfair/blob/main/examples/evaluations/classification/classification_metrics_demo.ipynb): A notebook demonstrating classification fairness metrics.
- [Recommendation Fairness Evaluation](https://github.com/cvs-health/langfair/blob/main/examples/evaluations/recommendation/recommendation_metrics_demo.ipynb): A notebook demonstrating recommendation fairness metrics.
- [Adversarial Toxicity Evaluation](https://github.com/cvs-health/langfair/blob/main/examples/adversarial/adversarial_toxicity.ipynb): A notebook demonstrating red-teaming using adversarial toxicity prompts.
- [Adversarial Counterfactual Fairness Evaluation](https://github.com/cvs-health/langfair/blob/main/examples/adversarial/adversarial_counterfactual.ipynb): A notebook demonstrating red-teaming using adversarial counterfactual fairness prompts.

## 🛠 Choosing Bias and Fairness Metrics for an LLM Use Case