Mitigating Reasoning LLM Social Bias by Assessing and Filtering Reasoning Steps with a Multi-Judge Pipeline

Authors

Fatemehzahra Ghafari Ghomi (fatemezahra.ghafari@studio.unibo.it)
Shafagh Rastegari (shafagh.rastegari@studio.unibo.it)
Habib Kazemi (habib.kazemi2@studio.unibo.it)

Description

This project introduces and evaluates a novel pipeline designed to mitigate social biases perpetuated by Reasoning Large Language Models (LLMs). Chain-of-Thought (CoT) in reasoning models can introduce or amplify stereotypes within the reasoning steps themselves. Our pipeline addresses this by identifying and filtering these biased reasoning steps before they influence the final answer.

Our approach uses a multi-judge system, employing LLMs to assess each step of a CoT sequence for social bias. Biased steps are then removed, creating a “debiased” CoT that is passed to a final model for answer generation.
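The filtering stage described above can be illustrated with a minimal sketch. It is not the code from the notebooks: the names `filter_biased_steps`, `keyword_judge`, and `min_votes` are hypothetical, the vote-threshold aggregation rule is an assumption, and the keyword-based judges are toy stand-ins for the LLM judges.

```python
from typing import Callable, List

# A judge maps one reasoning step to True when it considers the step biased.
# In the real pipeline each judge would be an LLM call; this type is an assumption.
Judge = Callable[[str], bool]


def filter_biased_steps(steps: List[str], judges: List[Judge], min_votes: int) -> List[str]:
    """Keep only the steps that fewer than `min_votes` judges flag as biased."""
    kept = []
    for step in steps:
        votes = sum(1 for judge in judges if judge(step))
        if votes < min_votes:
            kept.append(step)
    return kept


def keyword_judge(keyword: str) -> Judge:
    """Toy stand-in for an LLM judge: flags a step that contains `keyword`."""
    return lambda step: keyword in step.lower()


judges = [keyword_judge("typically"), keyword_judge("old people"), keyword_judge("always")]

cot = [
    "The context says two people were at the store.",
    "Old people typically struggle with technology.",
    "The context does not say who needed help.",
]

# Assumed aggregation rule: drop a step when at least 2 of the 3 judges flag it.
debiased = filter_biased_steps(cot, judges, min_votes=2)
print(debiased)
```

In this sketch the surviving steps keep their original order, so the debiased CoT can be passed to the final model as-is. Other aggregation rules, such as a unanimous vote or a single veto, fit the same interface by changing `min_votes`.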

We conducted extensive experiments on two datasets:

Bias Benchmark for Question Answering (BBQ)
Multilingual Bias Benchmark for Question Answering (MBBQ)

Project Structure

The repository is organized into two main directories:

BBQ/: Contains the code, dataset and results of running experiments on the English BBQ dataset.
MBBQ/: Contains the code, dataset and results of our multilingual extension of the BBQ benchmark, covering English, Spanish, Dutch, and Turkish.

Setup and Installation

Prerequisites

Python 3.12 or higher
uv (Python package manager)

Installation

Install uv (if not already installed)

# macOS/Linux with curl
curl -LsSf https://astral.sh/uv/install.sh | sh

# Windows with PowerShell
powershell -c "irm https://astral.sh/uv/install.ps1 | iex"

Set up the project

# Clone the repository
git clone <repository-url>
cd path/to/project

# Create virtual environment and install dependencies using uv
uv sync

# Activate the virtual environment created by uv sync (macOS/Linux)
source .venv/bin/activate

Running the projects

The main project notebooks are interactive Marimo notebooks. They provide a reactive and reproducible environment for running the experiments.

To run the BBQ notebooks, first enter the BBQ directory:

cd BBQ

To run the MBBQ notebooks, first enter the MBBQ directory:

cd MBBQ

Then open the notebook you want to run:

marimo edit <notebook-name>.py

License

This project is licensed under the MIT License. See the LICENSE file for details.

