Minor release: v0.8.0 #231

Merged
dylanbouchard merged 27 commits into main from release-branch/v0.8.0
Jan 9, 2026

@dylanbouchard (Collaborator) commented Jan 8, 2026

This release adds two new features:

Progress bars with rich

Progress tracking is now available for all metric computation and generation steps using rich progress bars. For the applicable methods, these progress bars can be turned on/off using the show_progress_bar boolean parameter.
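As a rough illustration of how a `show_progress_bar` flag can gate a rich progress bar, here is a minimal sketch. The function `compute_scores` and its word-count scorer are hypothetical and are not LangFair's API; only `rich.progress.track` is a real rich call, and the sketch falls back to plain iteration if rich is not installed.

```python
from typing import Callable, Iterable, List

try:
    from rich.progress import track  # real rich helper for wrapping an iterable
except ImportError:  # rich not installed: iterate without a bar
    track = None


def compute_scores(
    responses: Iterable[str],
    scorer: Callable[[str], float],
    show_progress_bar: bool = True,
) -> List[float]:
    """Score each response, optionally showing a rich progress bar.

    Hypothetical helper for illustration; not part of LangFair.
    """
    items = list(responses)
    if show_progress_bar and track is not None:
        iterator = track(items, description="Scoring responses...")
    else:
        iterator = items
    return [scorer(text) for text in iterator]


# Toy scorer: number of whitespace-separated words in the response.
word_count = lambda text: len(text.split())

quiet_scores = compute_scores(["a b c", "d"], word_count, show_progress_bar=False)
shown_scores = compute_scores(["a b c", "d"], word_count, show_progress_bar=True)
```

The flag only changes what is displayed; the returned scores are the same either way.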

Adversarial evaluations

We now offer two out-of-the-box adversarial evaluations with LangFair for scenarios where users wish to explore worst-case generations for their LLM of choice. These evaluations, a toxicity assessment and a counterfactual fairness assessment, are implemented using the AdversarialGenerator class.

1. Adversarial toxicity evaluation

This implementation generates responses with the provided LLM to prompts from the RealToxicityPrompts dataset.
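The general shape of this step, sending each prompt to the model several times and collecting the responses, can be sketched as follows. Everything here is an assumption for illustration: `generate_responses`, `stub_llm`, and the placeholder prompts are hypothetical, and the sketch neither loads RealToxicityPrompts nor uses the AdversarialGenerator class.

```python
from typing import Callable, Dict, List

# Placeholder prompts; a real run would draw prompts from RealToxicityPrompts.
PLACEHOLDER_PROMPTS: List[str] = [
    "The meeting went badly because",
    "Nobody expected that she would",
]


def generate_responses(
    prompts: List[str],
    llm: Callable[[str], str],
    count: int = 2,
) -> Dict[str, List[str]]:
    """Collect `count` model responses for each prompt (hypothetical helper)."""
    return {prompt: [llm(prompt) for _ in range(count)] for prompt in prompts}


def stub_llm(prompt: str) -> str:
    """Stand-in for a real LLM call; returns a canned continuation."""
    return f"{prompt} [model continuation]"


results = generate_responses(PLACEHOLDER_PROMPTS, stub_llm, count=2)
```

Generating more than one response per prompt matters for worst-case analysis, since a single sample may miss the model's most problematic output.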

2. Adversarial counterfactual evaluation

This implementation generates counterfactual responses with the provided LLM to adversarially designed counterfactual prompt sets. Off-the-shelf templates are available for a wide variety of attributes, such as gender, race, and age.
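To make the idea of a counterfactual prompt set concrete, here is a minimal sketch that fills one template with different group words so the resulting prompts differ only in the attribute mentioned. The template, the word lists, and `build_counterfactual_prompts` are hypothetical examples, not the templates shipped with LangFair.

```python
from typing import Dict, List

# Hypothetical template and word lists, for illustration only.
TEMPLATE = "Describe a typical workday for {group} nurses."
ATTRIBUTE_WORDS: Dict[str, List[str]] = {
    "gender": ["male", "female"],
    "age": ["young", "elderly"],
}


def build_counterfactual_prompts(template: str, attribute: str) -> Dict[str, str]:
    """Return one prompt per group word for the chosen attribute."""
    return {
        word: template.format(group=word)
        for word in ATTRIBUTE_WORDS[attribute]
    }


gender_prompts = build_counterfactual_prompts(TEMPLATE, "gender")
age_prompts = build_counterfactual_prompts(TEMPLATE, "age")
```

Because the prompts in a set are identical apart from the group word, differences between the model's responses can be attributed to that word.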

dependabot bot and others added 27 commits September 5, 2025 01:20
Bumps [actions/setup-python](https://github.com/actions/setup-python) from 5.6.0 to 6.0.0.
- [Release notes](https://github.com/actions/setup-python/releases)
- [Commits](actions/setup-python@v5.6.0...v6.0.0)

---
updated-dependencies:
- dependency-name: actions/setup-python
  dependency-version: 6.0.0
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
…elop/actions/setup-python-6.0.0

chore(deps): bump actions/setup-python from 5.6.0 to 6.0.0
add unit tests for adversarial generator
* completed changes for progress bar implementation

* added try catch to stop progress for exception

* added progress bar description

* added progressbar parameters for all notebooks

* fixed linting and formatting errors

* completed progress bar - whitespace issue and gender attrib issue

* fixed linting

* update attribute info in notebook

* reverting old notebook metadata

* reverting metadata for counterfactual metrics

* refactored message based on comments

* added attributes for response generator

* reverting changes

* added attribute info and removed print statement

* added attribute and removed print statement

* added attribute info

* completed progress bar for toxicity

* completed changes for stereotype metrics progress bar

* fixed bugs and linting issues

* fixed output indentation issues

* updated poetry lock file

* updated attributions and info on example notebooks

* bug fixes

* updated poetry file and add rich

* fixed linting issues

* deprecate estimate_token_cost

* fix toxicity progress bars

* update progress bars for COBS

* update progress bars for SA

* create progress bar utilities

* use progress utilities with toxicity

* polish progress bars for counterfactual

* polish progress bars for stereotype

* polish progress bars for toxicity

* polish progress bars for generator classes

* update notebooks

* fix logic when progress bar not displayed

* add rich dependency

* ruff changes

* progress bar updates

* polish autoeval progress bars

* ruff changes

* update progress bar for adversarial toxicity and sentiment bug

* update unit tests

* add disk space cleanup to github workflow

* add unit tests for progress bar

* update unit test

* disable live rendering in CI

* update unit test

* update unit test

* update ci.yaml

* add conftest.py

* mock progress bar to fix LiveError

* mock progress bar to fix Rich LiveError

* mock progress bar

* update unit tests

* update unit tests

* update ci.yaml

* update progress bar completion

* fix progress bars for adversarial

---------

Co-authored-by: Jinesh Mehta <mehtajineshs@gmail.com>
Co-authored-by: Zeya Ahmad <za2291@columbia.edu>
@dylanbouchard dylanbouchard marked this pull request as ready for review January 8, 2026 23:26
@dylanbouchard dylanbouchard merged commit fc7afa7 into main Jan 9, 2026
33 checks passed
2 participants