GitHub - rwth-i6/Gender-Bias-in-German-LLMs

The framework developed for the work on Gender Bias in German LLMs.

It can be used to generate text with LLMs. The generated output can be evaluated with regard to gender bias. The datasets are in German. Three datasets include open text generation prompts:

GenderPersona
StereoPersona
NeutralPersona

Two datasets are made up of Q&A tasks:

GerBBQ+
SexistStatements

A comprehensive description of the datasets, metrics and implementation can be found in chapters 4,5 and 6 the thesis file.

The settings file can be edited to specify models and datasets that should be applied. Huggingface login token, and api keys can be specified here.

Currently, models supported by the AutoModelForCausalLM class of Huggingface's transfomer library, and models reachable via Anthropic and OpenAI APIs can be used. When applying other models, use the data files directly for generation, or adapt the lm.py file (and generate_output.py.

The GenderPersona dataset is a translation and extension of the HONEST dataset of Nozza et al. The GerBBQ+ dataset is mainly a translation of the BBQ dataset of Parrish et al.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
code		code
data		data
make_data		make_data
sample_output		sample_output
.gitattributes		.gitattributes
Gender Bias in German LLMs (Full Thesis).pdf		Gender Bias in German LLMs (Full Thesis).pdf
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases

Packages

Languages

rwth-i6/Gender-Bias-in-German-LLMs

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages