Building the Söd★mizer: AI-Driven Machine Translation for Gender-Sensitive to Gender-Inclusive Language Transformation

Introduction

Söd★mizer is an experimental machine translation system designed to convert gender-sensitive German texts into either generic masculine/feminine forms or explicitly gender-inclusive formulations.

What is Gendering?

The discussion surrounding gender-sensitive language in German linguistics, politics, and society is highly controversial. At the center of the debate is the generic masculine—the use of male forms to refer to mixed or unspecified groups (Genderleicht, 2025). While proponents of this language convention argue that it is gender-neutral, empirical studies show that this is not the case. For example, Braun, Sczesny, and Stahlberg (Braun et al., 2005) found that the generic masculine is primarily associated with men, which can reinforce cognitive biases and potentially gender-specific prejudices. In response, various strategies for gender-fair language have been developed, which are comprehensively discussed in linguistic guidelines (Diewald and Steinhauer, 2020).

There are several forms for gender-fair language; however, for the sake of simplicity, they are collectively referred to in this project as:

Neutral form: Neutralization of the substantive, e.g., Student:innen → Studierende (students)

Generic masculine/feminine: Adaptation to the respective context, e.g., Student:innen → Studenten or Studentinnen (male or female students)

Gender-inclusive form: Explicit mention of all genders, e.g., Student:innen → Studenten, Studentinnen und Personen an derer geschlechtlicher Identität, die an einer Hochschule studieren (male and female students and people of other gender identities studying at a university)

For further information, please refer to 1. Introduction in paper.pdf.

Project Naming

The project is named "Söd★mizer," a play on the name of Bavarian Prime Minister Markus Söder, who has publicly opposed gender-sensitive language. The aim of this project is to develop an automated translator that removes gender forms from texts, thereby creating a "Söder-compliant" language. The choice of name highlights the satirical approach of the project: through deliberate exaggeration and particularly convoluted formulations, the debate is humorously exaggerated.

Research Goals

Goal 1: Implementing a simple proof-of-concept for the removal of gender forms using Google’s Flan-T5-small. This serves as a basis for evaluating the suitability of small language models for translation tasks in German. It involves the reduction of gender-sensitive special characters to generic masculine or feminine. For example: Die Student:innen sind sehr fleißig. → Die Studenten sind sehr fleißig.
Goal 2: Goal 2: Transforming gender-sensitive sentences into a gender-inclusive form. This is carried out in two steps: (A) Expanding an existing gold standard from CorrelAid using the OpenAI API to generate sentences with inclusive versions. (B) Fine-tuning Flan-T5-small on this new dataset to test whether smaller models can also perform such translations

Usage

1. Set Up the Environment

Create a virtual environment:
```
python -m venv .env
```
Activate the virtual environment:
- On Windows:
```
.\.env\Scripts\activate
```
- On Linux/MacOS:
```
source .env/bin/activate
```
Install the dependencies:
```
pip install -r requirements.txt
```

2. Create Dataset (Failed Attempt)

Run the dataset script:
```
cd archive
python dataset.py
```

NOTE: A Java 8 or higher installation is required for grammar correction using language-tool-python.

Why this method failed

The automated dataset creation approach failed due to several reasons. Look into subsection 3.3.1 in paper.pdf for details.

"However, this method proved insufficient, as many sentences contained syntactical errors, and the dataset overall was not robust enough. The naive gender translation caused several issues: grammar errors occurred when articles and cases were not properly adjusted by the grammar tool (e.g., "der Regie" instead of "die Regie"). Additionally, masculine pronouns were often retained, making gender transformation inconsistent. In some cases, sentences became incomprehensible, for example, when names or references were altered without considering the context (e.g., "seither ist es Thomas Lee")."

3. Dataset Creation for Goal 1 + 2

Run the preprocessing script:
```
python preprocessing.py
```

4. Train Model

Install PyTorch:
```
pip3 install torch --index-url https://download.pytorch.org/whl/cu124
```
NOTE: Ensure you select the correct CUDA version. Refer to the PyTorch documentation for more details.
Start Training:
```
python training.py
```

5. Model Evaluation

Generate Predictions:
```
python evaluation.py
```
Analyze Results:
```
python analysis_results.py
```

Results

Manual evaluations were performed for determining the translation success rate. Please look into manual_evaluation.xlsx contained in the respective experiments/<SELECT GOAL>/results/output/-folders for the manually evaluated results of goal 1 and 2. For the manual evaluation of the synthetic data, refer to data/inclusive_form/v1/manual_evaluation.xlsx.

"The experiments showed that the Söd★mizer system achieved a weighted translation success rate of 92.7% when tasked with converting gender-sensitive text into the generic masculine or feminine forms. However, certain recurring failure cases—such as encoding artifacts, word omissions, and unintended neutralizations—were identified and remain areas for improvement. When generating explicitly gender-inclusive formulations, the model’s performance was significantly lower, with only 32% of outputs being semantically correct. While the model effectively applied syntactic transformations, it struggled with contextual coherence and precision, suggesting a need for future refinements such as enhanced prompt engineering, constrained decoding techniques, and domain-specific fine-tuning."

For a more comprehensive analysis, please refer to the detailed report at paper.pdf.

Contact

For any additional questions or comments, please contact me at dennis.mustafic@fau.de

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
archive		archive
data		data
experiments		experiments
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
analysis_results.py		analysis_results.py
evaluation.py		evaluation.py
folder_structure.py		folder_structure.py
paper.pdf		paper.pdf
preprocessing.py		preprocessing.py
requirements.txt		requirements.txt
sentence_structure.py		sentence_structure.py
sentences.csv		sentences.csv
synthetic_data.py		synthetic_data.py
training.py		training.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Building the Söd★mizer: AI-Driven Machine Translation for Gender-Sensitive to Gender-Inclusive Language Transformation

Table of contents

Introduction

What is Gendering?

Project Naming

Research Goals

Usage

1. Set Up the Environment

2. Create Dataset (Failed Attempt)

Why this method failed

3. Dataset Creation for Goal 1 + 2

4. Train Model

5. Model Evaluation

Results

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Building the Söd★mizer: AI-Driven Machine Translation for Gender-Sensitive to Gender-Inclusive Language Transformation

Table of contents

Introduction

What is Gendering?

Project Naming

Research Goals

Usage

1. Set Up the Environment

2. Create Dataset (Failed Attempt)

Why this method failed

3. Dataset Creation for Goal 1 + 2

4. Train Model

5. Model Evaluation

Results

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages