Setup

This repository accompanies the paper AustroTox: A Dataset for Target-Based Austrian German Offensive Language Detection.

Setup

Clone the repository

git clone https://github.com/pi-pa/austrotox.git
$ cd austrotox

Create the virtual environment and install requirements...

... using Python 3.9.2

$ python3 -m venv env
$ source env/bin/activate

For the encoder experiments

$ pip3 install -r requirements_encoders.txt

For the LLM experiments

$ pip3 install -r requirements_llms.txt

...using Conda

$ conda create -n env python=3.9 pip
$ conda activate env

For the encoder experiments

$ pip install -r requirements_encoders.txt

For the LLM experiments

$ pip install -r requirements_llms.txt

Inspect the cross validation splits

Validate the splits by running:

python3 src/check_split_overlap.py --num_versions 10 --path_data_dir data/german/train_dev_test
python3 src/compute_split_stats.py --num_splits 10 --path_data_dir data/german/train_dev_test

Classifications using LLMs

GPT 3.5

Predict 0-shot:

python3 src/get_chatgpt_predictions.py --input_path data/german/train_dev_test/ --output_path data/german/predictions/gpt35/0shot --multitask --num_shots 0 --random_seed 1 --model_name gpt-3.5-turbo-1106
python3 src/compute_metrics.py --path_predictions data/german/predictions/gpt35/0shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/gpt35/0shot/ --multitask
python3 src/compute_metrics.py --path_predictions data/german/predictions/gpt35/0shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/gpt35/0shot/ --multitask --consider_only_spans --span_requirement "both"

Predict 5-shot:

python3 src/get_chatgpt_predictions.py --input_path data/german/train_dev_test/ --output_path data/german/predictions/gpt35/5shot --multitask --num_shots 5 --random_seed 1 --model_name gpt-3.5-turbo-1106
python3 src/compute_metrics.py --path_predictions data/german/predictions/gpt35/5shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/gpt35/5shot/ --multitask
python3 src/compute_metrics.py --path_predictions data/german/predictions/gpt35/5shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/gpt35/5shot/ --multitask --consider_only_spans --span_requirement "both"

GPT 4

Predict 0-shot:

python3 src/get_chatgpt_predictions.py --input_path data/german/train_dev_test/ --output_path data/german/predictions/gpt4/0shot --multitask --num_shots 0 --random_seed 1 --model_name gpt-4-1106-preview
python3 src/compute_metrics.py --path_predictions data/german/predictions/gpt4/0shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/gpt4/0shot/ --multitask
python3 src/compute_metrics.py --path_predictions data/german/predictions/gpt4/0shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/gpt4/0shot/ --multitask --consider_only_spans --span_requirement "both"

Predict 5-shot:

python3 src/get_chatgpt_predictions.py --input_path data/german/train_dev_test/ --output_path data/german/predictions/gpt4/5shot --multitask --num_shots 5 --random_seed 1 --model_name gpt-4-1106-preview
python3 src/compute_metrics.py --path_predictions data/german/predictions/gpt4/5shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/gpt4/5shot/ --multitask
python3 src/compute_metrics.py --path_predictions data/german/predictions/gpt4/5shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/gpt4/5shot/ --multitask --consider_only_spans --span_requirement "both"

Mistral

Use Generation

0-shot:

CUDA_VISIBLE_DEVICES=7 python3 src/predict.py --path_splits_dir data/german/train_dev_test/ --path_predictions_dir data/german/predictions/mistral-instruct-v02-generation/0shot/ --model_path_or_identifier mistralai/Mistral-7B-Instruct-v0.2 --num_new_tokens 5 --num_shots 0 
python3 src/compute_metrics.py --path_predictions data/german/predictions/mistral-instruct-v02-generation/0shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/mistral-instruct-v02-generation/0shot/

5-shot:

CUDA_VISIBLE_DEVICES=6 python3 src/predict.py --path_splits_dir data/german/train_dev_test/ --path_predictions_dir data/german/predictions/mistral-instruct-v02-generation/5shot/ --model_path_or_identifier mistralai/Mistral-7B-Instruct-v0.2 --num_new_tokens 5 --num_shots 5 
python3 src/compute_metrics.py --path_predictions data/german/predictions/mistral-instruct-v02-generation/5shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/mistral-instruct-v02-generation/5shot/

Use Logits

0-shot:

CUDA_VISIBLE_DEVICES=6,7 python3 src/predict.py --path_splits_dir data/german/train_dev_test/ --path_predictions_dir data/german/predictions/mistral-instruct-v02-logits/0shot/ --model_path_or_identifier mistralai/Mistral-7B-Instruct-v0.2 --use_logits --num_shots 0
python3 src/compute_metrics.py --path_predictions data/german/predictions/mistral-instruct-v02-logits/0shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/mistral-instruct-v02-logits/0shot/

5-shot:

CUDA_VISIBLE_DEVICES=4,5 python3 src/predict.py --path_splits_dir data/german/train_dev_test/ --path_predictions_dir data/german/predictions/mistral-instruct-v02-logits/5shot/ --model_path_or_identifier mistralai/Mistral-7B-Instruct-v0.2 --use_logits --num_shots 5
python3 src/compute_metrics.py --path_predictions data/german/predictions/mistral-instruct-v02-logits/5shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/mistral-instruct-v02-logits/5shot/

LEO LM

Use Logits

Predict 0-shot:

CUDA_VISIBLE_DEVICES=6,7 python3 src/predict.py --path_splits_dir data/german/train_dev_test/ --path_predictions_dir data/german/predictions/leo-hessianai-7b-chat-logits/0shot/ --model_path_or_identifier LeoLM/leo-hessianai-7b-chat --use_logits --num_shots 0
python3 src/compute_metrics.py --path_predictions data/german/predictions/leo-hessianai-7b-chat-logits/0shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/leo-hessianai-7b-chat-logits/0shot/

Predict 5-shot:

CUDA_VISIBLE_DEVICES=4,5 python3 src/predict.py --path_splits_dir data/german/train_dev_test/ --path_predictions_dir data/german/predictions/leo-hessianai-7b-chat-logits/5shot/ --model_path_or_identifier LeoLM/leo-hessianai-7b-chat --use_logits --num_shots 5
python3 src/compute_metrics.py --path_predictions data/german/predictions/leo-hessianai-7b-chat-logits/5shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/leo-hessianai-7b-chat-logits/5shot/

Llama 3

Use Logits

Predict 0-shot:

CUDA_VISIBLE_DEVICES=5,6 python3 src/predict.py --path_splits_dir data/german/train_dev_test/ --path_predictions_dir data/predictions/german/Llama3_8B/0shot/ --model_path_or_identifier meta-llama/Meta-Llama-3-8B-Instruct --use_logits --num_shots 0 --language de
python3 src/compute_metrics.py --path_predictions data/predictions/german/Llama3_8B/0shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/metrics/german/Llama3_8B/0shot/

Predict 5-shot:

CUDA_VISIBLE_DEVICES=5,6 python3 src/predict.py --path_splits_dir data/german/train_dev_test/ --path_predictions_dir data/predictions/german/Llama3_8B/5shot/ --model_path_or_identifier meta-llama/Meta-Llama-3-8B-Instruct --use_logits --num_shots 5 --language de
python3 src/compute_metrics.py --path_predictions data/predictions/german/Llama3_8B/5shot/ --path_true_labels data/german/train_dev_test/ --path_metrics data/metrics/german/meta-llama/Llama3_8B/5shot/

Predict 0-shot Multitask:

CUDA_VISIBLE_DEVICES=0,1 python3 src/predict.py --path_splits_dir data/german/train_dev_test/ --path_predictions_dir data/predictions/german/Llama3_8B/0shot-MT/ --model_path_or_identifier meta-llama/Meta-Llama-3-8B-Instruct --num_shots 0 --language de --multitask
python3 src/compute_metrics.py --path_predictions data/predictions/german/Llama3_8B/0shot-MT/ --path_true_labels data/german/train_dev_test/ --path_metrics data/metrics/german/Llama3_8B/0shot-MT/

Predict 5-shot Multitask:

CUDA_VISIBLE_DEVICES=5,6 python3 src/predict.py --path_splits_dir data/german/train_dev_test/ --path_predictions_dir data/predictions/german/Llama3_8B/5shot-MT/ --model_path_or_identifier meta-llama/Meta-Llama-3-8B-Instruct --use_logits --num_shots 5 --language de --multitask
python3 src/compute_metrics.py --path_predictions data/predictions/german/Llama3_8B/5shot-MT/ --path_true_labels data/german/train_dev_test/ --path_metrics data/metrics/german/meta-llama/Llama3_8B/5shot-MT/

Fine-tuning LLMs

Optionally, the LLMs can be fine-tuned to the task. We didn't use this code for the experiments in the paper.

Fine-tune 10 models with a train/dev/test ratio of 8/1/1:

CUDA_VISIBLE_DEVICES=0 python3 src/train.py --num_cross_eval_splits 3 --path_splits_dir data/german/train_dev_test/ --path_model_dir data/german/models/ --hf_identifier LeoLM/leo-hessianai-7b-chat --num_epochs 3

Predict with the models on test splits:

python3 src/predict.py --path_splits_dir data/german/train_dev_test/ --path_model_dir data/german/models/ --path_predictions_dir data/german/predictions --hf_identifier LeoLM/leo-hessianai-7b-chat --num_new_tokens 50 --gpu 1

Evaluate predictions and compute average scores:

python3 src/compute_metrics.py --path_predictions data/german/predictions/ --path_true_labels data/german/train_dev_test/ --path_metrics data/german/metrics/

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data		data
results/german		results/german
src		src
.gitignore		.gitignore
README.md		README.md
requirements_encoders.txt		requirements_encoders.txt
requirements_llms.txt		requirements_llms.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Setup

Clone the repository

Create the virtual environment and install requirements...

... using Python 3.9.2

...using Conda

Inspect the cross validation splits

Classifications using LLMs

GPT 3.5

GPT 4

Mistral

Use Generation

Use Logits

LEO LM

Use Logits

Llama 3

Use Logits

Fine-tuning LLMs

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Setup

Clone the repository

Create the virtual environment and install requirements...

... using Python 3.9.2

...using Conda

Inspect the cross validation splits

Classifications using LLMs

GPT 3.5

GPT 4

Mistral

Use Generation

Use Logits

LEO LM

Use Logits

Llama 3

Use Logits

Fine-tuning LLMs

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages