
Comparison of evaluation measures for generated text based on pre-trained language models

Experimental framework for the paper "Pre-trained language models evaluating themselves - A comparative study".

Mark-Evaluate

  • The implemented version of Mark-Evaluate is available at src/metric/ME/

Prerequisites

Re-run the experiment

The experiment was run using the "run_all_screens.sh" script. The results are written to hidden folders named after the respective experiment run.
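
A minimal usage sketch (assumptions: the script is executed from the repository root, and GNU screen is installed):

    bash run_all_screens.sh    # starts screen sessions for the experiment runs (assumption)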

Experiment-Project

  • src/Experiment.py:

    • Handling of artificial impairment and evaluation
  • Tasks/:

    • Task.py: Base class for all other tasks
    • OneDim.py: Base class for tasks where damage is applied along only one dimension (see the sketch after this list)
    • *.py: Specific tasks
  • metrics/:

    • Metric.py: Base class for all metrics, including specific information regarding the behaviour of the metric
    • Custom metric folders must be stored here
    • custom_metrics/
      • *.py: Inheriting from Metric.py.
  • Plot.py

    • Plots the results by task
  • PlotByMetric.py:

    • Inherits from Plot.py; plots the results by metric
  • *entry.py:

    • Modified main.py entry points for the GNU screen version in run_all_screens.sh
  • main.py

    • Entry point for the experiment
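
The task hierarchy above can be illustrated with the following self-contained sketch; all class and method names below are assumptions for illustration, not the repository's actual interfaces:

    # Illustrative sketch only (hypothetical names); the real base classes
    # live in Tasks/Task.py and Tasks/OneDim.py.
    from abc import ABC, abstractmethod

    class Task(ABC):
        """Stands in for Tasks/Task.py: base class for all tasks."""

        @abstractmethod
        def perturb(self, text: str, degree: float) -> str:
            """Return an artificially impaired copy of the text."""

    class OneDim(Task):
        """Stands in for Tasks/OneDim.py: damage along one dimension."""

        def perturb(self, text: str, degree: float) -> str:
            words = text.split()
            n_damaged = int(len(words) * degree)  # number of words to damage
            return " ".join(self._damage(words, n_damaged))

        def _damage(self, words: list, n: int) -> list:
            raise NotImplementedError

    class DropWords(OneDim):
        """Hypothetical concrete task: delete the first n words."""

        def _damage(self, words: list, n: int) -> list:
            return words[n:]

    print(DropWords().perturb("the quick brown fox jumps", 0.4))  # brown fox jumps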

Folder Structure

  • The folder metrics contains all data needed to compute the metrics
    • To evaluate a metric, a wrapper must be written based on the class Metric.py (a minimal sketch follows the tree below)
    • All custom metric wrapper files must be stored in the custom_metrics folder
    • All metric-related files (e.g. repositories, models, etc.) should be stored in the metrics folder, resulting in the following layout:
    .
    ├── ...
    ├── metrics
    │   ├── RespectiveMetric1Folder     # Folder including prerequisites for metric_1_wrapper.py
    │   ├── RespectiveMetric2Folder     # Folder including prerequisites for metric_2_wrapper.py
    │   ├── Metric.py                   # Parent class for metric wrappers
    │   └── custom_metrics              # Put metric wrapper Python files here
    │       ├── metric_1_wrapper.py
    │       ├── metric_2_wrapper.py
    │       └── ...
    └── ...
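
A custom wrapper might look like the following self-contained sketch; the interface of the Metric base class (and its import path) is an assumption, to be adapted to the actual definitions in metrics/Metric.py:

    # Inside the project this file would live in metrics/custom_metrics/ and
    # import the real base class (assumed path): from metrics.Metric import Metric
    class Metric:
        """Stub standing in for metrics/Metric.py."""

        def score(self, candidates: list, references: list) -> list:
            raise NotImplementedError

    class UnigramOverlap(Metric):
        """Hypothetical metric: unigram overlap of candidate and reference."""

        def score(self, candidates: list, references: list) -> list:
            scores = []
            for cand, ref in zip(candidates, references):
                cand_set, ref_set = set(cand.split()), set(ref.split())
                scores.append(len(cand_set & ref_set) / max(len(ref_set), 1))
            return scores

    print(UnigramOverlap().score(["the cat sat"], ["the cat sat down"]))  # [0.75]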

Visualizing the results

  • To visualize the results, collect all folders containing examples and move them into a folder called 'outputs' in the root of the project. Create another folder, 'figures', in the root directory; the Jupyter notebook plot.ipynb can then create the plots and tables. The resulting figures are stored in the figures folder. A possible setup is sketched below.
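
A possible setup from the project root (the hidden folder name below is a placeholder for an actual result folder):

    mkdir outputs figures                  # both in the project root
    mv .<experiment-run-name> outputs/     # repeat for every result folder
    jupyter notebook plot.ipynb            # creates the plots and tables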
