Skip to content

labri-progress/cidiff-analysis

Repository files navigation

CiDiff article data analysis

Organisation

This repository is split in 3 projects. See each project README for more informations.

Java Project (annotator-java)

Generate the annotations csv for the following algorithms:

  • cidiff
  • lcs-diff
  • bigram
  • bigram-drain
  • cidiff-drainsim

Rust Project (annotator-rust)

Generate the annotations csv for the following algorithms:

  • human
  • keyword
  • gpt

This project has also a TUI to annotate the logs by a human.

Python project

The analysis code using the generated CSVs.

  • csv/dataset.csv: infomations about all the pairs of logs of our dataset
  • csv/annotations.csv: lines flagged as relevant to understand the failures on the 100 randomly drawn cases
  • csv/benchmark.csv: metrics about the results of LCS-diff and CiDiff and the whole dataset
  • csv/survey.csv: user preferences between LCS-diff and CiDif on the 100 randomly drawn cases
  • analysis.ipynb: notebook containing all the analysis code

Viewers (viewers-generator)

The code to generate the viewers for the user evaluations.

It also contains the guide used for the evaluation.

About

Analysis code for the CiDiff algorithm

Resources

License

Stars

Watchers

Forks

Contributors 2

  •  
  •