Skip to content

Releases: corentin-ryr/MultiMedEval

MultiMedEval 1.0.0

23 Jul 14:44

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.1.1...v1.0.0

MultiMedEval 0.1.1

18 Mar 16:31
0f7cffe

Choose a tag to compare

New features:

  • Dynamic datasets: use any dataset formatted correctly and apply the metrics of a task "family" (QA, VQA, Report Comparison, Image Classification, and NLI). This feature adds more flexibility to MultiMedEval.
  • Added Diff-VQA [Paper] to the list of supported tasks.
  • Updated RadCliQ to reflect more closely the results in the [Paper]

In addition to the new features, we added a suite of unit tests and corrected some bugs.

MultiMedEval 0.1

15 Feb 15:41

Choose a tag to compare

v0.1.0

Removed BnB from benchmarking code