Skip to content

Releases: llm-jp/llm-jp-eval-mm

v0.4.1

18 May 04:27
e91ba55

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.4.0...v0.4.1

v0.4.0

27 Mar 06:27
d9d18a8

Choose a tag to compare

What's Changed

Full Changelog: v0.3.0...v0.4.0

v0.3.0

23 Mar 16:23
14d32c8

Choose a tag to compare

What's Changed

  • Generalize aggregate() output type and Remove unnecessary methods by @speed1313 in #131
  • Improve JDocQA's preparation time and Fix JMMMU scoring and Add phi4 and Refactoring by @speed1313 in #141
  • Add visualization script by @speed1313 in #143
  • Fix Heron-bench scoring and Add Asagi model by @speed1313 in #146

Full Changelog: v0.2.2...v0.3.0

v0.2.2

19 Mar 06:34
b1f25c0

Choose a tag to compare

v0.2.2 Pre-release
Pre-release

What's Changed

  • [WIP] Add gemma3 and Qwen2.5 VL and sarashina and Refactoring by @speed1313 in #123

Full Changelog: v0.2.1...v0.2.2

v0.2.1

17 Mar 10:23
006eb58

Choose a tag to compare

v0.2.1 Pre-release
Pre-release

What's Changed

New Contributors

Full Changelog: v0.2.0...v0.2.1

v0.2.0

27 Jan 02:27
95c9279

Choose a tag to compare

v0.2.0 Pre-release
Pre-release

What's Changed

Full Changelog: v0.1.2...v0.2.0

v0.1.2

05 Dec 06:43

Choose a tag to compare

v0.1.2 Pre-release
Pre-release

What's Changed

Full Changelog: v0.1.1...v0.1.2

v0.1.1

28 Nov 03:39

Choose a tag to compare

v0.1.1 Pre-release
Pre-release

Full Changelog: v0.1.0...v0.1.1

Fix dependencies to publish the package.

v0.1.0

28 Nov 03:34
f593fec

Choose a tag to compare

v0.1.0 Pre-release
Pre-release

What's Changed

Full Changelog: v0.0.7...v0.1.0

v0.0.7

08 Nov 12:57
ad1a24d

Choose a tag to compare

v0.0.7 Pre-release
Pre-release

Full Changelog: v0.0.6...v0.0.7