Standardized modelgauge column names #50

bkorycki · 2025-07-08T22:21:35Z

No description provided.

github-actions · 2025-07-08T22:21:47Z

MLCommons CLA bot: All contributors have signed the MLCommons CLA ✍️ ✅

.github/workflows/tests.yml

pyproject.toml

src/modelplane/utils/input.py

tests/it/runways/random_annotator.py

superdosh · 2025-07-09T14:10:45Z

src/modelplane/runways/annotator.py

-            except Exception as e:
-                print(f"Failed to log stats for {annotator_uid}: {e}")
+    with AnnotationDataset(data_path, "r") as dataset:
+        for item in dataset:


I'm not totally sure how this works when there are multiple annotators? If I'm reading the __iter__ in modelgauge.dataset correctly, each row produces one item, but I think if there are multiple annotators, each row will contain multiple annotator_uids? Or did that change too in the modelgauge PR?

That changed in the modelgauge PR! Every row is one response and one annotation.

That's probably why the randomness changed! Before, it looped over rows with an inner for loop over the annotators in each row, but the order of the annotators may not match the order modelgauge now produces.

I'm good on this then!
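To illustrate the point about randomness above, here is a minimal sketch of why a seeded random annotator can give different results once the iteration order changes. It does not use modelgauge or modelplane; the names `responses`, `annotators`, `nested_order`, and `flat_order` are hypothetical, and the flat row order shown is only an assumed example of a different ordering.

```python
import random

# Hypothetical data: two responses, each scored by two annotators.
responses = ["r1", "r2"]
annotators = ["a1", "a2"]


def nested_order(seed):
    """Old shape (as described in the thread): loop over rows,
    then loop over the annotators within each row."""
    rng = random.Random(seed)
    return {(r, a): rng.random() for r in responses for a in annotators}


def flat_order(seed, rows):
    """New shape: every row is exactly one (response, annotator) pair."""
    rng = random.Random(seed)
    return {(r, a): rng.random() for r, a in rows}


# Assumed ordering for illustration: rows grouped by annotator
# rather than by response.
rows = [(r, a) for a in annotators for r in responses]

old = nested_order(0)
new = flat_order(0, rows)

# Same seed, same set of draws, but the draws land on different pairs.
same_draws = sorted(old.values()) == sorted(new.values())
same_assignment = old == new
print(same_draws, same_assignment)
```

The seed and the sequence of random draws are identical in both cases; only the mapping from draw to (response, annotator) pair differs, which is enough to change per-row outputs in a test that compares against fixed expected values.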

bkorycki added 5 commits July 7, 2025 14:51

Update modelbench

796c9a8

update tests for sut responses

fd6419a

Update annotation + tests

19a4b93

update scorer

61f83e8

sneak in dvc branch fix

d37baa0

bkorycki requested a review from superdosh July 8, 2025 22:21

bkorycki requested a review from a team as a code owner July 8, 2025 22:21

bkorycki added 6 commits July 8, 2025 15:39

update groundtruth header

211f0ff

move unique sample id check

1913534

Merge branch 'main' into standardized-column-names-data-refactor

65bd132

Try clearing poetry cache in ci

f3f8912

add |

163f6af

try removing virtualenv

050d572

superdosh reviewed Jul 9, 2025


bkorycki added 2 commits July 9, 2025 09:12

update plugins

c2a82d1

Rename random annotator

5f5ff70

bkorycki requested a review from superdosh July 9, 2025 16:25

superdosh approved these changes Jul 9, 2025


bkorycki merged commit 6a5641a into main Jul 9, 2025
3 checks passed

github-actions bot locked and limited conversation to collaborators Jul 9, 2025


3 participants
