Retrieval Metrics Indexes Parameter needs Continuous Indexes? #2757
Replies: 1 comment
-
Thank you for raising this important point about the |
Beta Was this translation helpful? Give feedback.
-
Thank you for raising this important point about the |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Sadly, there doesn't seem to be a retrieval metrics category, so I've added it here.
I seem to have had a crucial misunderstanding of the retrieval metrics indexes parameter and want confirmation that I got it right now and raise awareness of this issue.
So I've always understood the indexes parameter as a way for torchmetrics to differentiate the queries on BATCH-LEVEL. As in, the following code snippets should be equal:
When in reality the first one is equal to:
It believes the first query of the first batch belongs to the same query as the first query of the first batch instead of acknowledging that it's a different batch and therefore probably a different query.
So the code I've used in my collator:
That only iterates on the batch level seems to have been wrong the whole time.
Might be that this is something obvious for people that work more retrieval, but it wasn't obvious to me from the docs and I found absolutely zero discussions about this online. ChatGPT-4o also didn't get that one right, whatever that's worth (Chat).
Could somebody confirm this find? And tell me whether I was stupid and this should have been obvious or whether some clarification should be added to the docs?
Beta Was this translation helpful? Give feedback.
All reactions