Replies: 2 comments 6 replies
Hey! To match the full item ID (like 2024.findings-emnlp.355) rather than author IDs, you might need to combine the existing RE_ITEM_ID with the collection and volume prefixes. Since Anthology IDs have transitioned from the old letter-based format (like P19-1140) to the newer year-based one, a more comprehensive regex would be something like `^([A-Z]\d{2}-\d{4}|(\d{4})\.[a-z0-9.-]+\.\d+)$`. This targets the structure of both legacy and modern IDs. Author IDs typically don't have the dot-delimited structure or the legacy letter-plus-year prefix. Hope this helps refine your search!
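As a quick sanity check, here is a minimal sketch that runs the pattern from this comment against the IDs mentioned in the thread. The names `RE_FULL_PAPER_ID` and `looks_like_paper_id` are hypothetical and not part of `acl_anthology.utils.ids`, and the author-style slug is only an illustrative non-match.

```python
import re

# Pattern from the comment above, with the literal dots escaped.
# RE_FULL_PAPER_ID is a hypothetical name, not something the library exports.
RE_FULL_PAPER_ID = re.compile(r"^([A-Z]\d{2}-\d{4}|(\d{4})\.[a-z0-9.-]+\.\d+)$")


def looks_like_paper_id(text: str) -> bool:
    """Return True if `text` has the shape of a legacy or modern paper ID."""
    return RE_FULL_PAPER_ID.match(text) is not None


if __name__ == "__main__":
    candidates = [
        "2024.findings-emnlp.355",  # modern paper ID
        "P19-1140",                 # legacy paper ID
        "2024.findings-emnlp",      # volume ID (not covered by this pattern)
        "P19-1",                    # legacy volume ID (not covered either)
        "marcel-bollmann",          # author-style slug, illustrative only
    ]
    for candidate in candidates:
        print(candidate, looks_like_paper_id(candidate))
```

Under these assumptions, only the two paper IDs are accepted; volume IDs would need a separate or extended pattern.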
@mbollmann can a regex or validation function go in utils.ids?
https://acl-anthology.readthedocs.io/py-v1.0.0/api/utils.ids/ lists RE_ITEM_ID, but that is just for locating the item within the collection, not the full ID. I am looking for a regex that would match "2024.findings-emnlp.355", "P19-1140", and maybe also volumes like "2024.findings-emnlp", "P19-1".
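To make the request concrete, here is a rough sketch of what such a validation helper could look like, covering both paper and volume IDs. Everything in it is an assumption rather than library code: `RE_LEGACY_ID`, `RE_MODERN_ID`, and `classify_anthology_id` are hypothetical names, legacy volumes are assumed to use one or two digits after the dash and legacy papers exactly four, and the venue and volume parts of modern IDs are assumed to be lowercase alphanumerics.

```python
import re
from typing import Optional

# Hypothetical patterns, not taken from acl_anthology.utils.ids.
# Legacy: letter + two-digit year + "-" + 4 digits (paper) or 1-2 digits (volume).
RE_LEGACY_ID = re.compile(r"[A-Z]\d{2}-(?:(?P<paper>\d{4})|(?P<volume>\d{1,2}))")
# Modern: year + "." + venue + "-" + volume, optionally followed by "." + paper number.
RE_MODERN_ID = re.compile(r"\d{4}\.[a-z0-9]+-[a-z0-9]+(?:\.(?P<paper>\d+))?")


def classify_anthology_id(text: str) -> Optional[str]:
    """Return "paper", "volume", or None if `text` matches neither ID shape."""
    for pattern in (RE_LEGACY_ID, RE_MODERN_ID):
        match = pattern.fullmatch(text)
        if match:
            # The "paper" group is only set when a paper number is present.
            return "paper" if match.group("paper") else "volume"
    return None


if __name__ == "__main__":
    for candidate in ["2024.findings-emnlp.355", "P19-1140",
                      "2024.findings-emnlp", "P19-1", "marcel-bollmann"]:
        print(candidate, classify_anthology_id(candidate))
```

Using `fullmatch` avoids the need for `^` and `$` anchors, and returning the kind of ID instead of a bare boolean lets callers tell volumes from papers with a single call.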