|
| 1 | +--- |
| 2 | +title: "Dataverse, CEDAR and RO-Crate: The building blocks of ARP, the federated research data repository of Hungary" |
| 3 | +author: ["Balázs E. Pataki"] |
| 4 | +email: "pataki@sztaki.hu" |
| 5 | +date: 2025-10-13 |
| 6 | +related_pages: |
| 7 | + use_cases: [aroma, dataverse, ] # should match items in pages/use_cases |
| 8 | +--- |
| 9 | + |
| 10 | + |
| 11 | +The [ARP Adatrepozitórium Platform](http://researchdata.hu/en) (Data Repository Platform) is a national initiative of the |
| 12 | +[Hungarian Research Network (HUN-REN)](https://hun-ren.hu/home), led by [SZTAKI (HUN-REN Institute for Computer |
| 13 | +Science and Control)](https://sztaki.hun-ren.hu/en), to build a multidisciplinary repository system capable of hosting the |
| 14 | +scientific data output of the Hungarian research community. ARP is a federated service that |
| 15 | +collects data from its own Dataverse repository as well as other institutional data repositories in |
| 16 | +Hungary to establish a European Open Science Cloud (EOSC)-compliant research |
| 17 | +infrastructure. |
| 18 | + |
| 19 | +<img src="arp_services_en.png" |
| 20 | +alt="ARP is built integrating Dataverse with added RO-Crate authoring capabilities via AROMA, CEDAR as the schema registry, and a custom search engine based on a a knowledge graph collecting RO-Crate data from ARP and data of other datasets from various Hungarian data repositories" |
| 21 | +title="ARP core services" |
| 22 | +style="width:30%; max-width:800px; display:block; margin:auto;"> |
| 23 | +<figcaption style="text-align:center; margin-bottom: 10px;">ARP core services</figcaption> |
| 24 | + |
| 25 | +ARP is built around [Harvard's Dataverse](https://dataverse.org/). However, we found that to make ARP truly multidisciplinary, |
| 26 | +research communities need the ability to easily author and publish metadata schemas relevant |
| 27 | +to their scientific fields. While Dataverse provides metadata schemas via its metadata blocks |
| 28 | +and TSV input format, creating, updating, and curating these are challenging for researchers |
| 29 | +who are not metadata experts. To address this, we integrated [Stanford's CEDAR](https://metadatacenter.org/) as the schema authoring |
| 30 | +registry tool for ARP. With this integration, researchers can now easily create complex schemas |
| 31 | +using CEDAR’s drag-and-drop template editor and export and synchronize these with |
| 32 | +Dataverse, effectively enabling direct metadatablock authoring within Dataverse. |
| 33 | + |
| 34 | +Another limitation of Dataverse is that it provides only minimal metadata for the files within a |
| 35 | +dataset. To overcome this and enhance dataset accessibility, we introduced the Research |
| 36 | +Object Crate (RO-Crate) exchange format as a first-class object of ARP. We not only added |
| 37 | +export and import functionality for RO-Crate but also developed a complete authoring tool, |
| 38 | +[AROMA]({{baseurl}}{% link pages/use_cases/aroma.md %}) |
| 39 | +, based on the RO-Crate editor component of the [Describo](https://github.com/describo/crate-builder-component) project. |
| 40 | +The use of RO-Crate and the integration of CEDAR schemas make it possible to associate |
| 41 | +metadata not only with the root dataset but also with any files and subdirectories. Users can |
| 42 | +define their own specific schemas in CEDAR, associate them with selected files and directories |
| 43 | +of their dataset in AROMA, and use these schemas to describe their data in greater depth than |
| 44 | +was previously possible in Dataverse. |
| 45 | + |
| 46 | +In this video we demonstrate how Dataverse, CEDAR, and RO-Crate enhance the metadata workflow of the |
| 47 | +Hungarian research community and how our achievements could be applied to the broader |
| 48 | +Dataverse community. |
| 49 | + |
| 50 | +<div style="position: relative; width: 100%; height: 0; padding-bottom: 56.25%;"> |
| 51 | + <iframe |
| 52 | + src="https://www.youtube.com/embed/o_ENdITtIQg?si=1_c4BTG-esD04Duc" |
| 53 | + title="YouTube video player" |
| 54 | + style="position: absolute; top: 0; left: 0; width: 100%; height: 100%;" |
| 55 | + frameborder="0" |
| 56 | + allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" |
| 57 | + referrerpolicy="strict-origin-when-cross-origin" |
| 58 | + allowfullscreen> |
| 59 | + </iframe> |
| 60 | +</div> |
| 61 | + |
| 62 | + |
0 commit comments