-
Notifications
You must be signed in to change notification settings - Fork 14
Description
Project Description
This project will explore the use of emerging AI tools to aid in the transformation of "raw" biological occurrence data produced by researchers into standardized formats (Darwin Core (DwC)) for ingestion into systems like OBIS and GBIF.
Examples from the Standardizing Marine Biological Data (SMBD) working group and associated annual Biodata Mobilization Workshops are available for use to guide AI prompt tailoring.
A successful project will likely include multiple AI prompts working through processes like:
- AI prompt generation and AI service API usage
- exposing a variety of "raw" data to an AI agent
- using AI to map between "raw" and standardized vocabularies like DwC and NERC
- generating requests for additional information from data providers
- adding information received from data providers into standardized documentation
- generating standard documents for example datasets
Automation of the process through the GitHub issue tracker may be possible.
Use of automated emailing systems may be helpful as well.
Expected Outcomes
A software project published to GitHub demonstrating the concept.
Skills required
Ideally python. API usage familiarity. Experience with automated emailing or GitHub issues API desired but not necessary.
Expertise
Intermediate
Topic Lead(s)
Relevant links
No response