Skip to content

[Project Proposal]: AI for Darwin Core alignment #67

@7yl4r

Description

@7yl4r

Project Description

This project will explore the use of emerging AI tools to aid in the transformation of "raw" biological occurrence data produced by researchers into standardized formats (Darwin Core (DwC)) for ingestion into systems like OBIS and GBIF.

Examples from the Standardizing Marine Biological Data (SMBD) working group and associated annual Biodata Mobilization Workshops are available for use to guide AI prompt tailoring.

A successful project will likely include multiple AI prompts working through processes like:

  • AI prompt generation and AI service API usage
  • exposing a variety of "raw" data to an AI agent
  • using AI to map between "raw" and standardized vocabularies like DwC and NERC
  • generating requests for additional information from data providers
  • adding information received from data providers into standardized documentation
  • generating standard documents for example datasets

Automation of the process through the GitHub issue tracker may be possible.
Use of automated emailing systems may be helpful as well.

Expected Outcomes

A software project published to GitHub demonstrating the concept.

Skills required

Ideally python. API usage familiarity. Experience with automated emailing or GitHub issues API desired but not necessary.

Expertise

Intermediate

Topic Lead(s)

@7yl4r

Relevant links

No response

Metadata

Metadata

Labels

code sprint topicProposed topic for a code sprint activity

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions