Skip to content

gbif/obis-dataset-candidates

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OBIS Dataset Candidates

A project to identify GBIF datasets that are marine-related but not yet part of OBIS (Ocean Biodiversity Information System).

Overview

This project scans GBIF for datasets that should potentially be included in the OBIS network. It uses marine keywords to find candidates, calculates their WORMS (World Register of Marine Species) taxonomic coverage, and filters datasets with ≥70% WORMS coverage.

Output Files

The filtered candidate datasets are saved to:

📁 exports/obis_candidates_filtered.tsv - Final list of GBIF datasets recommended for OBIS inclusion

Additional intermediate files:

Selection Criteria

Datasets are included if they:

  • Match marine-related keywords
  • Are NOT already in the OBIS network
  • Are NOT from PANGAEA publisher (d5778510-eb28-11da-8629-b8a03c50a862)
  • Have ≥70% of species present in the WORMS taxonomy

Automated Updates

The workflow runs automatically on the 1st of each month via GitHub Actions, updating the candidate list with newly published datasets.

OBIS Network

About

Creates issues and keep track of potential datasets that should part of OBIS network.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors