Indexed COL contains redundant ES records #318

@mdoering

There are many more ES records for COL XR in GBIF than there should be.
The 2025.10 XR dwca contains 9.4 million Taxon.txt records, but the ES index reports 14 million.

This has happened before. That time I removed the dataset from ES and indexed it again. Something must be wrong in the code or setup. Could this affect other datasets too?
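
To put a number on the discrepancy, here is a minimal Python sketch. It uses the rounded counts quoted above; `surplus` and `duplicated_ids` are hypothetical helper names, and the list of IDs passed to `duplicated_ids` is assumed to come from some export of the index that is not shown here. If the extra records are re-indexed copies, the same ID should show up more than once.

```python
from collections import Counter


def surplus(expected: int, indexed: int) -> tuple[int, float]:
    """Return (extra records, extra records as a fraction of expected)."""
    extra = indexed - expected
    return extra, extra / expected


def duplicated_ids(ids):
    """Return a dict of IDs that occur more than once, with their counts."""
    counts = Counter(ids)
    return {key: n for key, n in counts.items() if n > 1}


# Rounded figures from this issue: Taxon.txt records vs. ES index count.
extra, fraction = surplus(9_400_000, 14_000_000)
print(f"{extra:,} surplus records ({fraction:.0%} over the archive)")

# Hypothetical sample of IDs exported from the index (assumption).
print(duplicated_ids(["t1", "t2", "t1", "t3"]))
```

If `duplicated_ids` comes back empty on a real export, the surplus would more likely be stale records from an earlier release than straight duplicates.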
