There are many more ES records for COL XR in GBIF than there should be.
The 2025.10 XR dwca contains 9.4 million Taxon.txt records, but the ES index says 14m.
This happened before. I had then removed the dataset from ES and indexed again. Sth must be wrong in the code or setup. Maybe this effects other datasets too?