You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
#### Distribution of species-level median improvements
1042
1016
At the species level, median improvement scores were centred close to zero, indicating that for most species the omission of individual datasets had limited influence on prevalence estimates. A smaller number of species exhibited consistently positive or negative median improvements, suggesting higher sensitivity to data composition. Overall, cross-validation tends to move prevalence estimates closer to the true value.
There is apparently one very common species that is also highly influenced by one dataset.
1091
1065
1092
-
#### Identifying *outlying* species (diagnostic, not reported)
1066
+
#### Identifying outlying species (diagnostic, not reported)
1093
1067
One species showed unusually large relative deteriorations when datasets were omitted. This species can be treated as a diagnostic case which serves to identify potential data or modelling issues.
Without the *waarnemingen.be* dataset, the estimate is much lower.
1114
1088
1115
1089
### Influence of individual component datasets
1116
-
1117
-
The influence of component datasets varied substantially. A small number of datasets consistently yielded positive improvements when omitted, indicating a disproportionate influence on prevalence estimates. Other datasets had neutral or stabilising effects across most species.
1090
+
The *waarnemingen.be* dataset(s) show the largest influences (in both directions). For rare species, we see improvements, for common, we see deterioration.
where $\widetilde{RI}_s$ is the median relative improvement across all leave-one-dataset-out cross-validation runs for species $s$.
1154
+
1155
+
This metric is bounded between 0 (low robustness) and 1 (high robustness).
1156
+
1157
+
***Robustness ≈ 1**
1158
+
Prevalence estimates are largely insensitive to the omission of individual datasets.
1159
+
1160
+
***Robustness ≈ 0.5**
1161
+
Dataset removal typically changes error magnitude by ~50%.
1162
+
1163
+
***Robustness ≈ 0**
1164
+
Prevalence estimates are highly dependent on individual datasets.
1165
+
1166
+
Using the median ensures robustness against outlying datasets and prevents single influential components from dominating the score.
1167
+
1168
+
### Robustness by species
1169
+
Species-level robustness scores were generally high, with most species exhibiting values close to one, indicating limited sensitivity to the omission of individual datasets. A smaller subset of species showed lower robustness scores, reflecting stronger dependence on specific data components.
Using leave-one-dataset-out cross-validation, we assessed the sensitivity of prevalence estimates derived from the bird data cube to the composition of the underlying datasets, using ABV prevalence as a reference benchmark. Overall, omission of individual component datasets more often reduced than increased the deviation from the reference prevalence, although the magnitude of these improvements was typically small. This indicates that the prevalence indicator is generally robust to changes in dataset composition.
1166
1211
1167
-
> Mixed-effects modelling confirmed systematic differences in sensitivity across rarity classes, while accounting for species- and dataset-specific variability. Random effects indicated that both species identity and dataset identity contribute to variability in improvement scores, highlighting heterogeneous data influence within the monitoring network.
1212
+
Sensitivity patterns differed systematically across rarity classes. Rare species showed larger relative improvements and greater variability in error reduction than common species, reflecting their stronger dependence on individual datasets. In contrast, prevalence estimates for common species were more stable, but occasionally deteriorated substantially when influential datasets were removed. These differences largely arise from the mathematical properties of the indicator and the contrasting prevalence distributions between structured and opportunistic data sources, rather than from data quality issues alone. In particular, high-prevalence species have limited scope for improvement through dataset removal, whereas rare species can show appreciable gains.
1168
1213
1214
+
Aggregating results at the species level confirmed that most species exhibit limited sensitivity to dataset removal, while a small number act as diagnostic cases with pronounced dependence on specific datasets. Analysis at the dataset level showed that large datasets exert the strongest influence on prevalence estimates. The *waarnemingen.be* datasets, in particular, had a substantial impact, improving estimates for rare species while worsening them for common species. This dual effect highlights the central role of large opportunistic datasets in shaping prevalence indicators.
1169
1215
1216
+
To summarise sensitivity in a compact and interpretable way, we introduced a species-level robustness metric based on the median relative improvement in error across cross-validation runs. Most species exhibited high robustness scores, indicating stable prevalence estimates, while lower scores identified species for which estimates are more dependent on data composition. This metric provides a practical tool for summarising robustness in species-rich indicator systems and for identifying cases that may warrant closer scrutiny.
0 commit comments