-
Notifications
You must be signed in to change notification settings - Fork 6
Description
We know that BB only considers each individual photo, by itself, when determining likely species. This opens the possibility for some really wacky decisions. For example:
- 8 photos of "Carolina Wren", identified correctly
- 1 photo of a bird butt, identified as "Fish Crow"
- The created-at timestamps of all 9 photos are identical, to the millisecond (because they all came in the same postcard)
The bird butt was obviously the same Carolina Wren as the other 8, and the Fish Crow identification is obviously wrong.
Looking at the postcard data (internal notes at issue-40.json), I think we can work towards being smarter about the one anomalous sighting by checking the media timestamps. If all the media are very close together, yet the media seem to be split into separate sighting instances, one of which has a low-confidence suggestion, while most of the other media have a high-confidence recognized species, we can reasonably infer that the low-confidence one is an anomaly, and assign the other sighting's species to the anomaly.
We could also work this behavior into the collect_postcard strategy configs.