In the next figure, the green data show entries with the same coordinates and categories but different ids. Why is that? And how can this result be predicted using a network?