Hi, I encounter the same problem as in #10.
And I found the reason why 2 examples are filtered is that the obj_label are 1970s and 1990s. And in common_vocab_cased.txt generated by vocab_intersection.py, there are no 1970s and 1990s.
236: {"masked_sentences": ["Income inequality began to increase in the US in the [MASK]."], "obj_label": "1970s", "id": "57287b322ca10214002da3bf_0", "sub_label": "Squad"}
206: {"masked_sentences": ["The perception of Genghis Khan in Mongolia brightened in the [MASK]."], "obj_label": "1990s", "id": "5727404b708984140094db59_0", "sub_label": "Squad"}