Are there any plans to support masked token prediction (e.g., for training Tab Transformers)? I would love to contribute if someone is willing to guide me.