Here is recent literature on masked autoencoders (MAE).

- [Masked Autoencoders As Spatiotemporal Learners - 2022](https://arxiv.org/abs/2205.09113): studies a conceptually simple extension of Masked Autoencoders (MAE) to spatiotemporal representation learning from videos.