MaskViT

1 Post

Ground truth video of a road on the left and predicted video with MaskViT on the right
MaskViT

Seeing What Comes Next: Transformers predict future video frames.

If a robot can predict what it’s likely to see next, it may have a better basis for choosing an appropriate action — but it has to predict quickly. Transformers, for all their utility in computer vision, aren’t well suited to this because of their steep computational and memory requirements...

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox