DINOv2

1 Post

Multitask Vision Transformer

The original DINO showed that a vision transformer pretrained on unlabeled images could learn representations that were sufficient for classifying and segmenting images. In an update of that work, the model learned representations useful in a wider variety of tasks.

DINOv2

Multitask Vision Transformer

Subscribe to The Batch