ScanNet
Transformers See in 3D: Using transformers to visualize depth in 2D images.
Visual robots typically perceive the three-dimensional world through sequences of two-dimensional images, but they don’t always know what they’re looking at. For instance, Tesla’s self-driving system has been known to mistake a full moon for a traffic light.