NesT
Transformer Speed-Up Sped Up: How to Speed Up Image Transformers
The transformer architecture is notoriously inefficient when processing long sequences — a problem in processing images, which are essentially long sequences of pixels. One way around this is to break up input images and process the pieces