ResNeXt

2 Posts

Graphs and data related to visualized tokens (or vokens)

Better Language Through Vision: Study improved Bert performance using visual tokens.

For children, associating a word with a picture that illustrates it helps them learn the word’s meaning. Research aims to do something similar for machine learning models. Researchers improved a BERT model’s performance on some language tasks by training it on a large dataset of image-word pairs.

Facebook service describing a photo on Instagram

ResNeXt

Every Picture Tells a Story: Facebook expands automated alternative text.

Facebook expanded a system of vision, language, and speech models designed to open the social network to users who are visually impaired. A Facebook service that describes photos in a synthesized voice now recognizes 1,200 visual concepts — 10 times more than the previous version.

ResNeXt

Better Language Through Vision: Study improved Bert performance using visual tokens.

Every Picture Tells a Story: Facebook expands automated alternative text.

Subscribe to The Batch