Allen Institute for AI
One Model for Vision-Language: A general-purpose AI for vision and language tasks.
Researchers have proposed task-agnostic architectures for image classification and for language tasks. New work proposes a single architecture for vision-language tasks.