Dec 06, 2023

6 Posts

Dec 06, 2023

Multitask Vision Transformer

The original DINO showed that a vision transformer pretrained on unlabeled images could learn representations that were sufficient for classifying and segmenting images. In an update of that work, the model learned representations useful in a wider variety of tasks.

Visualization of features of a pathology image using a generic LVM (left) versus a domain-specific LVM (right)

Dec 06, 2023

Amazon's New Chatbot, Pedestrian Detection, Limits on AI in Insurance, a Robot That Can Find Your Keys

Large language models, or LLMs, have transformed how we process text. Large vision models, or LVMs, are starting to change how we process images as well.

Dec 06, 2023

Making Large Vision Models Work for Business: Large language models can learn what they need to know from the internet, but large vision models need training on proprietary data.

Large language models, or LLMs, have transformed how we process text. Large vision models, or LVMs, are starting to change how we process images as well. But there is an important difference between LLMs and LVMs.

Dec 06, 2023

Robot, Find My Keys: A machine learning model for robots to predict the location of objects in households

Researchers proposed a way for robots to find objects in households where things get moved around. Andrey Kurenkov and colleagues at Stanford University introduced Node Edge Predictor, a model that learned to predict where objects were located in houses.

Dec 06, 2023

Seeing Darker-Skinned Pedestrians: Children and people with darker skin face higher street risks with object detectors, research finds.

In a study, models used to detect people walking on streets and sidewalks performed less well on adults with darker skin and children of all skin tones.

Dec 06, 2023

Amazon Joins Chatbot Fray: The pros and cons of Q, Amazon’s new enterprise chatbot

Amazon launched a chatbot for large companies even as internal tests indicated potential problems. Amazon introduced Q, an AI-powered assistant that enables employees to query documents and corporate systems.
