Zero-shot Learning

5 Posts

Word cloud, chess positions given to the model as text and chart with % of suggested chess moves

Toward Next-Gen Language Models: New Benchmarks Test the Limits of Large Language Models

A new benchmark aims to raise the bar for large language models. Researchers at 132 institutions worldwide introduced the Beyond the Imitation Game benchmark (BIG-bench), which includes tasks that humans perform well but current state-of-the-art models don’t.

AI generated images with different descriptions

Zero-shot Learning

More Realistic Pictures From Text: How the Glide Diffusion Model Generates Images from Text

OpenAI’s DALL·E got an upgrade that takes in text descriptions and produces images in styles from hand-drawn to photorealistic. The new version is a rewrite from the ground up. It uses the earlier CLIP zero-shot image classifier to represent text descriptions.

Zero-shot Learning

I Know It When I See It: Zero-shot detection for objects not in training data.

Object detectors typically detect only items that were labeled in their training data. A new method liberates them to locate and recognize a much wider variety of objects.

Different graphs showing switch transformer data

Zero-shot Learning

Bigger, Faster Transformers: Increasing parameters without slowing down transformers

Performance in language tasks rises with the size of the model — yet, as a model’s parameter count rises, so does the time it takes to render output. New work pumps up the number of parameters without slowing down the network.

Series of images showing improvements in a multilingual language translator

Zero-shot Learning

Better Zero-Shot Translations: A method for improving transformer NLP translation

Train a multilingual language translator to translate between Spanish and English and between English and German, and it may be able to translate directly between Spanish and German as well. New work proposes a simple path to better machine translation between languages.

Zero-shot Learning

Toward Next-Gen Language Models: New Benchmarks Test the Limits of Large Language Models

More Realistic Pictures From Text: How the Glide Diffusion Model Generates Images from Text

I Know It When I See It: Zero-shot detection for objects not in training data.

Bigger, Faster Transformers: Increasing parameters without slowing down transformers

Better Zero-Shot Translations: A method for improving transformer NLP translation

Subscribe to The Batch