Few-shot Learning
Toward Next-Gen Language Models: New Benchmarks Test the Limits of Large Language Models
A new benchmark aims to raise the bar for large language models. Researchers at 132 institutions worldwide introduced the Beyond the Imitation Game benchmark (BIG-bench), which includes tasks that humans perform well but current state-of-the-art models don’t.