Dec 18, 2024

6 Posts

Graph showing cross-validation accuracy vs. number of features for raw and whitened inputs.

Dec 18, 2024

Focus on the Future, Learn From the Past: 15 years ago, the idea of scaling up deep learning was controversial — but it was right. Keep your eyes open for such ideas in 2025.

I’m thrilled that former students and postdocs of mine won both of this year’s NeurIPS Test of Time Paper Awards.

Dec 18, 2024

Phi-4 Breaks Size Barrier, HunyuanVideo Narrows Open Source Gap, Gemini 2.0 Flash Accelerates Multimodal Modeling, LLMs Propose Research Ideas

The Batch AI News and Insights: I’m thrilled that former students and postdocs of mine won both of this year’s NeurIPS Test of Time Paper Awards.

Animation showcasing 7 key NLP topics visually expanding on the screen.

Dec 18, 2024

When LLMs Propose Research Ideas: Stanford study finds AI matches human experts at writing research proposals

How do agents based on large language models compare to human experts when it comes to proposing machine learning research? Pretty well, according to one study.

Performance comparison for Gemini models across benchmarks.

Dec 18, 2024

Multimodal Modeling on the Double: Google introduces Gemini 2.0 Flash, a faster, more capable AI model

Google’s Gemini 2.0 Flash, the first member of its updated Gemini family of large multimodal models, combines speed with performance that exceeds that of its earlier flagship model, Gemini 1.5 Pro, on several measures.

A GIF with scenes of a man at a café, a working robot, a ghost in a mirror, and a speeding truck.

Dec 18, 2024

Open Video Gen Closes the Gap: Tencent releases HunyuanVideo, an open source model rivaling commercial video generators

The gap is narrowing between closed and open models for video generation.

Benchmark results for Phi-4, GPT, LLaMA-3.3, and Qwen 2.5 models.

Dec 18, 2024

Phi-4 Beats Models Five Times Its Size: Microsoft’s Phi-4 learned from a blend of synthetic and organic data to surpass larger models in math and reasoning benchmarks

Microsoft updated its smallest model family with a single, surprisingly high-performance model.

Dec 18, 2024

Focus on the Future, Learn From the Past: 15 years ago, the idea of scaling up deep learning was controversial — but it was right. Keep your eyes open for such ideas in 2025.

Phi-4 Breaks Size Barrier, HunyuanVideo Narrows Open Source Gap, Gemini 2.0 Flash Accelerates Multimodal Modeling, LLMs Propose Research Ideas

When LLMs Propose Research Ideas: Stanford study finds AI matches human experts at writing research proposals

Multimodal Modeling on the Double: Google introduces Gemini 2.0 Flash, a faster, more capable AI model

Open Video Gen Closes the Gap: Tencent releases HunyuanVideo, an open source model rivaling commercial video generators

Phi-4 Beats Models Five Times Its Size: Microsoft’s Phi-4 learned from a blend of synthetic and organic data to surpass larger models in math and reasoning benchmarks

Subscribe to The Batch