Jan 29, 2025

6 Posts

Blue whale logo biting and breaking a computer chip, with debris flying.
Jan 29, 2025

Three Takeaways from DeepSeek’s Big Week: Innvations by China’s AI powerhouse DeepSeek highlight major shifts in the international scene

The buzz over DeepSeek this week crystallized, for many people, a few important trends that have been happening in plain sight.
Blue whale logo biting and breaking a computer chip, with debris flying.
Jan 29, 2025

Reinforcement Learning Heats Up, White House Orders Muscular AI Policy, Computer Use Gains Momentum, Fine Control of Fine-Tuning

The Batch AI News and Insights: The buzz over DeepSeek this week crystallized, for many people, a few important trends that have been happening in plain sight.
Bar chart comparing active vs. random sampling effects on length, diversity, and toxicity after fine-tuning.
Jan 29, 2025

Fine-Tuning Fine Points: Active inheritance, a smarter way to fine-tune models on synthetic data

The practice of fine-tuning models on synthetic data is becoming well established. But synthetic training data, even if it represents the training task well, may include characteristics like toxicity that impart unwelcome properties in the trained model’s output...
Front view of the White House with a fountain, green lawn, and the U.S. flag flying on top.
Jan 29, 2025

White House Orders Muscular AI Policy: U.S. shifts AI strategy to remove regulations and reinforce global leadership

Under a new president, the United States reversed its approach to AI regulation, seeking global dominance by reducing restrictions.
AI assistant processes ‘Find me a family-friendly campsite’ and suggests options.
Jan 29, 2025

Computer Use Gains Momentum: OpenAI’s Operator automates online tasks with a new AI agent

OpenAI introduced an AI agent that performs simple web tasks on a user’s behalf.
Diagram of a reinforcement learning system for training LLMs, showing data and weight flow processes.
Jan 29, 2025

Reinforcement Learning Heats Up: How DeepSeek-R1 and Kimi k1.5 use reinforcement learning to improve reasoning

Reinforcement learning is emerging as an avenue for building large language models with advanced reasoning capabilities.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox