Generative AI

Amazon smart display with widgets for recipes, calendar, weather, events, and streaming (Prime Video, Netflix, Disney+).

Amazon’s Next-Gen Voice Assistant: Alexa+ adds generative AI and agents, using Claude and other models

Amazon announced Alexa+, a major upgrade to its long-running voice assistant.
Diagram of Coconut, a method training LLMs to process thought chains as vectors, comparing it to Chain-of-Thought (CoT).

Reasoning in Vectors, Not Text: Meta introduces Chain of Continuous Thought (Coconut) to improve next-token prediction

Large language models can improve their performance by generating a chain of thought (CoT): intermediate text tokens that break down the process of responding to a prompt into a series of steps.
Bar chart comparing active vs. random sampling effects on length, diversity, and toxicity after fine-tuning.

Fine-Tuning Fine Points: Active inheritance, a smarter way to fine-tune models on synthetic data

The practice of fine-tuning models on synthetic data is becoming well established. But synthetic training data, even if it represents the training task well, may include characteristics like toxicity that impart unwelcome properties in the trained model’s output...
AI assistant processes ‘Find me a family-friendly campsite’ and suggests options.

Computer Use Gains Momentum: OpenAI’s Operator automates online tasks with a new AI agent

OpenAI introduced an AI agent that performs simple web tasks on a user’s behalf.
DAVID DING

David Ding: Generated video with music, sound effects, and dialogue

Last year, we saw an explosion of models that generate either video or audio outputs in high quality. In the coming year, I look forward to models that produce video clips complete with audio soundtracks including speech, music, and sound effects.
HANNO BASSE

Hanno Basse: Generative AI for artists

Stability AI’s aim is to liberate artists of all trades from the repetitive, mechanical aspects of their work and help them spend the majority of their time on the creative side. So our highest hope for next year is that generative AI will help people to be more creative and productive.
Snowman using a camera during snowfall.

Generative Video Takes Off: Generative video models revolutionize content creation with stunning realism

Video generation exploded with an abundance of powerful models.
A GIF with scenes of a man at a café, a working robot, a ghost in a mirror, and a speeding truck.

Open Video Gen Closes the Gap: Tencent releases HunyuanVideo, an open source model rivaling commercial video generators

The gap is narrowing between closed and open models for video generation.
Game character climbing a ladder with visible controls (QWASD) and health bars.

Game Worlds on Tap: Genie 2 brings interactive 3D worlds to life

A new model improves on recent progress in generating interactive virtual worlds from still images.
Berkeley Function Calling Leaderboard with metrics like accuracy, latency, and relevance.

Competitive Performance, Competitive Prices: Amazon introduces Nova models for text, image, and video

Amazon introduced a range of models that confront competitors head-on.
Pile of discarded green circuit boards from electronic devices.

Garbage Out: Generative AI and GPU boom spawns growing e-waste problem

Rapid progress in generative AI comes with a hidden environmental cost: mountains of obsolete hardware.
Comparison of Minecraft terrain with and without player modifications.

No Game Engine Required: AI creates an interactive Minecraft-like world in real time

A real-time video generator lets you explore an open-ended, interactive virtual world — a video game without a game engine.
Green creatures with confused expressions surrounded by mirrors creating infinite reflections.

Synthetic Data Distorts Models: Could training on generated output doom AI’s future?

Training successive neural networks on the outputs of previous networks gradually degrades performance. Will future models succumb to the curse of recursive training?
Temporal pyramids in rows (left) and position encoding in space-time pyramid shown in the pyramidal flow matching process.

Faster, Cheaper Video Generation: Pyramidal Flow Matching, a cost-cutting method for training video generators

Researchers devised a way to cut the cost of training video generators. They used it to build a competitive open source text-to-video model and promised to release the training code.
OpenAI logo next to the Microsoft logo with their shadows visible.

AI Bromance Turns Turbulent: Microsoft and OpenAI partnership faces strain as both seek less dependence

Once hailed by OpenAI chief Sam Altman as the “best bromance in tech,” the partnership between Microsoft and OpenAI is facing challenges as both companies seek greater independence.