Generative AI

118 Posts

Game character climbing a ladder with visible controls (QWASD) and health bars.

Game Worlds on Tap: Genie 2 brings interactive 3D worlds to life

A new model improves on recent progress in generating interactive virtual worlds from still images.
Berkeley Function Calling Leaderboard with metrics like accuracy, latency, and relevance.

Competitive Performance, Competitive Prices: Amazon introduces Nova models for text, image, and video

Amazon introduced a range of models that confront competitors head-on.
Pile of discarded green circuit boards from electronic devices.

Garbage Out: Generative AI and GPU boom spawns growing e-waste problem

Rapid progress in generative AI comes with a hidden environmental cost: mountains of obsolete hardware.
Comparison of Minecraft terrain with and without player modifications.

No Game Engine Required: AI creates an interactive Minecraft-like world in real time

A real-time video generator lets you explore an open-ended, interactive virtual world — a video game without a game engine.
Green creatures with confused expressions surrounded by mirrors creating infinite reflections.

Synthetic Data Distorts Models: Could training on generated output doom AI’s future?

Training successive neural networks on the outputs of previous networks gradually degrades performance. Will future models succumb to the curse of recursive training?
Temporal pyramids in rows (left) and position encoding in space-time pyramid shown in the pyramidal flow matching process.

Faster, Cheaper Video Generation: Pyramidal Flow Matching, a cost-cutting method for training video generators

Researchers devised a way to cut the cost of training video generators. They used it to build a competitive open source text-to-video model and promised to release the training code.
OpenAI logo next to the Microsoft logo with their shadows visible.

AI Bromance Turns Turbulent: Microsoft and OpenAI partnership faces strain as both seek less dependence

Once hailed by OpenAI chief Sam Altman as the “best bromance in tech,” the partnership between Microsoft and OpenAI is facing challenges as both companies seek greater independence.
Diagram of a transformer model using Jina embeddings and LoRA adapters, tailored for tasks like sentiment classification.

Better Text Embeddings: Jina AI launches jina-embeddings-v3, a text embedding model with task-specific adapters

Text embedding models are often used to retrieve text, cluster text, determine similarity between texts, and generate initial embeddings for text classifiers. A new embedding model comes with adapters that specialize it to each of these use cases.
Collage of various images featuring a baseball player, movie scenes, portraits, landscapes, and diverse wildlife.

German Court Says LAION Didn’t Violate Copyrights: LAION wins copyright case in Germany

A German court dismissed a copyright lawsuit against LAION, the nonprofit responsible for large-scale image datasets used to train Midjourney, Stable Diffusion, and other image generators.
A smartphone on a table showing an incoming call with voice waveform displayed on screen.

Voice-to-Voice and More for GPT-4o API: OpenAI unveils tools for speech, vision, and cost-efficiency at DevDay

OpenAI launched a suite of new and updated tools to help AI developers build applications and reduce costs.
A demonstration of video editing through text input, altering a runner’s background and costume.

Familiar Faces, Synthetic Soundtracks: Meta debuts Movie Gen for text-to-video generation with consistent characters

Meta upped the ante for text-to-video generation with new systems that produce consistent characters and matching soundtracks.
A GIF showcasing a dynamic spreadsheet interaction using AI, with cells being populated and analyzed automatically.

Enabling LLMs to Read Spreadsheets: A method to process large spreadsheets for accurate question answering

Large language models can process small spreadsheets, but very large spreadsheets often exceed their limits for input length. Researchers devised a method that processes large spreadsheets so LLMs can answer questions about them.
A dynamic GIF featuring erupting volcanoes, a reindeer in the snow, animated fuzzy creatures, and a close-up of a human eye.

Generative Video in the Editing Suite: Adobe integrates AI video generation into Premiere Pro

Adobe is putting a video generator directly into its popular video editing application.
An interior design assistant tool analyzing an image of a modern living room.

Llama Herd Expands: Meta updates Llama models with vision-language, edge sizes, and agentic APIs

Meta extended its Llama family of models into two new categories: vision-language and sizes that are small enough to fit in edge devices.
GIF featuring various scenes including people in a protest, close-ups of eyes, and outdoor landscapes.

Hollywood Embraces Video Generation: Lionsgate teams with Runway to develop a custom fine-tuned video model

The AI startup Runway is helping to retool Lionsgate, the producer of blockbuster movie franchises like The Hunger Games and John Wick, for the era of generated video.