Generative AI

114 Posts

Green creatures with confused expressions surrounded by mirrors creating infinite reflections.
Generative AI

Synthetic Data Distorts Models: Could training on generated output doom AI’s future?

Training successive neural networks on the outputs of previous networks gradually degrades performance. Will future models succumb to the curse of recursive training?
Temporal pyramids in rows (left) and position encoding in space-time pyramid shown in the pyramidal flow matching process.
Generative AI

Faster, Cheaper Video Generation: Pyramidal Flow Matching, a cost-cutting method for training video generators

Researchers devised a way to cut the cost of training video generators. They used it to build a competitive open source text-to-video model and promised to release the training code.
OpenAI logo next to the Microsoft logo with their shadows visible.
Generative AI

AI Bromance Turns Turbulent: Microsoft and OpenAI partnership faces strain as both seek less dependence

Once hailed by OpenAI chief Sam Altman as the “best bromance in tech,” the partnership between Microsoft and OpenAI is facing challenges as both companies seek greater independence.
Diagram of a transformer model using Jina embeddings and LoRA adapters, tailored for tasks like sentiment classification.
Generative AI

Better Text Embeddings: Jina AI launches jina-embeddings-v3, a text embedding model with task-specific adapters

Text embedding models are often used to retrieve text, cluster text, determine similarity between texts, and generate initial embeddings for text classifiers. A new embedding model comes with adapters that specialize it to each of these use cases.
Collage of various images featuring a baseball player, movie scenes, portraits, landscapes, and diverse wildlife.
Generative AI

German Court Says LAION Didn’t Violate Copyrights: LAION wins copyright case in Germany

A German court dismissed a copyright lawsuit against LAION, the nonprofit responsible for large-scale image datasets used to train Midjourney, Stable Diffusion, and other image generators.
A smartphone on a table showing an incoming call with voice waveform displayed on screen.
Generative AI

Voice-to-Voice and More for GPT-4o API: OpenAI unveils tools for speech, vision, and cost-efficiency at DevDay

OpenAI launched a suite of new and updated tools to help AI developers build applications and reduce costs.
A demonstration of video editing through text input, altering a runner’s background and costume.
Generative AI

Familiar Faces, Synthetic Soundtracks: Meta debuts Movie Gen for text-to-video generation with consistent characters

Meta upped the ante for text-to-video generation with new systems that produce consistent characters and matching soundtracks.
A GIF showcasing a dynamic spreadsheet interaction using AI, with cells being populated and analyzed automatically.
Generative AI

Enabling LLMs to Read Spreadsheets: A method to process large spreadsheets for accurate question answering

Large language models can process small spreadsheets, but very large spreadsheets often exceed their limits for input length. Researchers devised a method that processes large spreadsheets so LLMs can answer questions about them.
A dynamic GIF featuring erupting volcanoes, a reindeer in the snow, animated fuzzy creatures, and a close-up of a human eye.
Generative AI

Generative Video in the Editing Suite: Adobe integrates AI video generation into Premiere Pro

Adobe is putting a video generator directly into its popular video editing application.
An interior design assistant tool analyzing an image of a modern living room.
Generative AI

Llama Herd Expands: Meta updates Llama models with vision-language, edge sizes, and agentic APIs

Meta extended its Llama family into two new categories: vision-language models and models small enough to fit in edge devices.
GIF featuring various scenes including people in a protest, close-ups of eyes, and outdoor landscapes.
Generative AI

Hollywood Embraces Video Generation: Lionsgate teams with Runway to develop a custom fine-tuned video model

The AI startup Runway is helping to retool Lionsgate, the producer of blockbuster movie franchises like The Hunger Games and John Wick, for the era of generated video.
Generative AI

More, Better Open Source Options: Alibaba releases Qwen 2.5 models, raising the bar for open weight LLMs

The parade of ever more capable LLMs continues with Qwen 2.5.
Generative AI

Reducing Memorization in LLMs: A technique that masks tokens in large language models, protecting data privacy

Studies have established that large language models can memorize text passages they’ve been trained on repeatedly and regurgitate them when prompted, whether adversarially or, more rarely, benignly.
Generative AI

High Gear for Llama 3.1 405B: SambaNova boosts Llama 3.1 performance with fast, free access to largest model

SambaNova raised the speed limit for access to the largest model in the Llama 3.1 family — and it’s free.
OpenAI's model scores on the GPQA Diamond tests in biology, chemistry, and physics, along with their overall score.
Generative AI

OpenAI Forges Chains of Thought: OpenAI’s o1 models excel in reasoning, outperform GPT-4o in math and coding

Preliminary versions of OpenAI’s new model family were trained explicitly to think step-by-step, yielding outstanding marks in math, science, and coding — but users can’t see their reasoning steps.