Generative AI

54 Posts

Think D̶i̶f̶f̶e̶r̶e̶n̶t̶ Small: Apple releases OpenELM, a family of smaller large language models.

Apple is thinking small — very small — with a new family of open large language models.
Stable Video 3D (SV3D)

A 3D Model From One 2D Image: Neural Radiance Field (NeRF), a method to generate a 3D model from a single image

Video diffusion provides a new basis for generating 3D models.
Udio and Suno web pages

Songs Made to Order: Text-to-music services evolve with Udio and Suno's customized song creations.

A new breed of audio generator produces synthetic performances of songs in a variety of popular styles.

Tuning LLMs for Better RAG: Meta’s RA-DIT boosts language model output by optimizing text retrieval

Retrieval-augmented generation (RAG) enables large language models to generate better output by retrieving documents that are relevant to a user’s prompt. Fine-tuning further improves RAG performance.

Hallucination Creates Security Holes: Researcher exposes risks in AI-generated code

Language models can generate code that erroneously points to software packages, creating vulnerabilities that attackers can exploit.

Instability at Stability AI: Stability AI CEO steps down as company faces financial and market challenges

The CEO of Stability AI resigned as the company faces an increasingly competitive market.

More Factual LLMs: FactTune, a method to fine-tune LLMs for factual accuracy without human feedback

Large language models sometimes generate false statements. New work makes them more likely to produce factual output.

Deepfakes Become Politics as Usual: Deepfakes dominate as India's election season unfolds.

Synthetic depictions of politicians are taking center stage as the world’s biggest democratic election kicks off.

India Warns Devs — No Unreliable AI: India advises pre-approval for new AI deployments by tech firms.

India advised major tech companies to seek government approval before they deploy new AI models.

Anthropic Ups the Ante: Anthropic introduces Claude 3, a new trio of multimodal models.

Anthropic announced a suite of large multimodal models that set new states of the art in key benchmarks.

Google Tests Generative News Tools: Google funds newsrooms to test AI-powered article generation tools.

Google is paying newsrooms to use a system that helps transform press releases into articles.

Google Releases Open Source LLMs: All we know about Google's Gemma-7B and Gemma-2B models

Google asserted its open source bona fides with new models. Google released weights for Gemma-7B, an 8.5 billion-parameter large language model intended to run on GPUs, and Gemma-2B, a 2.5 billion-parameter version intended for deployment on CPUs and edge devices.

Context Is Everything: Gemini 1.5 Pro, a leap in multimodal AI amid controversy over v1.0

An update of Google’s flagship multimodal model keeps track of colossal inputs, while an earlier version generated some questionable outputs.

Generated Video Gets Real(er): OpenAI's Sora, a new player in text-to-video generation

OpenAI’s new video generator raises the bar for detail and realism in generated videos — but the company released few details about how it built the system.

Better Images, Less Training: Würstchen, a speedy, high-quality image generator

The longer text-to-image models train, the better their output — but the training is costly. Researchers built a system that produced superior images after far less training.