May 22, 2024

6 Posts

Project Idea — A Car for Dinosaurs: AI projects don’t need to have a meaningful deliverable. Lower the bar and do something creative.
May 22, 2024

Project Idea — A Car for Dinosaurs: AI projects don’t need to have a meaningful deliverable. Lower the bar and do something creative.

A good way to get started in AI is to start with coursework, which gives a systematic way to gain knowledge, and then to work on projects.
Music Industry Titan Targets AI, End-to-End Multimodality, Millions of Tokens of Context, More Responsive Text-to-Image
May 22, 2024

Music Industry Titan Targets AI, End-to-End Multimodality, Millions of Tokens of Context, More Responsive Text-to-Image

The Batch AI News and Insights: A good way to get started in AI is to start with coursework, which gives a systematic way to gain knowledge, and then to work on projects.
Interpreting Image Edit Instructions: Meta’s Emu Edit improves text-to-image generation with task classification.
May 22, 2024

Interpreting Image Edit Instructions: Meta’s Emu Edit improves text-to-image generation with task classification.

The latest text-to-image generators can alter images in response to a text prompt, but their outputs often don’t accurately reflect the text. They do better if, in addition to a prompt, they’re told the general type of alteration they’re expected to make.
Sony Music logo turning into the copyright symbol
May 22, 2024

Music Titan Targets AI: Sony Music accuses AI developers of copyright violations.

The world’s second-largest music publisher accused AI developers of potential copyright violations.
2 Million Tokens of Context & More: Google’s I/O developers’ conference reveals new AI models, features, and upgrades.
May 22, 2024

2 Million Tokens of Context & More: Google’s I/O developers’ conference reveals new AI models, features, and upgrades.

Google’s annual I/O developers’ conference brought a plethora of updates and new models. 
Faster, Cheaper Multimodality: All about GPT-4o, OpenAI’s latest multimodal model
May 22, 2024

Faster, Cheaper Multimodality: All about GPT-4o, OpenAI’s latest multimodal model

OpenAI’s latest model raises the bar for models that can work with common media types in any combination.

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox