Machine Learning Research

534 Posts

A warm-toned room features a sofa, a decorated shelf, and sunlight filtering through patterned curtains.
Machine Learning Research

Detailed Text- or Image-to-3D, Pronto: FlashWorld generates 3D objects, scenes, and surfaces with photorealistic fidelity

Current methods that produce 3D scenes from text or images are slow and produce inconsistent results. Researchers introduced a technique that generates detailed, coherent 3D scenes seconds.
View from a car on a tree-lined street, with an overlay instructing to decelerate if hazards are detected.
Machine Learning Research

Training Cars to Reason: Nvidia’s Alpamayo-R1 is a robotics-style reasoning model for autonomous vehicles

Chain-of-thought reasoning can help autonomous vehicles decide what to do next.
Matrix links queries to documents, illustrating embedding limits in representing relevance combinations.
Machine Learning Research

Retrieval Faces Hard Limits: Google and Johns Hopkins researchers show embedding models can’t search unlimited documents

Can your retriever find all the relevant documents for any query your users might enter? Maybe not, research shows.
Diagrams comparing LongCoT and Delethink environments show reasoning processes and context management.
Machine Learning Research

More Affordable Reasoning: Canadian researchers find capping context helps models better retrieve data

One way to improve a reasoning model’s performance is to let it produce a longer chain of thought. However, attending to ever-longer contexts can become expensive, and making that attention more efficient requires changes to a model’s architecture.
Diagram showing SCP hub linking clients with databases, tools, AI agents, and lab devices for experiments.
Machine Learning Research

Lingua Franca for Science Labs: SAIL’s Science Context Protocol helps AI Agents communicate about local and virtual experiments

An open protocol aims to enable AI agents to conduct scientific research autonomously across disciplinary and institutional boundaries.
Dialogue displays a model revealing it answered incorrectly and wrote code against instructions.
Machine Learning Research

Teaching Models to Tell the Truth: OpenAI fine-tuned a version of GPT-5 to confess when it was breaking the rules

Large language models occasionally conceal their failures to comply with constraints they’ve been trained or prompted to observe. Researchers trained an LLM to admit when it disobeyed.
Sharon Zhou is pictured smiling confidently with her hands clasped, reflecting AI’s potential for community-building.
Machine Learning Research

Chatbots That Build Community by Sharon Zhou: Sharon Zhou of AMD on expanding chat to serve groups and connect us with other people

Next year, I’m excited to see AI break out of 1:1 relationships with each of us. In 2026, AI has the potential to bring people together and unite us with human connection, rather than polarize and isolate us. It’s about time for ChatGPT to enter your group chats.
Pengtao Xie is pictured standing near a chalkboard filled with mathematical notes, addressing a classroom of attentive students.
Machine Learning Research

Multimodal Models for Biomedicine by Pengtao Xie: Pengtao Xie of UC-San Diego on why medical models need to visualize tiny chemicals and large organs

Over the past few years, we have seen rapid progress in models that jointly reason over text, images, sequences, graphs, and time series. Yet in biomedical settings, these capabilities often remain fragmented, brittle, or difficult to interpret.
Tanmay Gupta is pictured smiling next to a whiteboard filled with mathematical formulas, embodying active AI engagement.
Machine Learning Research

From Prediction to Action by Tanmay Gupta: Tanmay Gupta of the Allen Institute on building AI for long-horizon tasks

AI research in 2026 should confront a simple but transformative realization: Models that predict are not the same as systems that act. The latter is what we actually need.
Adji Bousso Dieng is pictured typing on a laptop in a warmly lit room, focusing on AI-driven scientific work.
Machine Learning Research

AI for Scientific Discovery by Adji Bousso Dieng: Adji Bousso Dieng, Princeton University Assistant Professor and AI Researcher, on optimizing models for the long tail

In 2026, I hope AI will transition from being a tool for efficiency to a catalyst for scientific discovery.
David Cox is pictured during a discussion in a glass-walled office, aligned with themes of open-source innovation and teamwork.
Machine Learning Research

Open Source Wins by David Cox: David Cox, VP for AI Models at IBM Research, on the need for open development in AI

My hope is that open AI continues to flourish and ultimately wins.
Mice on a laptop keyboard explore, with code on screen; background features festive lights, presents.
Machine Learning Research

Agents Write Code Faster, Cheaper: Software developers used more versatile AI-powered tools to write code

Coding apps moved beyond autofill-style code completion to agentic systems that manage a wide range of software development tasks.
Snowman in Thinker pose on snowy landscape, with a person building it.
Machine Learning Research

Thinking Models Solve Bigger Problems: Reasoning models, beginning with OpenAI’s o1 and DeepSeek’s R1, transformed the industry

Think step by step. Explain your reasoning. Work backwards from the answer. As 2025 began, models executed these reasoning strategies only when prompted. Now most new large language models do it as a matter of course, improving performance across a wide range of tasks.
Diagram shows LLM training with encoders for images, audio, video; inference with galaxies, satellites.
Machine Learning Research

Adapting LLMs to Any Sort of Data: SEMI (Sample-Efficient Modality Integration) tackles new domains with few-shot examples

Enabling a pretrained large language model to process a data type other than text (say, images), possibly in a specialized domain (say, radiology), typically requires thousands to millions of examples that pair the other data (perhaps x-rays) with text.
A table compares GPT-5.2's benchmark scores to Claude Opus 4.5 and Gemini 3 Pro in various reasoning tasks.
Machine Learning Research

OpenAI’s Answer to Gemini 3: GPT-5.2 arrives, touting variable reasoning and coding performance

OpenAI launched GPT-5.2 only weeks after its CEO Sam Altman reportedly issued a “code red” alarm in response to Google's Gemini 3.
Load More

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox