Feb 07, 2024

6 Posts

LLMs Can Get Inside Your Head: AI models show promise in understanding human beliefs, research reveals
Feb 07, 2024

LLMs Can Get Inside Your Head: AI models show promise in understanding human beliefs, research reveals

Most people understand that others’ mental states can differ from their own. For instance, if your friend leaves a smartphone on a table and you privately put it in your pocket, you understand that your friend continues to believe it was on the table.
GPT-4 Biothreat Risk is Low: Study finds GPT-4 no more risky than online search in aiding bioweapon development.
Feb 07, 2024

GPT-4 Biothreat Risk is Low: Study finds GPT-4 no more risky than online search in aiding bioweapon development.

GPT-4 poses negligible additional risk that a malefactor could build a biological weapon, according to a new study. OpenAI compared the ability of GPT-4 and web search to contribute to the creation of a dangerous virus or bacterium. The large language model was barely more helpful than the web.
New Leaderboards Rank Safety, More: Hugging Face introduces leaderboards to evaluate model performance and trustworthiness.
Feb 07, 2024

New Leaderboards Rank Safety, More: Hugging Face introduces leaderboards to evaluate model performance and trustworthiness.

Hugging Face introduced four leaderboards to rank the performance and trustworthiness of large language models (LLMs). The open source AI repository now ranks performance on tests of workplace utility, trust and safety, tendency to generate falsehoods, and reasoning.
Nude Deepfakes Spur Legislators: Taylor Swift deepfake outrage prompts U.S. lawmakers to propose anti-AI pornography laws.
Feb 07, 2024

Nude Deepfakes Spur Legislators: Taylor Swift deepfake outrage prompts U.S. lawmakers to propose anti-AI pornography laws.

Sexually explicit deepfakes of Taylor Swift galvanized public demand for laws against nonconsensual, AI-enabled pornography.
What If Large Language Models Become a Commodity?: Large language models are proliferating. What are the prospects for Amazon, Google, Meta, Microsoft, OpenAI, and LLM startups?
Feb 07, 2024

What If Large Language Models Become a Commodity?: Large language models are proliferating. What are the prospects for Amazon, Google, Meta, Microsoft, OpenAI, and LLM startups?

On the LMSYS Chatbot Arena Leaderboard, which pits chatbots against each other anonymously and prompts users to judge which one generated a better answer...
Taylor Swift Deepfakes, GPT-4 Biothreats, New Leaderboards, LLMs That Get Inside Your Head
Feb 07, 2024

Taylor Swift Deepfakes, GPT-4 Biothreats, New Leaderboards, LLMs That Get Inside Your Head

The Batch AI News and Insights: On the LMSYS Chatbot Arena Leaderboard, which pits chatbots against each other anonymously and prompts users to judge which one generated a better answer, Google’s Bard (Gemini Pro)

Subscribe to The Batch

Stay updated with weekly AI News and Insights delivered to your inbox