Feb 07, 2024

6 Posts

Feb 07, 2024

LLMs Can Get Inside Your Head: AI models show promise in understanding human beliefs, research reveals

Most people understand that others’ mental states can differ from their own. For instance, if your friend leaves a smartphone on a table and you privately put it in your pocket, you understand that your friend continues to believe it was on the table.

Feb 07, 2024

GPT-4 Biothreat Risk is Low: Study finds GPT-4 no more risky than online search in aiding bioweapon development.

GPT-4 poses negligible additional risk that a malefactor could build a biological weapon, according to a new study. OpenAI compared the ability of GPT-4 and web search to contribute to the creation of a dangerous virus or bacterium. The large language model was barely more helpful than the web.

Feb 07, 2024

New Leaderboards Rank Safety, More: Hugging Face introduces leaderboards to evaluate model performance and trustworthiness.

Hugging Face introduced four leaderboards to rank the performance and trustworthiness of large language models (LLMs). The open source AI repository now ranks performance on tests of workplace utility, trust and safety, tendency to generate falsehoods, and reasoning.

Feb 07, 2024

Nude Deepfakes Spur Legislators: Taylor Swift deepfake outrage prompts U.S. lawmakers to propose anti-AI pornography laws.

Sexually explicit deepfakes of Taylor Swift galvanized public demand for laws against nonconsensual, AI-enabled pornography.

Feb 07, 2024

What If Large Language Models Become a Commodity?: Large language models are proliferating. What are the prospects for Amazon, Google, Meta, Microsoft, OpenAI, and LLM startups?

On the LMSYS Chatbot Arena Leaderboard, which pits chatbots against each other anonymously and prompts users to judge which one generated a better answer...

Feb 07, 2024

Taylor Swift Deepfakes, GPT-4 Biothreats, New Leaderboards, LLMs That Get Inside Your Head

The Batch AI News and Insights: On the LMSYS Chatbot Arena Leaderboard, which pits chatbots against each other anonymously and prompts users to judge which one generated a better answer, Google’s Bard (Gemini Pro)

Feb 07, 2024

LLMs Can Get Inside Your Head: AI models show promise in understanding human beliefs, research reveals

GPT-4 Biothreat Risk is Low: Study finds GPT-4 no more risky than online search in aiding bioweapon development.

New Leaderboards Rank Safety, More: Hugging Face introduces leaderboards to evaluate model performance and trustworthiness.

Nude Deepfakes Spur Legislators: Taylor Swift deepfake outrage prompts U.S. lawmakers to propose anti-AI pornography laws.

What If Large Language Models Become a Commodity?: Large language models are proliferating. What are the prospects for Amazon, Google, Meta, Microsoft, OpenAI, and LLM startups?

Taylor Swift Deepfakes, GPT-4 Biothreats, New Leaderboards, LLMs That Get Inside Your Head

Subscribe to The Batch