Lakera / Check Point Software

Alexandra Hose, 04.11.2025, 08:15

"Crash test" for LLMs in AI agents

Lakera and the UK AI Security Institute have launched 'b3 ', a new open source benchmark. b3 is an open source security evaluation project specifically designed to protect Large Language Models (LLM) in AI agents.

Images

Lakera co-founder Mateo Rojas-Carulla © Lakera

The benchmark b3 was built on the basis of the new idea called Threat Snapshots. Instead of simulating a complete AI agent from start to finish, the threat snapshots zoom in on the critical points where vulnerabilities in LLM frequently occur.

By testing the models at these specific points, developers can see how robust their systems are against attacks - without the complexity that was previously required to model a complete agent workflow. A kind of 'crash test' for AI agents.

LLMs with inference enabled have lower vulnerability scores - lower is better - and are therefore less vulnerable © Lakera, a Check Point Company

"We developed the b3 benchmark because today's AI agents are only as secure as the LLMs that fuel them," explains Lakera co-founder Mateo Rojas-Carulla. "These threat snapshots allow us to systematically look for vulnerabilities on the attack surface that were previously hidden in the complex agent workflows."

b3 combines ten representative threat snapshots with 19,433 real cyberattacks from the gamified red-teaming game 'Gandalf: Agent Breaker'. Among other things, prompt exfiltration, phishing link injection, malicious code injection, DoS and unauthorized tool calls are evaluated.

The first tests with 31 common LLM models show:

Better reasoning capabilities increase security
Model size does not correlate with security performance
Closed source performs better on average, but top open models catch up

The benchmark report is available under an open source license: https://arxiv.org/pdf/2510.22620

Gandalf: Agent Breaker is a hacking simulator game in which you are challenged to crack and exploit AI agents in realistic scenarios. The ten GenAI applications in the game simulate the behavior of a real AI agent. Each app features multiple difficulty levels, layered defenses and novel attack surfaces that challenge a range of skills, from prompt engineering to red teaming. Some of the apps are chat-based, while others rely on code-level thinking, file processing, memory or the use of external tools.

You might also be interested in

PTC

Early Access Program for AI Features

PTC has launched an early-access program with Onshape Labs for AI features on the Onshape cloud-based CAD platform. Users will receive early access to new tools for product development.

Physical AI

Humanoid Robotics at BMW in Spartanburg

"Physical AI" combines digital AI with real machines and robots. This allows intelligent systems, such as humanoid robots, to be integrated into real-world production processes. Following the successful deployment of the Figure 02 humanoid robot at...

AI in Manufacturing

Cybus Brings in Siemens Executive Stefan Schwab to Lead the Company

The Hamburg-based software provider Cybus will have a dual leadership structure going forward: Stefan Schwab will join co-founder Peter Sorowka as co-CEO.

SensoPart and Cambrian Robotics

Partnership for 3D-Guided Robotics

Cambrian Robotics and SensoPart are jointly developing an AI-powered solution for 3D-guided robotic applications. The combination of vision sensors and AI is designed to simplify gripping and positioning processes.

Sophos

Why AI Agents in the SOC Don't Learn Over Time

AI agents support security operations centers, but so far they lack a permanent memory. A technical article explains the challenges facing autonomous security automation.

Industrial AI and Manufacturing

Siemens and IFS Bridge the Gap Between Planning and Operations

Siemens and IFS are collaborating to use industrial AI to enable a closed-loop digital twin across the entire plant lifecycle. The goal is to more closely link design data with real-world operational information and optimize industrial processes.

Artificial Intelligence

Four Steps Toward Greater AI Sovereignty

Following the failure of “Claude Fable 5,” the AI consulting firm Neurawork recommends a robust AI strategy focused on data sovereignty, reliability, and governance.

Physical AI

Alibaba Expands 'Qwen' for Robotics

Alibaba is expanding its Qwen model family with a robotics suite for Physical AI. The three models support manipulation, navigation, and simulation for autonomous robotic systems

Acquisition

Qualcomm Acquires AI Specialist Modular

Qualcomm is acquiring the AI software company Modular. The goal is to expand an open software platform for generative and agent-based AI in edge, cloud, and data center environments.

"Crash test" for LLMs in AI agents

You might also be interested in

Early Access Program for AI Features

Humanoid Robotics at BMW in Spartanburg

Cybus Brings in Siemens Executive Stefan Schwab to Lead the Company

Partnership for 3D-Guided Robotics

Why AI Agents in the SOC Don't Learn Over Time

Siemens and IFS Bridge the Gap Between Planning and Operations

Four Steps Toward Greater AI Sovereignty

Alibaba Expands 'Qwen' for Robotics

Qualcomm Acquires AI Specialist Modular

Categories

Focus areas

Service

Magazine

Our network