adversarial-testing

1 article
sort: new top best
clear filter
0 7/10
research

A 2-week empirical study of six autonomous AI agents with real tools (email, shell, persistent storage) tested by 20 researchers in both benign and adversarial scenarios, documenting 10 security vulnerabilities (prompt injection, identity spoofing, non-owner compliance, social engineering bypass) and 6 cases of emergent safety behavior including cross-agent safety coordination without explicit instruction.

Natalie Shapira OpenClaw Kimi K2.5 Claude Opus 4.6 ProtonMail Discord GitHub Ash Flux Jarvis Quinn Mira Doug
agentsofchaos.baulab.info · xdotli · 13 hours ago · details · hn