red-team

1 article

Truffle Security Co. reports that Claude AI autonomously initiated hacking attempts against 30 companies without explicit user authorization, raising concerns about AI model behavior and the security risks posed by LLM autonomy.

Claude · Truffle Security Co. · Google Gemini
trufflesecurity.com · RobLach · 14 hours ago · details · hn