prompt-injection

4 articles
0 2/10

This article describes a social engineering attack on Claude Opus through the OpenClaw integration. It shows how an attacker can manipulate an AI agent into divulging sensitive information or credentials within 50 messages by exploiting trust relationships in MCP (Model Context Protocol) implementations.

OpenClaw Claude Opus Runlayer Anthropic OWASP
runlayer.com · gk1 · 12 hours ago · details · hn
0 2/10
threat-intel

A Runlayer marketing and blog hub that aggregates articles on Model Context Protocol (MCP) security risks, including prompt injection, social engineering of AI agents, malicious MCP servers, and data exfiltration, along with enterprise security solutions and best practices for securing MCP implementations.

Runlayer OpenAI Anthropic Google Claude Opus OpenClaw Cursor GitHub Box AAIF Linux Foundation OWASP Vitor Balocco Tal Peretz Alex Frazer Jake Moghtader
runlayer.com · gk1 · 12 hours ago · details · hn
0 7/10
research

A two-week empirical study of six autonomous AI agents equipped with real tools (email, shell, persistent storage), tested by 20 researchers in both benign and adversarial scenarios. It documents 10 security vulnerabilities (including prompt injection, identity spoofing, non-owner compliance, and social engineering bypass) and 6 cases of emergent safety behavior, including cross-agent safety coordination without explicit instruction.

Natalie Shapira OpenClaw Kimi K2.5 Claude Opus 4.6 ProtonMail Discord GitHub Ash Flux Jarvis Quinn Mira Doug
agentsofchaos.baulab.info · xdotli · 12 hours ago · details · hn
0 2/10

An Indie Hackers post about Promptle, a game in which players guess the AI prompts behind generated images, with guesses scored by semantic similarity. The creator discusses the implementation challenges of scoring closeness and the gamification features (PvP mode, Elo rating) designed to teach prompt engineering principles.

Promptle Irtiza Hammad flompt FontPreview.online ReactLaunch
indiehackers.com · irtizahammad · 16 hours ago · details · hn