agent-safety

1 article
sort: new top best
clear filter
0 5/10

SENTINEL is an MCP server that audits AI agent reasoning in real-time before high-stakes decisions execute, using a four-stage pipeline (signal fidelity, pattern classification, reliability scoring, authority gate) to detect reasoning failures, policy staleness, and accuracy drift. The system integrates with agentgateway for governance and Datadog/Braintrust for monitoring, demonstrated in a healthcare use case where an insurance claim agent's accuracy drifted from 84% to 44% undetected.

SENTINEL Andrew Espira agentgateway Solo.io Claude GPT Datadog Braintrust Cleric Aetna UnitedHealthcare MCP CEL
espiradev.org · aespira · 2 days ago · details · hn