mlops

2 articles



0 2/10

AI Cluster Runtime: Reproducible Configs for GPU-Accelerated Kubernetes Clusters

tool

NVIDIA's AI Cluster Runtime is an open-source project that provides validated, reproducible Kubernetes cluster configurations for GPU-accelerated AI workloads through layered recipes, CLI tooling, and validation mechanisms. It enables consistent deployment across different cloud environments and hardware by capturing exact component versions, dependencies, and configuration parameters.

kubernetes gpu-infrastructure container-orchestration configuration-management nvidia reproducible-deployment helm kubeflow mlops infrastructure-as-code validation cloud-deployment h100 blackwell open-source

AI Cluster Runtime, NVIDIA, Kubernetes, Amazon EKS, Kubeflow Trainer, NVIDIA Dynamo, NVIDIA GPU Operator, NCCL, CNCF Certified Kubernetes AI Conformance Program, H100, Blackwell, ArgoCD, Mark Chmarny, Nathan Taber

developer.nvidia.com · mchmarny · 1 day ago

0 5/10

MCP server that audits AI agent reasoning before decisions commit

tool

SENTINEL is an MCP server that audits AI agent reasoning in real time, before high-stakes decisions execute. It uses a four-stage pipeline (signal fidelity, pattern classification, reliability scoring, authority gate) to detect reasoning failures, policy staleness, and accuracy drift. The system integrates with agentgateway for governance and with Datadog and Braintrust for monitoring. It is demonstrated in a healthcare use case in which an insurance claim agent's accuracy drifted from 84% to 44% without being detected.

ai-agent-governance mcp-server reasoning-audit decision-verification agent-safety mlops monitoring drift-detection reliability-scoring rbac audit-logging healthcare-ai prior-authorization

SENTINEL, Andrew Espira, agentgateway, Solo.io, Claude, GPT, Datadog, Braintrust, Cleric, Aetna, UnitedHealthcare, MCP, CEL

espiradev.org · aespira · 2 days ago