Agile V Skills addresses a critical gap in AI-assisted software development: ensuring that AI-generated code is independently verified and traceable to requirements, rather than relying on the same AI agent to both write and test code (which introduces confirmation bias).
A former backend lead at Manus proposes replacing traditional function-calling in LLM agents with a single Unix-style run(command="...") tool that leverages pipes and shell operators. The argument is twofold: LLMs are naturally aligned with the CLI patterns they have seen extensively in training data, and collapsing many tools into one reduces the cognitive load of tool selection while enabling composition.
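The single-tool approach described above might look like the following minimal sketch. This is an illustrative assumption, not the author's implementation: the function name run, the timeout default, and the decision to merge stdout and stderr are all hypothetical choices.

```python
import subprocess

def run(command: str, timeout: int = 30) -> str:
    """The agent's one and only tool: execute a full shell command line.

    Because shell=True hands the string to the shell, the model can use
    pipes, redirection, and && chaining instead of choosing among many
    bespoke function-calling tools. (Hypothetical sketch.)
    """
    result = subprocess.run(
        command,
        shell=True,           # enables pipes and shell operators
        capture_output=True,
        text=True,
        timeout=timeout,      # guard against runaway commands
    )
    # Return combined output so the model sees errors too.
    return result.stdout + result.stderr

# Composition via standard Unix utilities rather than tool selection:
print(run("printf 'b\\na\\n' | sort | head -n 1"))  # prints "a"
```

In practice a real agent harness would add sandboxing and output truncation; the point of the sketch is only that one generic tool plus shell composition can replace a large tool catalog.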
This article introduces golden sets—structured regression testing frameworks for probabilistic AI workflows that combine representative test cases, explicit scoring rubrics, and versioned evaluation contracts to detect regressions across prompt, model, retrieval, and policy changes before production impact.
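A golden set as described above could be sketched roughly as follows. All names here (GoldenCase, GoldenSet, the substring-based rubric, the version field) are illustrative assumptions, not the article's actual schema:

```python
from dataclasses import dataclass

@dataclass
class GoldenCase:
    """One representative test case with an explicit scoring rubric."""
    case_id: str
    input_prompt: str
    must_contain: list[str]      # rubric: required substrings
    must_not_contain: list[str]  # rubric: forbidden substrings

@dataclass
class GoldenSet:
    """Versioned evaluation contract: bump version when cases or
    rubrics change so scores stay comparable across runs."""
    version: str
    cases: list[GoldenCase]

def score(case: GoldenCase, answer: str) -> float:
    """Return 1.0 if the answer satisfies the rubric, else 0.0."""
    ok = all(s in answer for s in case.must_contain) and not any(
        s in answer for s in case.must_not_contain
    )
    return 1.0 if ok else 0.0

def evaluate(golden: GoldenSet, model) -> float:
    """Mean rubric score across the set. Comparing this number against
    the previous release's score flags regressions from prompt, model,
    retrieval, or policy changes before they reach production."""
    total = sum(score(c, model(c.input_prompt)) for c in golden.cases)
    return total / len(golden.cases)
```

For example, a prompt change that starts producing a forbidden substring in any case drops the aggregate score, which a CI gate can then block.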