quality-assurance

2 articles
sort: new top best
clear filter
0 3/10

Agile V Skills addresses a critical gap in AI-assisted software development: ensuring that AI-generated code is independently verified and traceable to requirements, rather than relying on the same AI agent to both write and test code (which introduces confirmation bias).

Agile V Skills
github.com · JoshuaWellbrock · 11 hours ago · details · hn
0 5/10

This article introduces golden sets—structured regression testing frameworks for probabilistic AI workflows that combine representative test cases, explicit scoring rubrics, and versioned evaluation contracts to detect regressions across prompt, model, retrieval, and policy changes before production impact.

Heavy Thought Laboratories
heavythoughtcloud.com · ryan-s · 18 hours ago · details · hn