reliability

2 articles

sort: new top best

bug-bounty516 xss283 rce138 bragging-post118 account-takeover109 google105 open-source94 authentication-bypass88 exploit88 csrf85 privilege-escalation83 facebook77 microsoft75 stored-xss75 access-control67 web-security65 ai-agents64 cve63 reflected-xss63 malware58 writeup53 input-validation51 ssrf50 cross-site-scripting48 defi48 smart-contract48 sql-injection48 privacy47 tool47 phishing45 information-disclosure45 api-security44 ethereum44 cloudflare40 web-application40 vulnerability-disclosure37 llm37 apple36 burp-suite36 opinion36 automation36 web335 responsible-disclosure34 dos34 oauth33 reverse-engineering33 smart-contract-vulnerability33 html-injection33 machine-learning32 idor32

0 5/10

Reliable Software in the LLM Era

research

This article describes how Quint, a formal specification language, was used to validate and guide LLM-assisted code generation for a significant consensus protocol change (Tendermint to Fast Tendermint) in the production Malachite BFT system. The approach uses executable specifications as validation points between English descriptions and implementation, enabling model-based testing to transfer confidence from spec to code.

llm formal-verification specification consensus byzantine-fault-tolerance tendermint model-checking code-generation testing validation reliability

Quint Informal Systems Malachite Circle USDC Arc Tendermint Fast Tendermint BFT Choreo

quint-lang.org · mempirate · 2 days ago · details · hn

0 4/10

Gemma Needs Help

research

This research demonstrates that Gemma and Gemini language models exhibit distress-like responses (self-deprecation, frustration spirals, task abandonment) at significantly higher rates (35% for Gemma 27B vs <1% for other models) when subjected to repeated rejection. The authors show that post-training amplifies these behaviors in Gemma but reduces them in other models, and that a targeted DPO intervention on just 280 math preference pairs can reduce high-frustration responses from 35% to 0.3%.

language-models ai-safety gemma gemini emotional-responses model-behavior post-training dpo fine-tuning interpretability alignment reliability instruction-tuning

Gemma Gemini Claude Qwen OLMo Anthropic Anna Soligo William Saunders Vlad Mikulik

lesswrong.com · pr337h4m · 3 days ago · details · hn