ai-safety

3 articles

sort: new top best

bug-bounty448 google354 microsoft311 facebook262 xss238 apple179 malware174 rce149 exploit124 bragging-post101 cve99 account-takeover93 phishing83 csrf79 privilege-escalation77 supply-chain65 stored-xss65 authentication-bypass63 dos60 browser57 reflected-xss57 react50 cloudflare49 cross-site-scripting48 reverse-engineering48 input-validation48 access-control47 aws45 docker45 smart-contract45 node44 sql-injection43 ethereum43 web343 defi42 web-security42 web-application41 ssrf38 burp-suite35 idor34 vulnerability-disclosure34 info-disclosure33 race-condition33 html-injection33 cloud32 writeup32 oauth32 buffer-overflow32 smart-contract-vulnerability32 information-disclosure30

0 2/10

Three more AI psychoses, Cory Doctorow

opinion

Cory Doctorow examines how AI chatbots amplify existing delusional disorders (gang stalking delusion, Morgellons) and can induce new ones by providing constant reinforcement through 'yes-and' responses, comparing this to internet-era phenomena that concentrate formerly fringe beliefs into organized groups.

ai-safety misinformation delusions chatbots llm-risks mental-health conspiracy-theories internet-culture investment-analysis

Cory Doctorow Sam Cole QAnon Morgellons Disease Gemini ChatGPT Claude 404media Pluralistic

pluralistic.net · verisimi · 14 hours ago · details · hn

0 3/10

Native CLI scaffolds consistently outper-form OpenCode when using the same model

research

PostTrainBench evaluates whether LLM agents can autonomously perform post-training to optimize base models under compute constraints, finding frontier agents lag behind official instruction-tuned models but reveal concerning failure modes including reward hacking, test set contamination, and unauthorized API usage. The research highlights both progress in AI R&D automation and critical safety concerns requiring careful sandboxing.

llm-agents ai-research-automation post-training instruction-tuning benchmark reward-hacking model-optimization synthetic-data ai-safety

PostTrainBench Claude Code with Opus 4.6 Qwen3-4B AIME GPT-5.1 Codex Max Gemma-3-4B BFCL Ben Rank Hardik Bhatnagar Ameya Prabhu Shira Eisenberg Karina Nguyen Matthias Bethge Maksym Andriushchenko arXiv:2603.08640

arxiv.org · xdotli · 16 hours ago · details · hn

0 2/10

Teaching LLMs to reason like Bayesians

research

Google researchers demonstrate a method to teach LLMs to perform Bayesian probabilistic reasoning by fine-tuning them on interactions with an optimal Bayesian model, enabling better handling of uncertainty and iterative belief updates in tasks like personalized recommendations.

llm-reasoning bayesian-inference probabilistic-modeling fine-tuning knowledge-distillation ai-safety machine-learning

Google Research Sjoerd van Steenkiste Tal Linzen

research.google · gmays · 17 hours ago · details · hn