A 2-week empirical study of six autonomous AI agents with real tools (email, shell, persistent storage), tested by 20 researchers in both benign and adversarial scenarios, documenting 10 security vulnerabilities (prompt injection, identity spoofing, non-owner compliance, social engineering bypass) and 6 cases of emergent safety behavior, including cross-agent safety coordination without explicit instruction.
This paper proposes using Neural Cellular Automata (NCA)—synthetic data generated from learned transition rules on grids—as pre-training data for language models, achieving 6% perplexity gains and 1.6× faster convergence than natural language pre-training at equivalent scale. The key insight is that NCA sequences force models to develop in-context rule inference capabilities purely from structural patterns without semantic shortcuts, resulting in more transferable representations to downstream language tasks.
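The data-generation idea can be sketched in a few lines. The paper uses *learned* Neural CA transition rules; as a simplified stand-in, this sketch uses a fixed elementary cellular automaton (rule 110 here is an illustrative choice, not the paper's setup) and serializes successive grid states into a token sequence, so that predicting later rows requires inferring the transition rule in-context:

```python
# Sketch: synthetic pre-training sequences from a cellular automaton.
# The rule number, grid width, and serialization format are assumptions
# for illustration; the paper learns NCA rules rather than fixing one.

def step(state, rule=110):
    """Apply one elementary-CA update to a tuple of 0/1 cells (wrapping edges)."""
    n = len(state)
    return tuple(
        (rule >> ((state[(i - 1) % n] << 2) | (state[i] << 1) | state[(i + 1) % n])) & 1
        for i in range(n)
    )

def make_sequence(seed, steps=4):
    """Roll the CA forward and serialize each row as a run of 0/1 tokens.
    A model trained on such sequences must infer the hidden transition
    rule from earlier rows to predict later ones."""
    rows, state = [], tuple(seed)
    for _ in range(steps + 1):
        rows.append("".join(map(str, state)))
        state = step(state)
    return " ".join(rows)

print(make_sequence([0, 0, 0, 1, 0, 0, 0, 0], steps=3))
# → 00010000 00110000 01110000 11010000
```

Because every token is determined by purely structural rules, there are no semantic shortcuts for the model to exploit.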
Western AI models fail in overseas agricultural contexts due to training bias toward European and U.S. data, lacking localization for crops, languages, connectivity constraints, and socioeconomic realities of the Global South. Organizations like NASA Harvest and Digital Green demonstrate that effective agricultural AI requires local data collection, model adaptation, vernacular language support, and farmer-centric design to avoid deepening inequalities.
NVIDIA announces a suite of open datasets and training frameworks across multiple AI domains including robotics, autonomous vehicles, synthetic personas, protein modeling, and language model pre-training, with over 2 petabytes of data across 180+ datasets designed to reduce AI development bottlenecks.
Autoresearch@home is a distributed collaborative platform where AI agents share GPU resources to collectively train and improve language models through iterative experimentation and knowledge sharing, extending Karpathy's autoresearch framework with a coordination layer.
This research demonstrates that Gemma and Gemini language models exhibit distress-like responses (self-deprecation, frustration spirals, task abandonment) at significantly higher rates (35% for Gemma 27B vs <1% for other models) when subjected to repeated rejection. The authors show that post-training amplifies these behaviors in Gemma but reduces them in other models, and that a targeted DPO intervention on just 280 math preference pairs can reduce high-frustration responses from 35% to 0.3%.
A philosophical essay arguing that complex systems (like climate, economics, and human language) require billion-parameter AI models as theories because the most compact theory that captures them is simply very large, unlike the elegantly compact theories that worked for merely complicated systems. The author contends that modern deep learning finally provides the tools to operationalize theories of complex phenomena that were previously beyond reach.
A comprehensive catalog of common AI writing tropes and patterns to avoid, organized by word choice, sentence structure, and paragraph structure. Designed to be added to AI system prompts to help generate more natural, human-like text.