llm-optimization

2 articles

A technical analysis of sparsity versus quantization as hardware optimization strategies for neural networks, exploring the architectural challenges of each (the irregular data movement of unstructured sparsity vs. quantization's metadata overhead) and the current compromises (structured sparsity patterns and algorithmic co-design techniques) used in modern AI accelerators.

NVIDIA Ampere · EIE · SCNN · BitNet b1.58 · GPTQ · QuIP · SmoothQuant · AWQ · StreamingLLM · OCP Microscaling Formats · Deep Compression
sigarch.org · matt_d · 23 hours ago · hn
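The structured-sparsity compromise mentioned above can be made concrete: NVIDIA Ampere's 2:4 pattern keeps exactly two of every four consecutive weights, so the hardware can index survivors with tiny fixed-size metadata instead of chasing arbitrary sparse layouts. A minimal numpy sketch of magnitude-based 2:4 pruning (the function name and selection rule are illustrative; production flows such as NVIDIA's ASP library also fine-tune after pruning to recover accuracy):

```python
import numpy as np

def prune_2_to_4(weights: np.ndarray) -> np.ndarray:
    """Apply 2:4 structured sparsity: in every contiguous group of
    four weights, zero the two smallest-magnitude entries.

    Illustrative sketch only; assumes weights.size % 4 == 0.
    """
    flat = weights.reshape(-1, 4)                      # groups of 4
    # column indices of the 2 smallest |w| in each group
    drop = np.argsort(np.abs(flat), axis=1)[:, :2]
    mask = np.ones_like(flat, dtype=bool)
    np.put_along_axis(mask, drop, False, axis=1)       # zero those slots
    return (flat * mask).reshape(weights.shape)

w = np.array([0.9, -0.1, 0.05, -0.7, 0.2, 0.3, -0.8, 0.01])
pruned = prune_2_to_4(w)
# exactly half of each group of four survives:
# [0.9, 0.0, 0.0, -0.7, 0.0, 0.3, -0.8, 0.0]
```

The fixed 50% density per group is what makes this a hardware-friendly compromise: the accelerator's sparse tensor cores can skip the zeros without the unstructured bookkeeping the article describes.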

Tarvos introduces a relay architecture for AI coding agents that mitigates LLM context degradation by dispatching fresh agents sequentially, each reading a static master plan and receiving only a minimal 40-line handoff note, with automatic token-based switching when context budgets are exceeded.

Tarvos · Claude Code · Chroma Research · MIT
github.com · Photon48 · 23 hours ago · hn
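The relay pattern summarized above amounts to a token-budget check wrapped around a work loop: each agent reads the static plan plus a short handoff note, and once its accumulated context exceeds the budget, a fresh agent is dispatched. A hypothetical sketch (TOKEN_BUDGET, do_step, and the word-count tokenizer are stand-ins, not Tarvos's actual API):

```python
TOKEN_BUDGET = 60                  # assumed per-agent budget (tokens)
MASTER_PLAN = "master plan: refactor module X in 10 steps"

def count_tokens(text: str) -> int:
    return len(text.split())       # crude stand-in for a real tokenizer

def do_step(step: int) -> str:
    """Placeholder for one unit of agent work; a real agent would
    call an LLM and edit files here."""
    return f"completed step {step} with some accumulated detail"

def relay(total_steps: int) -> int:
    """Run all steps, dispatching a fresh agent whenever the current
    context exceeds TOKEN_BUDGET. Returns how many agents ran."""
    agents = 1
    handoff = ""                              # first agent gets no note
    context = MASTER_PLAN + "\n" + handoff    # agent reads plan + note
    for step in range(total_steps):
        context += "\n" + do_step(step)       # context grows each step
        if count_tokens(context) > TOKEN_BUDGET:
            # token-based switch: leave a minimal handoff note and
            # start a fresh agent with only the static plan + note
            handoff = f"resume after step {step}"
            context = MASTER_PLAN + "\n" + handoff
            agents += 1
    return agents
```

The key design point from the article survives even in this toy version: accumulated context never carries over between agents, only the static plan and the minimal handoff note do.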