long-context

1 article

SiMM is an open-source distributed KV cache engine that addresses GPU memory constraints in LLM inference by storing KV cache in RDMA-backed memory pools, achieving a 3.1× speedup over a no-cache baseline and up to 9× lower KV I/O latency on long-context multi-turn workloads.
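To make the idea concrete, here is a minimal sketch of prefix-keyed KV cache reuse: an in-process stand-in for a remote cache pool that maps a hash of the token prefix to stored KV blocks. The class and method names (KVCachePool, put/get) and the use of an in-memory dict are illustrative assumptions, not SiMM's API; SiMM keeps such blocks in RDMA-backed memory pools across nodes.

```python
# Illustrative sketch only: a toy, in-process stand-in for a remote KV-cache
# pool, keyed by a hash of the token prefix. Names here are hypothetical,
# not SiMM's actual interfaces.
import hashlib
from typing import Optional

import numpy as np


class KVCachePool:
    """Toy prefix-keyed KV cache: maps a token-prefix hash to KV tensors."""

    def __init__(self) -> None:
        self._store: dict[str, np.ndarray] = {}

    @staticmethod
    def _prefix_key(token_ids: list[int]) -> str:
        # Hash the token prefix so identical multi-turn contexts hit the same entry.
        return hashlib.sha256(str(token_ids).encode("utf-8")).hexdigest()

    def put(self, token_ids: list[int], kv_block: np.ndarray) -> None:
        self._store[self._prefix_key(token_ids)] = kv_block

    def get(self, token_ids: list[int]) -> Optional[np.ndarray]:
        # On a hit, the serving engine can skip recomputing the prefix's KV cache.
        return self._store.get(self._prefix_key(token_ids))


if __name__ == "__main__":
    pool = KVCachePool()
    prefix = [1, 2, 3, 4]                  # token ids of a shared conversation prefix
    pool.put(prefix, np.zeros((2, 4, 8)))  # fake KV block: (layers, heads, head_dim)
    print(pool.get(prefix) is not None)    # True: reuse the cached prefix instead of recomputing
```

The claimed speedups come from exactly this reuse pattern on multi-turn workloads: repeated conversation prefixes hit the shared pool instead of being recomputed or evicted under GPU memory pressure.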

SiMM SGLang vLLM OpenRouter RDMA
github.com · SherryWong · 14 hours ago