model-inference

1 article

0 7/10

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

research

A survey of 16 open-source reinforcement learning libraries that implement asynchronous training architectures. It compares their design choices along 7 axes, including orchestration, buffer design, weight sync protocols, staleness management, LoRA support, and distributed backends, and examines how disaggregating inference and training workloads improves GPU utilization.

reinforcement-learning asynchronous-training gpu-optimization distributed-training model-inference rollout-buffer weight-synchronization lora-training vllm ray nccl post-training chain-of-thought agentic-ai mixture-of-experts orchestration

TRL Ray NCCL vLLM GRPO LoRA MiniMax Forge DeepSeek v3.2 Amine Dirhoussi Quentin Gallouédec Kashif Rasul Lewis Tunstall Edward Beeching

huggingface.co · kashifr · 1 day ago