Meta unveiled four custom Broadcom-built AI inference chips (MTIA 300/400/450/500) designed for ranking, recommendation, and generative AI workloads, with plans to deploy multiple gigawatts of capacity starting in 2027. The chips use a modular chiplet architecture with RISC-V cores and HBM stacks, and successive generations claim performance competitive with, or superior to, commercial alternatives such as Nvidia's GPUs.
RunAnywhere released MetalRT, a Metal-optimized GPU inference engine for Apple Silicon that achieves 1.67x faster LLM decode than llama.cpp and 4.6x faster speech-to-text than mlx-whisper through custom GPU shaders and zero-allocation inference. The company also open-sourced RCLI, a fully on-device voice AI pipeline that chains STT, LLM, and TTS with sub-600ms end-to-end latency.
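A voice pipeline of this shape is essentially three stages run back to back, with the end-to-end latency being the sum of the stage latencies. The following minimal Python sketch shows that structure with stub components; the function names (`transcribe`, `generate`, `synthesize`, `voice_turn`) are illustrative placeholders, not RCLI's actual API.

```python
import time

def transcribe(audio: bytes) -> str:
    # Stub speech-to-text stage; a real engine would run a Whisper-class model.
    return "what time is it"

def generate(prompt: str) -> str:
    # Stub LLM stage; a real engine would decode tokens on the GPU.
    return f"You asked: {prompt}"

def synthesize(text: str) -> bytes:
    # Stub text-to-speech stage; a real engine would emit audio samples.
    return text.encode("utf-8")

def voice_turn(audio: bytes) -> tuple[bytes, float]:
    """Run one STT -> LLM -> TTS turn and report end-to-end latency in ms."""
    start = time.perf_counter()
    reply_audio = synthesize(generate(transcribe(audio)))
    latency_ms = (time.perf_counter() - start) * 1000.0
    return reply_audio, latency_ms

reply, latency = voice_turn(b"\x00" * 1600)
print(f"end-to-end latency: {latency:.1f} ms")
```

The sub-600ms figure implies each stage must stay well under ~200ms on average, which is why keeping the entire chain on one device (no network round trips) and avoiding per-inference allocations matters for the total budget.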