cursor

2 articles

Cursor describes CursorBench, an internal evaluation suite for measuring AI coding agent quality using real developer workflows rather than public benchmarks. The approach combines offline evals on private task data with online metrics to better distinguish between models and align with actual developer experience.

Cursor CursorBench SWE-bench Verified OpenAI GPT-5 Haiku Cursor Blame
cursor.com · ingve · 1 day ago

An open-source library of 125 go-to-market (GTM) skills for AI coding agents (Claude Code, Codex, Cursor). The skills are structured markdown playbooks that automate sales and marketing workflows such as lead generation, cold email sequences, competitor monitoring, and SEO page generation.

Claude Code OpenAI Codex Cursor Gooseworks goose-skills
github.com · hbamoria · 1 day ago