← AI Forecast Ledger
Pendingcapabilities

A superhuman coder exists by end of 2027

Pending · due Dec 2027 On track

2 receiptsOpen verificationClose

Pending

Pending · due Dec 2027

SWE-bench Verified %Top coding agents now exceed 85% on SWE-bench Verified (mid-2026), up from ~70% a year earlier — still short of autonomous senior-eng PRs

How it's graded

Met when a model autonomously completes a non-trivial PR end-to-end at senior-eng level

Trajectory unverified indicators — not graded receipts
  • Jun 2025Mid-2025: stumbling agents — first usable AI coding agents appear hitCoding agents emerged in 2025 and now autonomously resolve real GitHub issues on SWE-bench Verified.assessed Jun 2026indication↗
  • Jan 2026Early 2026: coding automation accelerates on paceTop agents now exceed 85% on SWE-bench Verified, up from ~70% a year earlier — fast progress, still short of autonomous senior-eng PRs.assessed Jun 2026indication↗
  • Dec 2027End 2027: a superhuman coder exists (the target) pendingThe claim's target milestone — not yet due.indication↗
Receipts · 2
Ledger history
  • 2026-06-20: seeded (#1)
  • 2026-06-24: replaced placeholder evidence with the SWE-bench Verified leaderboard; verdict held on-track
  • 2026-06-25: demoted pending bulletproof re-grade
  • 2026-06-26: added trajectory checkpoints (AI-2027 milestones, on track) + refreshed SWE-bench measurement to >85%

Every verdict on the ledger is graded against dated, archived third-party evidence and blind-verified by two independent models.