The Kurzweil Scorecard: Cross-Provider Synthesis Report
Tracking 147 Predictions Against Real-World Outcomes Through March 2026
Executive Summary
- The "86% accuracy" claim is demonstrably inflated, but the "7% accuracy" critique is equally misleading. The defensible independent accuracy rate for specific, time-bound predictions is approximately 40–50%, rising to 70–75% for directional accuracy (correctly identifying what technology would arrive, regardless of when). The gap between these figures is the central methodological dispute in evaluating Kurzweil's record.
- A domain-stratified accuracy pattern is the most actionable finding. Kurzweil's predictions achieve ~85–95% accuracy in pure information technology (computing hardware, networks, digital media), drop to ~65–70% in AI/software, fall further to ~40–60% in robotics and autonomous systems, and collapse to ~15–20% in biotechnology and nanotechnology — revealing that his Law of Accelerating Returns applies reliably only where progress is purely computational and fails systematically where physical-world deployment, regulatory friction, or biological complexity intervenes.
- The "5–10 year timing optimism" pattern holds but has become domain-differentiated. For pure software/AI predictions, the gap is narrowing (possibly to zero or even negative for AGI). For physical-world deployment predictions (self-driving, VR/AR, nanobots, longevity), the gap has widened to 10–20 years, suggesting Kurzweil's framework is becoming simultaneously more accurate in one domain and less accurate in another.
- The 2029 AGI prediction is Kurzweil's most consequential near-term test. All six providers agree it has shifted from fringe speculation to mainstream plausibility since 2022, but disagree on whether current LLM trajectories constitute genuine progress toward AGI or sophisticated narrow capability. The prediction's outcome within three years will either be his greatest vindication or his most prominent miss.
- Longevity Escape Velocity by 2029–2032 is his most vulnerable near-term prediction. All providers independently assess this as highly unlikely on his stated timeline, with realistic estimates ranging from 2035 to 2045+, representing a consistent 10–15 year optimism bias in the biological domain — the same structural error visible across his entire biotech prediction history.
Cross-Provider Consensus
Finding 1: The True Accuracy Rate Lies Between 40–55% for Specific Predictions
Providers: Perplexity, Anthropic, OpenAI, Gemini, Gemini-Lite, Grok-Premium | Confidence: HIGH
All six providers independently converge on rejecting both Kurzweil's self-assessed 86% and the harshest critic figure of ~7%. The independent academic benchmark from the FHI/Armstrong study (~42%) is cited by five of six providers as the most credible anchor. Providers differ on whether to apply generous partial credit (pushing toward 50–55%) or strict binary scoring (holding at 40–45%), but none accept either extreme.
Finding 2: Directional Accuracy (~70–75%) Substantially Exceeds Timing Accuracy (~40–50%)
Providers: Perplexity, Anthropic, OpenAI, Gemini, Grok-Premium | Confidence: HIGH
Five providers independently identify this as the central insight for practitioners. Kurzweil reliably identifies which technologies will become important but systematically overestimates the speed of mainstream deployment. This distinction has direct strategic implications: his predictions function better as technology roadmaps than as investment calendars.
Finding 3: Domain-Stratified Accuracy — IT Strongest, Biotech Weakest
Providers: Perplexity, Anthropic, OpenAI, Gemini, Grok-Premium | Confidence: HIGH
All five substantive providers independently produce a similar domain hierarchy:
- Computing/networks: ~80–95% accurate
- AI/software: ~65–70% accurate
- Robotics/autonomous systems: ~40–60% accurate
- Biotech/longevity/nanotech: ~15–25% accurate
The consistent explanation across providers: Kurzweil's Law of Accelerating Returns is a valid descriptor of computational progress but fails to account for regulatory, biological, and physical-world friction that doesn't follow exponential curves.
Finding 4: Self-Driving Vehicles Are ~15 Years Behind Kurzweil's Timeline
Providers: Perplexity, Anthropic, OpenAI, Gemini, Grok-Premium | Confidence: HIGH
All five providers agree that Kurzweil's 2009 prediction for autonomous highway driving was wrong on both timing (off by ~15–20 years) and mechanism (he predicted smart roads; reality delivered smart cars). All note that as of 2026, Waymo and limited robotaxi services represent genuine progress, but ubiquitous consumer autonomy remains another 5–10 years away.
Finding 5: The 2029 AGI Prediction Has Shifted from Fringe to Plausible
Providers: Perplexity, Anthropic, OpenAI, Gemini, Gemini-Lite, Grok-Premium | Confidence: HIGH
All six providers independently note that the emergence of GPT-4 and successor models has dramatically increased the perceived plausibility of Kurzweil's 2029 AGI prediction. All note that in 1999 this prediction was considered absurd by mainstream AI researchers; by 2026, it represents a mainstream debate rather than a fringe claim. However, providers diverge on whether current systems are on a trajectory to meet the 2029 date specifically.
Finding 6: Longevity Escape Velocity by 2029 Is Highly Unlikely
Providers: Perplexity, Anthropic, OpenAI, Gemini, Grok-Premium | Confidence: HIGH
All five substantive providers independently assess the 2029 LEV prediction as the most vulnerable near-term claim. The consensus realistic timeline is 2035–2045, representing a 10–15 year optimism bias consistent with Kurzweil's historical pattern in biological domains. All note that AI-driven drug discovery is genuinely accelerating but that regulatory timelines, clinical trial requirements, and biological complexity create irreducible friction.
Finding 7: VR/AR Adoption Is 10–15 Years Behind Kurzweil's Timeline
Providers: Perplexity, Anthropic, OpenAI, Gemini, Grok-Premium | Confidence: HIGH
All five providers agree that Kurzweil's predictions for retinal-projection AR glasses by 2010 and fully immersive VR by the late 2010s failed significantly. The 2025 VR/AR market data (cited by Perplexity and Anthropic: $1.75B market, declining headset shipments in 2025) confirms that consumer adoption remains far below his projections. All providers note the technology exists but adoption has been dramatically slower than predicted.
Finding 8: The Systematic Timing Optimism Pattern Persists and Has Become Domain-Differentiated
Providers: Perplexity, Anthropic, OpenAI, Gemini, Grok-Premium | Confidence: HIGH
All five providers confirm that the 5–10 year timing optimism pattern identified in earlier analyses continues to hold, but with an important evolution: the gap is narrowing for pure AI/software predictions (possibly to near-zero for AGI) while widening for physical-world deployment predictions (now 10–20 years for biotech, robotics, and nanotech). This bifurcation is a new and important finding not present in earlier analyses of Kurzweil's record.
Unique Insights by Provider
Perplexity
- The "Kurzweil Constant" quantified by prediction horizon: Perplexity uniquely breaks down timing error by prediction horizon — near-term predictions (5–10 years out) show 3–7 year delays; medium-term (10–20 years) show 5–12 year delays; long-term (20+ years) show 10+ year delays. This suggests the optimism bias compounds with prediction horizon length, which has direct implications for how to weight his 2045 Singularity prediction. No other provider quantified this relationship explicitly.
- Metaculus forecasting data on AGI timelines: Perplexity uniquely cites specific Metaculus prediction market data showing that the median "strong AGI" forecast extended from ~2031 to ~2033 during 2025 alone, even as AI capabilities improved. This counterintuitive finding — forecasters becoming more pessimistic even as capabilities advance — suggests the community is updating on bottlenecks (data quality limits, inference-compute scaling) rather than raw capability demonstrations. This is the most specific quantitative data point on AGI timing in the entire report.
- Specific AI benchmark data for March 2026: Perplexity cites GPT-5.4 achieving 75% on OSWorld-Verified desktop navigation (exceeding human performance at 72.4%) and 92.8% on web navigation tasks. This is the most specific and recent AI capability data point across all providers and directly bears on the AGI 2029 assessment.
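The horizon-dependent delay pattern in the first bullet can be written down as a rough lookup. This is only a sketch of the quoted ranges (the function name is ours, and the open-ended "10+ years" bucket is represented with `None` as the upper bound):

```python
def expected_delay_years(horizon_years):
    """Delay range (low, high) implied by the horizon buckets quoted above.

    A high bound of None means the bucket is open-ended ("10+ year delays").
    """
    if horizon_years <= 10:
        return (3, 7)    # near-term predictions: 3–7 year delays
    if horizon_years <= 20:
        return (5, 12)   # medium-term: 5–12 year delays
    return (10, None)    # long-term (20+ years): 10+ years, open-ended

# A 2045 prediction made in 2005 is a 40-year horizon: expect 10+ years of slip.
print(expected_delay_years(40))
```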
Anthropic
- The "timing offset grows with domain complexity" framework: Anthropic uniquely formalizes the domain-differentiated timing error into a four-tier taxonomy with approximate offsets: pure computing/software (0–3 years), software + consumer adoption (5–8 years), hardware + regulatory + physical world (10–20 years), biology + nanotechnology (20+ years). This is the most actionable framework for practitioners adjusting Kurzweil's predictions for real-world use.
- Smart glasses vs. VR headset divergence: Anthropic uniquely notes that while VR headset shipments declined 42.8% in 2025, smart glasses grew 211.2% in the same year. This divergence — the form factor Kurzweil predicted (glasses) succeeding even as the form factor the industry bet on (headsets) fails — represents a nuanced partial vindication of his directional prediction that no other provider captures.
- Neuralink clinical trial specifics: Anthropic provides the most detailed BCI data, noting 21 Neuralink participants as of early 2026 (up from 12 in September 2025), with a third ALS patient demonstrating typing capability, and Synchron raising $200M Series D backed by Bezos and Gates. This granularity is absent from other providers.
OpenAI
- The "external shock" blind spot in Kurzweil's methodology: OpenAI uniquely emphasizes that Kurzweil's framework, grounded in exponential technological trajectories, has no mechanism for incorporating geopolitical shocks, pandemics, or financial crises. The 2008 financial crisis (which Kurzweil predicted would not occur, forecasting continued economic boom) and COVID-19 (which temporarily reversed life expectancy gains globally) are cited as structural failures of his model that no amount of timing adjustment can correct. This is a methodological critique, not merely a timing critique.
- The "adoption vs. capability" distinction applied to speech recognition: OpenAI provides the most detailed analysis of the speech recognition prediction failure, noting that the technology was capable enough for mainstream use before 2009, yet adoption didn't follow because of competing preferences and ingrained behaviors. This case study is used to argue that Kurzweil's framework systematically conflates technological capability with social adoption — a distinction with broad implications for evaluating all his predictions.
Gemini
- The "Kurzweil Constant" as a domain-specific adjustment tool: Gemini uniquely proposes applying a systematic "physical-world delay" correction factor to Kurzweil's upcoming predictions: AGI (no penalty, purely computational), LEV (add 10–15 years → 2040–2045), nanobots in neocortex (add 20+ years → 2050s+). This is the most operationally useful framework for practitioners who want to use Kurzweil's predictions as inputs to planning.
- The Soviet Union prediction as an early validation: Gemini uniquely highlights Kurzweil's prediction of the Soviet Union's collapse (made in The Age of Intelligent Machines, written 1986–1989) as an early and often-overlooked validation of his framework — specifically his argument that decentralized communication technologies would render authoritarian information control impossible. This prediction, made years before the collapse, demonstrates that his framework has genuine predictive power beyond technology domains.
- The "non-causal model" critique: Gemini uniquely identifies that Kurzweil uses "non-causal models" — extrapolating historical data points without mapping specific underlying physics — which explains both his successes (correctly extrapolating trends) and failures (missing when underlying mechanisms change). This is a more sophisticated methodological critique than the "vague predictions" argument made by other providers.
Gemini-Lite
- The "process vs. event" distinction: Gemini-Lite uniquely articulates the distinction between Kurzweil as a "process forecaster" (predicting when a technology becomes viable) versus a "calendar forecaster" (predicting when it achieves mass adoption). This framing, while less detailed than other providers' analyses, offers a charitable and potentially correct reframing of how his predictions should be evaluated — not as calendar predictions but as capability emergence predictions.
Grok-Premium
- The pre-registration critique: Grok uniquely recommends that future audits of futurist predictions use pre-registered probabilistic forecasts for better calibration, rather than retrospective self-assessment. This methodological recommendation — essentially applying superforecasting standards to futurism — is the most forward-looking methodological contribution in the report.
- The "valuable exercise regardless of accuracy" argument: Grok uniquely argues that the act of making many falsifiable predictions decades ahead is itself rare and valuable, independent of accuracy rate. This meta-point — that Kurzweil's framework has inspired researchers and shaped investment even when specific predictions failed — is underemphasized by other providers who focus primarily on accuracy scoring.
- Drone/unmanned warfare as a strong hit: Grok uniquely highlights unmanned systems in warfare as a domain where Kurzweil's predictions were strongly validated, noting that drones have become central to modern conflict in ways that align with his forecasts. Other providers focus on consumer technology and largely overlook this domain.
Contradictions and Disagreements
Contradiction 1: The Baseline Accuracy Rate
The Dispute: Providers disagree on the appropriate baseline accuracy figure, with estimates ranging from 40% (strict independent scoring) to 55% (generous partial credit) to 86% (Kurzweil's self-assessment).
- Perplexity synthesizes to 45–52% strict, 55–65% generous, 70–75% directional
- Anthropic synthesizes to 35–50% specific/time-bound, 70–80% directional
- OpenAI cites 40–50% independent, 86% self-assessed
- Gemini cites 42% (FHI study) as the anchor
- Grok synthesizes to 40–60% strict, 70–85% directional
Why It Matters: The disagreement is not merely academic — it determines whether Kurzweil's framework should be used as a primary planning input (if ~80% directionally accurate) or a supplementary signal requiring heavy adjustment (if ~40% strictly accurate). The resolution depends on whether practitioners care about direction or timing, which varies by use case.
Investigative Note: The FHI/Armstrong study (42%) is the most methodologically rigorous independent assessment and should be treated as the floor. The directional accuracy figure (~70–75%) is the most useful for strategic planning. Neither Kurzweil's 86% nor the harshest critic's 7% survives scrutiny.
Contradiction 2: Whether the AGI 2029 Prediction Is "On Track" or "Still Too Early"
The Dispute: Providers diverge significantly on whether current AI progress supports the 2029 AGI timeline.
- Gemini argues the 2029 prediction may actually be conservative, citing Musk's 2026 prediction and Huang's similar timeline, and suggests current AI scaling laws support the date
- Perplexity notes Metaculus forecasters extended their median "strong AGI" estimate from 2031 to 2033 during 2025, suggesting the community is becoming more pessimistic even as capabilities improve
- Anthropic assigns 40–60% probability to the prediction being "broadly correct," the most explicitly probabilistic assessment
- OpenAI labels it "likely too early" with human-level AI possibly slipping to the 2030s
- Grok calls it "on track / too early to fully score" — the most agnostic position
Why It Matters: This is the highest-stakes near-term prediction in Kurzweil's entire body of work. The disagreement reflects genuine uncertainty about whether LLM capability improvements represent progress toward AGI or sophisticated narrow capability expansion. The Metaculus data (Perplexity) and the Gemini/industry-leader data point in opposite directions.
Investigative Note: The contradiction may be definitional — "AGI" means different things to different evaluators. Kurzweil's own definition (passing a sophisticated Turing test) may already be partially met; a stricter definition (autonomous agency across all human cognitive domains) is clearly not met. Future audits should specify which definition is being evaluated.
Contradiction 3: Whether Kurzweil's Timing Optimism Is "5–10 Years" or "10–20 Years"
The Dispute: Providers disagree on the magnitude of the timing error.
- Perplexity and OpenAI characterize the systematic error as "5–10 years" across most domains
- Anthropic and Gemini argue the error has widened to 10–20 years for physical-world predictions (self-driving: ~15–20 years late; VR: ~10–15 years late; biotech: ~15–20 years late)
- Grok takes a middle position, noting the error varies by domain
Why It Matters: If the timing error is 5–10 years, Kurzweil's 2029 AGI prediction could be off by only a few years. If the error is 10–20 years for complex domains, his longevity and nanotech predictions may not materialize until 2045–2060, fundamentally changing their strategic relevance.
Investigative Note: The Anthropic framework (domain-stratified timing error) is the most empirically grounded resolution. The "5–10 year" characterization appears to be an average that obscures important domain-specific variation. Practitioners should apply domain-specific corrections rather than a uniform offset.
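A domain-specific correction of this kind can be sketched in a few lines. The offsets below are illustrative midpoints read off the tiers discussed above (Anthropic's taxonomy and Gemini's correction factors); the dictionary keys and function name are ours, not anything the providers define:

```python
# Midpoint timing offsets (years) for each domain tier; values illustrative.
DOMAIN_OFFSET_YEARS = {
    "computing_software": 1,   # pure computing/software: 0–3 year tier
    "software_adoption": 6,    # software + consumer adoption: 5–8 year tier
    "physical_world": 15,      # hardware + regulatory + physical: 10–20 years
    "biology_nanotech": 20,    # biology + nanotechnology: 20+ year tier
}

def adjust_prediction(predicted_year, domain):
    """Shift a stated Kurzweil date by the domain's historical timing offset."""
    return predicted_year + DOMAIN_OFFSET_YEARS[domain]

# AGI (purely computational) barely moves; nanobots-in-the-brain moves decades.
print(adjust_prediction(2029, "computing_software"))  # 2030
print(adjust_prediction(2035, "biology_nanotech"))    # 2055, i.e. the 2050s+
```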
Contradiction 4: Whether the Law of Accelerating Returns Is Slowing
The Dispute: Providers disagree on whether Moore's Law and the LOAR are continuing or encountering limits.
- Perplexity explicitly notes that Moore's Law is slowing (doubling period lengthening back to 2–3 years), that LLM scaling is encountering bottlenecks, and that recent AI improvements came primarily from inference compute rather than fundamental breakthroughs — suggesting the exponential foundation of Kurzweil's framework may be weakening
- Gemini and Grok largely accept the LOAR as continuing, citing ongoing compute improvements and AI capability gains as validation
- Anthropic and OpenAI take intermediate positions, acknowledging both continued progress and emerging limits
Why It Matters: If the LOAR is genuinely slowing, Kurzweil's entire predictive framework loses its foundation, and his 2029–2045 predictions become significantly less reliable. If it continues, his framework remains the best available tool for technology forecasting.
Investigative Note: This is the most fundamental unresolved question in the report. The semiconductor industry's own data on transistor scaling (TSMC, Intel roadmaps) and AI lab data on training efficiency improvements would be the most direct evidence. This warrants dedicated follow-on research.
Detailed Synthesis
The Accuracy Debate: Resolving the 86% vs. 42% Dispute
The most important preliminary finding is that the widely cited "86% accuracy rate" is a self-assessed figure derived from a specific methodological approach that independent researchers have consistently been unable to replicate [Perplexity, Anthropic, OpenAI, Gemini, Grok]. In 2010, Kurzweil published a retrospective audit of the 147 predictions made in The Age of Spiritual Machines (1999) for outcomes expected by 2009, claiming 115 "entirely correct" and 12 "essentially correct" predictions — a combined 86% [OpenAI, Gemini].
The FHI/Armstrong study, conducted by a panel of nine independent volunteers evaluating the same 147 predictions, arrived at approximately 42% [Gemini, Anthropic, OpenAI]. This discrepancy stems from three methodological differences: Kurzweil's generous interpretation of ambiguous language (any portable device counting toward specific form factor predictions), his willingness to count technologies that existed in prototype form as "correct" even without mainstream adoption, and his tendency to score predictions as "essentially correct" when they were several years late [Perplexity, Anthropic].
A middle position emerges from LessWrong-style independent reviews, which found approximately 50% of predictions more or less correct by 2019 — suggesting that many 2009 predictions that were "too early" in 2009 had materialized by 2019 [Perplexity]. This is a crucial finding: the appropriate accuracy figure depends not just on how you score but when you score. Kurzweil's predictions improve in retrospect as technologies eventually arrive, which is consistent with his "too early, not wrong" defense but also consistent with the critique that his timing is systematically optimistic [Anthropic, OpenAI].
The most defensible synthesis is a three-tier accuracy framework [Anthropic, Perplexity, Grok]:
- Strict accuracy (prediction correct within 1–2 years of stated date): ~40–50%
- Generous accuracy (prediction correct within 5–10 years): ~55–65%
- Directional accuracy (technology type correctly identified regardless of timeline): ~70–75%
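The three scoring regimes can be made concrete with a small sketch. This is a toy illustration: the data is hypothetical, and the ±2-year and ±10-year thresholds are one plausible reading of "strict" and "generous", not a scoring standard from any provider:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Prediction:
    predicted_year: int
    realized_year: Optional[int]  # None = not (yet) realized
    right_technology: bool        # directionally correct, timing aside

def score(preds):
    """Hit rates under strict, generous, and directional scoring."""
    n = len(preds)
    strict = sum(
        p.realized_year is not None and abs(p.realized_year - p.predicted_year) <= 2
        for p in preds
    ) / n
    generous = sum(
        p.realized_year is not None and abs(p.realized_year - p.predicted_year) <= 10
        for p in preds
    ) / n
    directional = sum(p.right_technology for p in preds) / n
    return strict, generous, directional

# Toy example: one on-time hit, one ten-years-late hit, one outright miss.
demo = [
    Prediction(2009, 2009, True),
    Prediction(2009, 2019, True),
    Prediction(2009, None, False),
]
print(score(demo))  # strict 1/3, generous 2/3, directional 2/3
```

The same prediction set yields three very different headline numbers, which is exactly the dynamic behind the 86%-vs-42% dispute.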
Domain-Stratified Performance: The Core Finding
The most actionable insight from cross-provider analysis is that Kurzweil's accuracy is not uniform — it follows a clear domain hierarchy that maps directly onto the degree to which progress in that domain is purely computational versus requiring physical-world deployment [Perplexity, Anthropic, Gemini, Grok].
Computing and Information Technology (~85–95% accurate): This is Kurzweil's strongest domain and the empirical foundation of his entire framework [Gemini, Grok]. His Law of Accelerating Returns was first validated here, and it continues to hold. The price/performance of computation improved by a factor of 20 quadrillion between 1939 and roughly 2020 [Perplexity]. Specific hits include: the internet's explosive growth from 2.6 million users to billions [Gemini, Grok], the emergence of cloud computing [Perplexity, OpenAI], the shift to portable computing (laptop sales exceeding desktops in 2008) [Anthropic, OpenAI], and the proliferation of wireless connectivity [OpenAI, Gemini]. Deep Blue defeating Kasparov in 1997 — three years ahead of Kurzweil's 2000 prediction — is the most frequently cited example of him being early in the right direction [Anthropic, OpenAI, Gemini, Grok].
Artificial Intelligence and Machine Learning (~65–70% accurate): Kurzweil's AI predictions have been his second-strongest domain [Anthropic]. Narrow AI achievements — speech recognition, machine translation, image recognition, game-playing AI — arrived roughly on his timeline or slightly late [OpenAI, Grok]. His prediction that AI would create original art and music by the 2020s has been validated by DALL-E, Midjourney, and ChatGPT [Anthropic]. The significant miss in this domain is speech recognition as the primary text input method by 2009 — the technology worked, but human behavioral preferences kept keyboards dominant [Perplexity, Anthropic, OpenAI]. This case study is particularly instructive: it demonstrates that Kurzweil's framework correctly predicts technological capability but systematically underestimates adoption friction.
Robotics and Autonomous Systems (~40–60% accurate): This domain reveals the first major gap between computational capability and physical-world deployment [Anthropic, OpenAI, Gemini]. Self-driving vehicles are the canonical example: Kurzweil predicted autonomous highway driving by 2009, but as of 2026, commercial robotaxi services (Waymo operating 450,000+ rides per week across multiple cities) represent genuine progress while ubiquitous consumer autonomy remains years away [Anthropic, OpenAI, Grok]. The mechanism was also wrong — Kurzweil predicted smart roads, but the actual solution was smart cars [Anthropic, OpenAI]. Household robots remain largely limited to Roombas, despite decades of predictions about general-purpose domestic automation [OpenAI, Gemini]. Unmanned military systems (drones) represent a strong hit in this domain that is underemphasized in most analyses [Grok].
Biotechnology, Longevity, and Nanotechnology (~15–25% accurate): This is Kurzweil's weakest domain and the area where his timing optimism is most severe [Perplexity, Anthropic, OpenAI, Gemini]. His prediction that "most diseases would be eliminated by 2019" failed comprehensively [Gemini, Anthropic]. Average life expectancy has not approached his predicted 100+ years [OpenAI]. Nanobots for medical applications remain largely theoretical [Gemini, Grok]. The structural explanation is consistent across providers: biology is not a purely information technology, and the complexity of biological systems introduces irreducible friction that exponential computational progress cannot overcome on Kurzweil's timeline [Perplexity, Anthropic, Gemini].
The Critical 2025–2029 Window
AGI by 2029: This is Kurzweil's signature prediction and the one that has undergone the most dramatic reassessment in recent years [all providers]. When made in 1999, it was considered so extreme that Stanford hosted a conference where hundreds of AI experts dismissed it as fantasy [Gemini]. By 2026, it represents a mainstream debate. The emergence of GPT-4, Claude, Gemini, and their successors has demonstrated broad competence across thousands of domains that seemed impossible even five years ago [Perplexity, Anthropic].
However, providers diverge on whether current trajectories support the specific 2029 date. Gemini notes that prominent figures including Elon Musk and Jensen Huang have predicted AGI or human-level capabilities between 2026 and 2028, suggesting Kurzweil may be conservative. Perplexity counters with Metaculus data showing forecasters extended their median "strong AGI" estimate from 2031 to 2033 during 2025 alone, even as capabilities improved — suggesting the community is updating on bottlenecks rather than raw capability demonstrations. Anthropic assigns 40–60% probability to the prediction being broadly correct. OpenAI labels it "likely too early" with human-level AI possibly slipping to the 2030s.
The definitional problem is central: if AGI means "passes a sophisticated Turing test," current systems may already qualify in many contexts [Perplexity]. If it means "autonomous agency across all human cognitive domains," current systems fall far short [OpenAI, Anthropic]. Kurzweil's own definition — AI that "can do everything that any human can do" — remains unmet as of March 2026 [Perplexity].
Longevity Escape Velocity by 2029–2032: All providers independently assess this as Kurzweil's most vulnerable near-term prediction [Perplexity, Anthropic, OpenAI, Gemini, Grok]. The current rate of life expectancy gain is approximately 4 months per year in developed nations [Perplexity, Grok]. Reaching LEV would require tripling this rate in three years — an acceleration that would require not just scientific breakthroughs but simultaneous regulatory approval and global deployment of multiple complementary therapies [Anthropic]. AI-driven drug discovery (AlphaFold, Retro Biosciences, AI-designed senolytics) is genuinely accelerating biological research [Perplexity, Gemini], but the gap between laboratory discovery and clinical deployment remains a multi-year irreducible constraint. Aubrey de Grey, who coined the LEV concept, places the 50% probability date in the mid-to-late 2030s [Anthropic, Gemini]. George Church suggests 2050 [Gemini]. Applying Kurzweil's historical 15-year optimism bias in biological domains yields a realistic estimate of 2044–2047 [Gemini].
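The arithmetic behind the "tripling" claim is worth making explicit (figures as cited above):

```python
# LEV requires gaining at least one year of remaining life expectancy per
# calendar year, i.e. 12 months/year; the cited current rate is ~4 months/year.
current_gain_months_per_year = 4
lev_threshold_months_per_year = 12
print(lev_threshold_months_per_year / current_gain_months_per_year)  # 3.0
```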
Brain-Computer Interfaces: BCI represents perhaps the most concrete recent progress toward Kurzweil's longer-term vision [Anthropic, Grok]. Neuralink has enrolled 21 participants as of early 2026, with demonstrated cursor control, gaming, internet browsing, and typing via thought [Anthropic]. Synchron's endovascular approach achieved safety endpoints with zero serious adverse events [Anthropic]. However, all current implementations remain in early clinical stages focused on restoring lost function to severely disabled patients — not the consumer cognitive augmentation Kurzweil envisions [Anthropic, Gemini]. His specific mechanism (nanobots navigating capillaries to the brain) faces material science and biocompatibility challenges that make the 2030s timeline implausible [Gemini, Grok]. The electrode-based approach currently being pursued is directionally aligned with his vision but mechanistically different and likely 10–15 years behind his timeline for consumer applications [Anthropic].
Self-Driving Vehicles: The current state — Waymo operating 450,000+ rides per week across Phoenix, San Francisco, Los Angeles, Atlanta, Austin, and Miami, with commercial launches planned in additional cities in 2026 [Anthropic] — represents genuine progress that validates Kurzweil's directional prediction while confirming his timing was approximately 15–17 years optimistic [Anthropic, OpenAI, Grok]. The mechanism error (smart roads vs. smart cars) is a notable miss that reveals a limitation in his framework: he correctly predicted the outcome (autonomous vehicles) but incorrectly predicted the technological pathway [Anthropic, OpenAI]. McKinsey survey data from 2025 indicates that expert adoption timelines have actually slipped by 1–2 years relative to 2023 forecasts, suggesting continued headwinds [Perplexity].
VR/AR Adoption: The 2025 market data tells a nuanced story [Anthropic]. VR headset shipments declined 42.8% in 2025, with Apple and Meta dropping advertising for their headsets — suggesting consumer appetite remains fundamentally limited [Anthropic]. However, smart glasses grew 211.2% in 2025 [Anthropic], suggesting that the form factor Kurzweil predicted (glasses) may be succeeding even as the form factor the industry bet on (headsets) struggles. The global AR/VR market at $1.75 billion in 2025 with a 10.3% CAGR [Perplexity] implies mainstream adoption remains a decade or more away. Kurzweil's retinal-projection prediction for 2010 was off by at least 15–20 years [Perplexity, Anthropic, OpenAI].
The Structural Explanation: Why the Pattern Persists
The consistent explanation across providers for Kurzweil's systematic timing optimism is that his Law of Accelerating Returns correctly describes computational capability but fails to account for the friction between capability and deployment [Perplexity, Anthropic, OpenAI, Gemini]. Anthropic formalizes this as a four-tier taxonomy with domain-specific timing offsets. Gemini describes it as the gap between "computationally feasible" and "societally deployed." OpenAI frames it as the failure to distinguish technological capability from social adoption. Perplexity identifies regulatory barriers, liability frameworks, insurance models, and behavioral adoption patterns as the specific mechanisms that create this friction.
Gemini's "non-causal model" critique adds a deeper methodological dimension: Kurzweil extrapolates historical data points without mapping specific underlying physics, which works when the underlying mechanisms are stable but fails when they change [Gemini]. This explains why his computing predictions (where Moore's Law provided a stable underlying mechanism for decades) succeeded while his biological predictions (where the underlying mechanisms are vastly more complex and less well-understood) failed.
The question of whether the Law of Accelerating Returns itself is slowing remains genuinely contested [Perplexity vs. Gemini/Grok]. Perplexity's observation that Moore's Law doubling periods are lengthening and that recent LLM improvements came primarily from inference compute rather than fundamental breakthroughs is the most specific evidence for a slowdown. If confirmed, this would undermine the foundation of Kurzweil's entire predictive framework for the 2029–2045 period.