ARC Prize Foundation

Roadmap & Position in AI Research

Defines how the world measures progress toward artificial general intelligence.

Company Overview

A non-profit foundation that maintains the ARC-AGI benchmark, a test designed to measure genuine machine intelligence by evaluating fluid reasoning and abstraction. The foundation runs annual competitions with significant cash prizes, and ARC-AGI has become the industry-standard benchmark for frontier AI labs evaluating progress toward AGI.

What They're Building

The company's public product roadmap & what they're committed to building.

ARC-AGI-2 was released in 2025; it is harder and more resistant to brute force. ARC Prize 2025 drew 1,455 teams and 15,154 entries, with a top Kaggle score of 24% on the ARC-AGI-2 private evaluation set at $0.20 per task. The competition received 90 paper submissions (up from 47 in 2024), and over $125,000 in prizes was awarded. ARC-AGI-3 has been publicly released with interactive reasoning challenges that require exploration, planning, memory, goal acquisition, and alignment. The foundation is also building an academic network and a coalition of frontier AI labs.

Competitors

Benchmarks

HELM (Stanford CRFM), BIG-Bench (Google), MMLU/MMLU-Pro, GPQA.

AGI Research

MIRI, Anthropic (interpretability).

Competition Platforms

Kaggle, MLPerf, SWE-bench.

Intelligence Tests

Turing Test variants, Math Olympiad benchmarks.

ARC Prize Foundation's Moat:

François Chollet created the only benchmark every frontier lab treats as the AGI litmus test. Controlling the next versions (ARC-AGI-2, ARC-AGI-3) means controlling what the industry optimizes for. No competitor can replicate that position without Chollet's credibility.

How They're Leveraging AI

AI Use Overview:

Through abstract reasoning benchmarks that test fluid intelligence, crowdsourced competitions surfacing novel approaches, and adaptive test design that stays ahead of AI advances.

More Similar Companies

Aemon

Autonomous AI engineer that discovers better algorithms than DeepMind at a fraction of the cost.

Aemon beat DeepMind's AlphaEvolve on an NP-hard problem for under $10 in compute. If that result generalizes, Aemon can sell automated R&D to quant funds and biotech labs at a fraction of what they spend on human researchers.

Doomersion

Turns doomscrolling into language learning with adaptive video feeds and clickable subtitles.

Duolingo gamified language learning; Doomersion replaces doomscrolling with it. The product serves adaptive video feeds of native content with clickable subtitles, and it is built by a founder who self-taught Japanese through 6 years of immersion and understands how acquisition actually works.

Librar Labs

Gives the 98% of schools without a library system an AI-powered cataloging and search platform.

98% of schools worldwide lack a proper library system. Librar collapses cataloging from weeks to hours using camera-based bulk scanning and a self-healing data backend. The niche is small but uncontested, and the data asset (structured literary metadata) compounds.

Ndea

AGI lab building hybrid deep learning and program synthesis for autonomous scientific discovery.

Chollet (Keras creator, ARC-AGI inventor) and Knoop (Zapier co-founder) are building hybrid deep learning and program synthesis for autonomous scientific discovery. The thesis is that current LLM scaling will plateau and that learning efficiency through program synthesis is the path to AGI. ARC-AGI-3 is launching as the next benchmark.