Arena (formerly LLMArena)

Product & Competitive Intelligence

Crowdsourced human-preference benchmarking platform for LLMs and generative AI models.

Company Overview

Arena is an AI evaluation platform that runs human-preference model comparisons and publishes leaderboards used by labs and enterprises. Serving AI labs (OpenAI, Google, Anthropic, xAI) and enterprise buyers in software engineering, legal, medical, and research workflows.

Latest Intel

Zeitgeist tracks private signals to determine where the company is heading strategically.

No Signals Yet

View All The Latest Signals

What They're Building

The company's public product roadmap & what they're committed to building.

Community Leaderboard:

Public head-to-head voting across text, image, and (as of Jan 2026) video models.

Arena Enterprise Evaluations:

Commercial benchmarking service for model labs and enterprises, reportedly $30M ARR within four months of launch.

Arena-Hard and RouteLLM:

Research-grade datasets and routing tools released via the lmarena GitHub org.

Vision Arena:

Multimodal preference evaluation extending the battle format to image and video outputs.

Competitors

Hugging Face:

Model hosting and open leaderboards, broader scope but less focused on human-preference battles.

Artificial Analysis:

Automated benchmarking and pricing comparisons, no crowdsourced preference layer.

Vellum:

Enterprise eval and prompt ops tooling aimed at application teams, not model labs.

Arena (formerly LLMArena)

's Moat:

Proprietary dataset of millions of human preference votes across frontier models, a data asset no competitor can replicate without matching Arena's community scale.