Telecom API incumbent with massive carrier reach, but AgentPhone competes by packaging phone workflows for AI agents.
Voice agent infrastructure player focused on real-time calls, while AgentPhone widens the surface to numbers, SMS, webhooks, and MCP.
Voice AI platform for phone agents, stronger on call automation while AgentPhone pitches a lower-level agent phone layer.
A defensible position is still ahead of the company. The likely path is workflow switching costs as teams wire numbers, webhooks, transcripts, and MCP tools deep into their agent stacks and accumulate operational state inside AgentPhone.
AgentPhone appears to orchestrate streaming STT, hosted LLM prompts, TTS, webhooks, and MCP tool calls rather than training proprietary models, which makes the product an integration and developer-experience play, not a model play.
Building human-like AI voices that speak, clone, dub, and converse in 70+ languages
Having established defensible voice quality and market share through its API, ElevenLabs is now becoming a multimodal generation platform with an enterprise go-to-market engine.
Voice AI infrastructure for real-time speech-to-text, text-to-speech, and voice agents.
Deepgram controls the full vertical stack from bare-metal training hardware to a Rust inference runtime, a cost and latency moat that API competitors riding hyperscaler infrastructure cannot replicate without years of capex.