Enables AI voice agents to securely collect PCI-compliant payments during live phone calls.
Using real-time speech-to-payment NLP for card capture, voice fraud detection during transactions, and adaptive agentic conversation management.

|
Payments Infrastructure
|
YC W26

Last Updated:
March 19, 2026

Builds secure payments infrastructure for voice AI agents, enabling businesses to collect PCI-compliant payments during live AI-powered phone calls via a single developer API.
Maven has publicly announced support for major payment gateways (Stripe, Authorize.net, Adyen, Braintree, Shift4), customizable voice personas with multi-language support, SMS checkout link fallback, card tokenization for recurring/delayed payments, and smart retry logic for failed transactions. Their public documentation and YC profile emphasize becoming the "default infrastructure for human-to-agent payment touchpoints."
GitHub and hiring signals suggest investment in real-time speech-to-text and NLP model optimization, pointing toward lower-latency inference and multilingual expansion. Conference and demo activity hints at upcoming ACH and digital wallet support. The lack of public job postings suggests a tight, founder-led engineering team likely building toward a Series A raise in late 2026. Integration patterns with platforms like Retell AI and Vapi suggest a strategy of embedding Maven as the payment layer across the voice AI ecosystem rather than competing with agent builders directly. Early customer conversations appear concentrated in healthcare and travel verticals, signaling likely vertical-specific compliance features (HIPAA-adjacent, PCI Level 1).
<p>Real-time speech recognition and natural language processing pipeline that converts live caller speech into structured payment data (card numbers, expiration dates, CVVs) during voice agent calls, enabling seamless and accurate payment capture without human intervention.</p>
Maven's AI listens to what you say on a phone call with a voice agent and instantly, accurately turns your spoken credit card number into a real payment—like a cashier with perfect hearing who never asks you to repeat yourself.
Maven's engineering team deploys a tightly integrated speech-to-text (STT) and natural language understanding (NLU) pipeline optimized specifically for payment data extraction during live voice agent calls. Unlike general-purpose transcription, their models are fine-tuned to recognize digit sequences, expiration date formats, and CVV codes spoken in varied accents, speeds, and noisy environments. The NLU layer parses contextual cues to distinguish between card numbers, zip codes, and other numeric data spoken in the same conversation. Smart retry logic detects when digits are missed or ambiguous and triggers natural re-prompts through the voice agent. The system also supports multilingual input, allowing callers to provide payment details in their preferred language. All inference runs in real time with strict latency budgets to maintain conversational flow, and card data is tokenized immediately upon capture so raw payment information never persists in memory or logs, ensuring PCI compliance at the model layer.
It's like having a court stenographer who specializes exclusively in credit card numbers—impossibly fast, never mishears a digit, and shreds the transcript the instant the payment goes through.
<p>Machine learning-based fraud detection system that analyzes transaction patterns, caller behavioral signals, and payment metadata in real time during voice agent calls to flag and block suspicious payment attempts before they are processed.</p>
Maven's AI acts like a silent security guard on every call, watching for suspicious patterns in how someone speaks and pays to catch fraud before the charge ever goes through.
Maven integrates real-time fraud detection models directly into its payment processing pipeline, analyzing multiple signal layers simultaneously during live voice agent calls. At the transaction level, models evaluate card velocity, geographic anomalies, BIN-level risk scores, and transaction amount patterns against historical baselines. At the behavioral level, the system analyzes caller interaction patterns—such as hesitation timing, re-entry frequency, and conversational anomalies—that may indicate social engineering, card testing, or stolen card usage. These signals are fused into a composite risk score computed within the latency budget of the live call, enabling the voice agent to either proceed, request additional verification, or gracefully decline the transaction. The models are continuously retrained on anonymized transaction outcome data to adapt to evolving fraud vectors. Because Maven sits at the infrastructure layer across multiple customers and verticals, their fraud models benefit from a network effect—patterns detected in one deployment improve detection across the entire platform.
It's like a bouncer who doesn't just check your ID at the door but also notices you're sweating, glancing around nervously, and trying to pay with five different cards in two minutes.
<p>Context-aware agentic AI orchestration layer that dynamically manages multi-turn payment conversations, adapting voice persona, language, script flow, and escalation logic in real time based on caller intent, sentiment, and transaction state.</p>
Maven's AI reads the room during a payment call—if you're confused it slows down and explains, if you're in a hurry it speeds up, and if something goes wrong it smoothly finds another way to get you paid up.
Maven's product team has built an agentic AI orchestration layer that goes beyond simple scripted IVR flows to create truly adaptive payment conversations. The system maintains a rich conversation state graph that tracks caller intent, emotional sentiment, transaction progress, and error history across multiple turns. Using this context, the orchestration engine dynamically selects the optimal next action—whether that's adjusting the voice persona's tone and pacing, switching languages mid-call, simplifying instructions for a confused caller, or escalating to an SMS checkout link when voice capture repeatedly fails. The underlying models combine transformer-based intent classification with reinforcement learning–trained dialogue policies that optimize for payment completion while maintaining caller satisfaction. The orchestration layer also integrates knowledge sources (product catalogs, billing records, customer history) to personalize the conversation contextually. For example, if a returning caller previously failed a voice payment, the agent might proactively offer the SMS fallback. This adaptive behavior is configured per-customer through Maven's API, allowing businesses to define brand-aligned guardrails while the ML models handle real-time optimization within those constraints.
It's like a really good waiter who remembers you hate olives, notices you're in a rush tonight, and brings the check before you even ask—except this waiter is collecting your credit card over the phone.
Maven occupies a unique niche at the intersection of voice AI and payments compliance,two domains that are individually complex and rarely combined. By abstracting PCI-compliant payment collection into a single API call for voice agents, they eliminate a critical blocker for enterprises deploying conversational AI at scale, creating a defensible infrastructure layer that voice agent platforms depend on rather than compete with.