Autonomous AI that triages and fixes production incidents with contextual pull requests.
Using intelligent alert clustering and deduplication, multimodal root cause analysis across tools, and autonomous code remediation with governance.

|
Incident Management
|
YC W26

Last Updated:
March 19, 2026

Builds an autonomous AI engineer that triages, investigates, and fixes production software incidents by integrating with monitoring tools like Sentry and Datadog, generating contextual pull requests, and continuously learning from each resolved issue.
Sonarly has publicly announced integrations with Sentry, Datadog, Grafana, Slack, Discord, and GitHub/GitLab. They've detailed autonomous alert triage and deduplication, AI-generated production-aware pull requests with contextual evidence, and a continuous learning system that updates its internal representation of the codebase after each incident. Their public messaging emphasizes a long-term vision of "software that improves itself."
Their tech stack references to Claude Code and Opus 4.6 suggest deep partnership or early access with Anthropic's most advanced coding models, signaling cutting-edge agentic AI capabilities beyond typical GPT wrappers. The emphasis on a deterministic governance and verification layer for AI-generated code changes indicates they're building enterprise-grade safety rails,a likely prerequisite for landing larger customers. The tiny team size (2 people) combined with YC backing suggests they're in intense product-market fit validation mode, likely running design partners with YC-network startups. GitHub activity and product language hint at expansion toward predictive incident prevention (not just reactive fixes) and deeper infrastructure-level integrations (cloud, CI/CD pipelines). Conference and community signals suggest potential hybrid human+AI escalation workflows for complex incidents.
<p>AI-powered system that automatically groups, deduplicates, and prioritizes production alerts to eliminate noise and surface only actionable incidents.</p>
It's like having a brilliant intern who reads every single alarm in your building, figures out which ones are real fires versus burnt toast, and only wakes you up when the building is actually on fire.
Sonarly's alert triage engine ingests raw alert streams from monitoring tools like Sentry, Datadog, and Grafana, then applies ML-based clustering and NLP analysis to group duplicate or related alerts into unified incident threads. The system uses learned patterns from historical incident data and real-time contextual signals (stack traces, error messages, deployment timestamps, service dependency graphs) to assign severity scores and filter out false positives. This dramatically reduces alert fatigue for on-call engineers, ensuring they focus only on genuine, high-impact production issues. The continuous learning loop means the system gets smarter with each incident, adapting to the unique noise patterns and failure modes of each customer's infrastructure. This transforms the traditional "wall of alerts" into a curated, prioritized incident queue that accelerates response times and reduces burnout.
It's the spam filter for your production alerts—except instead of catching Nigerian prince emails, it catches the 47 duplicate Sentry errors that all mean the same database connection is down.
<p>Autonomous AI agent that investigates production incidents by correlating logs, traces, metrics, code changes, and user feedback to identify the precise root cause.</p>
Instead of five engineers spending two hours in a war room staring at dashboards, an AI detective instantly cross-references every clue—logs, code changes, metrics, and user complaints—to tell you exactly what broke and why.
Sonarly's root cause analysis engine operates as an autonomous investigative agent that fuses data from multiple observability sources—application logs, distributed traces, infrastructure metrics, recent git commits, and even unstructured user feedback from Slack and Discord. Using LLM-powered reasoning chains, the agent constructs a causal narrative by correlating temporal signals (e.g., a spike in error rates immediately following a specific deployment), structural signals (e.g., which services share dependencies), and semantic signals (e.g., matching error message patterns to known failure modes). The system generates a detailed incident report with a confidence-scored root cause hypothesis, linked evidence artifacts, and a suggested remediation path. This replaces the traditional manual "swivel-chair" debugging process where engineers must mentally synthesize information across dozens of tools and dashboards. Over time, the agent builds an evolving knowledge graph of the system's architecture and failure patterns, enabling faster and more accurate diagnoses for recurring or similar issues.
It's like having a medical diagnostician who can simultaneously read your blood work, MRI, patient history, and WebMD reviews—and actually get the diagnosis right on the first try.
<p>AI coding agent that autonomously generates production-aware pull requests to fix identified bugs, complete with contextual evidence and deterministic safety verification before submission.</p>
Instead of just telling you something is broken, the AI actually writes the fix, shows its homework, and waits for your approval before pushing it live—like a mechanic who diagnoses the problem, orders the part, and installs it while you're still on hold with the dealership.
Sonarly's most novel capability is its autonomous code fix generation pipeline. Once the root cause analysis agent identifies the source of a production issue, a specialized coding agent—powered by frontier LLMs like Claude Code and Opus 4.6—generates a targeted code patch. Critically, this isn't a naive "autocomplete" fix: the agent has full context of the incident (logs, traces, root cause hypothesis, affected services) and the codebase (via GitHub/GitLab integration with granular repository access). The generated pull request includes not just the code change but also a detailed explanation linking the fix to the specific incident evidence, making code review efficient and transparent. Before submission, the fix passes through a deterministic governance and verification layer—automated tests, static analysis, and safety checks—that acts as a quality gate to prevent AI-introduced regressions. This hybrid approach (generative AI + deterministic verification) is Sonarly's key architectural differentiator, addressing the trust gap that prevents most engineering teams from letting AI agents touch production code. The result is a system that can resolve many common production issues end-to-end without human intervention, while maintaining the safety and auditability standards required by enterprise engineering organizations.
It's like autocorrect for your production code—except it actually understands grammar, checks with an editor, and only hits send after a lawyer reviews it.
Sonarly combines deep observability tool integration with autonomous coding agents powered by frontier LLMs (Claude Code/Opus 4.6) and a deterministic governance layer, enabling them to not just detect but actually fix production issues,a capability gap most monitoring and alerting tools leave entirely to humans.