Discover what the world's best startups are building before it's announced publicly.
Model Evaluation
Optimization
Personalization
Recommendation Systems
Reinforcement Learning
Zapier requires you to define the automation. Cofia watches your system events and anonymized network traffic, then proactively creates automations you did not ask for. Zero-prompt automation is a different product category from workflow builders.
Data Extraction
Computer Vision
Model Evaluation
Prediction
Optimization
Trading card mystery packs are a $5B+ market built on trust that the odds are fair. CatchBack open-sources its card selection code with cryptographic proofs, letting creators build packs that buyers can verify. $13K MRR with 700 users and a Steph Curry collaboration launching.
RAG
Data Extraction
Speech AI
Model Evaluation
Prediction
Most AI agent platforms give one agent to a whole team. Clice gives each team member their own agent that coordinates with teammates' agents, turning scheduling, handoffs, and ticket updates into agent-to-agent communication rather than human overhead.
Computer Vision
Model Evaluation
Prediction
Optimization
Personalization
Companies waste an estimated $240B annually on GPU compute. Chamber's founder built Amazon's GPU orchestration project and AWS CloudWatch Application Signals, so he has seen the problem from inside the largest cloud provider and knows exactly where the waste hides.
Data Extraction
Computer Vision
Speech AI
Model Evaluation
Prediction
The founder taught game development to 22M+ people on YouTube and built two game engines from scratch. CodeWisp lets anyone describe a game in plain English and publish it instantly, with 4,100+ developers already on the platform.
RAG
Data Extraction
Speech AI
Model Evaluation
Prediction
Sales reps on technical products freeze when a prospect asks a question they cannot answer. Caretta joins the call live, pulls from the company's entire knowledge base, and surfaces the answer in real time. The difference between winning and losing a deal is often one unanswered question.
RAG
Data Extraction
Computer Vision
Model Evaluation
Optimization
RAG accuracy plateaus around 80% for most implementations. Captain claims 95%+ by running parallel LLM queries across document chunks and aggregating results, which is a brute-force approach that works if the orchestration is fast enough. SOC 2 certified.
Model Evaluation
Optimization
Anomaly Detection
Workflow Agent
AI Infrastructure
Most AI agent teams monitor in production and fine-tune separately. Carrot Labs closes the loop: it evaluates agent performance against business metrics and selectively retrains, which means the agent improves without a human deciding what to fix.
RAG
Data Extraction
Computer Vision
Speech AI
Model Evaluation
Professional video editing still requires a $2,000 NLE and years of training. Cardboard runs in the browser, takes natural language commands, and exports to Premiere and DaVinci. Highest-upvoted HN launch in YC W26 suggests the developer community is paying attention.
RAG
Data Extraction
Model Evaluation
Prediction
Simulation
Red teaming and guardrails exist as separate tools. Cascade combines them into one platform with adaptive scaffolding that learns from production runs, already deployed across legal reasoning and customer support agents. The CEO researched graph reasoning and agentic safety at UC Berkeley's BAIR Lab.
Data Extraction
Model Evaluation
Personalization
Workflow Agent
Natural Language Processing
Outbound sales teams stitch together 7+ tools (Apollo, Instantly, Clay, LinkedIn, etc.) and lose signal between them. Cardinal consolidates the stack into one workflow and already runs outbound for 40+ YC companies, which is strong proof from a demanding buyer set.
No Clear ML Differentiation
RAG
Data Extraction
Computer Vision
Speech AI
Model Evaluation
QA engineers are expensive, slow, and first to get cut. Canary reads your code diffs, understands the intent behind each change, and generates targeted Playwright tests per PR. Teams are hitting 90%+ coverage in days instead of weeks.
Data Extraction
Model Evaluation
Synthetic Data
Workflow Agent
Natural Language Processing
LLMs hallucinate. Lean proves things. Cajal pairs LLMs with formal verification so every mathematical result is machine-checked, starting with quantum computing and finance where a wrong proof costs real money.
Data Extraction
Speech AI
Model Evaluation
Prediction
Optimization
Humane and Rabbit tried always-on AI wearables and both failed on privacy and form factor. Button only listens when physically pressed, which is a better privacy model, and two ex-Apple Vision Pro engineers know how to ship hardware that people actually wear.