How Is

Kita

Using AI?

Extracts decision-ready signals from messy borrower documents in emerging markets where OCR fails.

Using vision-language document parsing for unstructured formats, document fraud anomaly detection, and outcome-linked underwriting ML from repayment data.

Company Overview

Builds a document intelligence platform for emerging market lenders, using proprietary vision-language models to extract structured, fraud-checked, decision-ready signals from unstructured borrower documents like bank statements and payslips.

Product Roadmap & Public Announcements

Kita has publicly signaled geographic expansion from the Philippines and Indonesia into Mexico and underserved U.S. lending segments. They've detailed their vision-language model approach to document extraction and announced continuous learning engines that link document signals to actual repayment outcomes. Hyperlocalization of models to adapt to local document formats and underwriting drivers is a stated priority, along with deeper API integration into lender workflows.

Signals & Private Analysis

GitHub and hiring signals suggest investment in multi-modal ML engineers and applied research in vision-language architectures beyond standard OCR. Conference appearances and founder interviews hint at building a proprietary repayment-outcome feedback loop that creates a compounding data moat per lender. There are indicators of upcoming insurance and alternative credit scoring adjacencies, and likely partnerships with regional neobanks. The lean team size suggests a potential acqui-hire or Series A raise in 2026 to fund geographic scaling.

Kita

Machine Learning Use Cases

Vision-Language Document Parsing
For
Cost Reduction
Product

<p>Extracts structured, decision-ready data from noisy, unstructured borrower documents using proprietary vision-language models that understand both layout and semantic content.</p>

Layman's Explanation

It reads messy bank statements and payslips the way a human loan officer would, but in seconds and without coffee breaks.

Use Case Details

Kita's core ML capability is a proprietary vision-language model pipeline that ingests raw borrower documents—bank statements, payslips, utility bills, and government IDs—in dozens of local formats across Southeast Asian markets. Unlike traditional OCR systems that merely recognize characters, Kita's models jointly reason over visual layout (tables, stamps, logos, handwriting) and semantic content (transaction categories, income patterns, employer names) to produce structured JSON outputs ready for underwriting engines. The models are trained on a growing corpus of hyperlocalized document types, handling challenges like low-resolution scans, mixed languages (Tagalog, Bahasa, English), and non-standard formatting that defeat off-the-shelf solutions. This allows lenders to onboard borrowers who lack digital banking access, dramatically expanding their addressable market while slashing processing time from hours of manual review to seconds of automated extraction.

Analogy

It's like hiring a multilingual accountant with photographic memory who can read a crumpled receipt in a dark room and instantly file it correctly.

Document Fraud Anomaly Detection
For
Risk Reduction
IT-Security

<p>Detects fraudulent or tampered borrower documents at the extraction stage using ML-driven anomaly detection on visual and data-level signals.</p>

Layman's Explanation

It spots fake or doctored bank statements the way a forensic detective spots a forged painting—by catching tiny inconsistencies invisible to the naked eye.

Use Case Details

Kita integrates fraud detection directly into its document processing pipeline, analyzing both visual artifacts and data-level inconsistencies to flag suspicious borrower submissions. On the visual side, the system examines font consistency, pixel-level editing traces, metadata anomalies, and layout deviations from known genuine document templates for each institution and document type. On the data side, it cross-references extracted transaction patterns against statistical baselines—flagging impossible transaction sequences, round-number salary inflation, or inconsistent date formatting. Because Kita processes documents across many lenders in the same markets, it builds a growing fraud signature database that improves detection over time. This is especially critical in emerging markets where document fraud is prevalent and manual verification is costly and slow. By catching fraud before it enters underwriting, Kita saves lenders from both direct losses and the operational cost of downstream investigation and recovery.

Analogy

It's like a bouncer at a club who doesn't just check your ID—he knows what every real ID from every state looks like and can spot a fake lamination job from across the room.

Outcome-Linked Underwriting ML
For
Decision Quality
Data

<p>Continuously improves lender-specific underwriting models by linking extracted document signals to actual loan repayment outcomes, creating a compounding data moat.</p>

Layman's Explanation

It learns which details in a borrower's paperwork actually predict whether they'll pay back the loan—and gets smarter with every loan issued.

Use Case Details

Kita's most strategically differentiated ML capability is its custom learning engine, which closes the loop between document extraction and loan performance. After extracting hundreds of structured signals from borrower documents—income stability patterns, employer tenure, spending categories, savings behavior, utility payment consistency—the system tracks which signals correlate with actual repayment or default outcomes for each specific lender. Over time, this creates lender-specific predictive models that weight document features according to their real-world predictive power in that lender's unique borrower population and market context. This is fundamentally different from generic credit scoring: a spending pattern that predicts default in Manila may be benign in Jakarta, and Kita's hyperlocalized feedback loop captures these nuances automatically. The result is a compounding data moat—each loan originated through Kita's platform makes the next underwriting decision more accurate, creating switching costs and network effects that deepen with scale.

Analogy

It's like a chef who doesn't just follow a recipe but tastes every dish that comes back from the table, learning exactly which ingredients make customers come back for more—and adjusting the recipe for each restaurant individually.

Key Technical Team Members

  • Carmel Limcaoco, Co-Founder and CEO
  • Rhea Malhotra, Co-Founder and CTO

Kita's founders combine rare Stanford-trained vision-language model expertise with firsthand emerging market lending experience, enabling them to build hyperlocalized document AI that generic OCR or Western-trained models fundamentally cannot replicate.

Kita

Funding History

  • 2024-2025: Carmel Limcaoco and Rhea Malhotra co-found Kita while completing Stanford CS Masters
  • 2026: Accepted into Y Combinator W26 batch
  • 2026: ~$500K raised (YC standard deal)
  • 2026: Live in Philippines, expanding to Indonesia, Mexico, and US underserved segments

Kita

Competitors

  • Document AI / OCR Platforms: Hyperscience, Ocrolus, Docsumo (general document extraction). Emerging Market Lending Infra: Lendsqr, Presta (Africa-focused), Brankas (SEA open finance). Credit Scoring / Alternative Data: Nova Credit, Pngme, LenddoEFL. Bank Statement Analyzers: Perfios, FinBox (India-focused).
More

Companies
Get Every New ML Use Cases Directly to Your Inbox
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.