How Is

CellType

Using AI?

Simulates human biology with AI to predict drug effects and run virtual trials.

Using virtual biology simulation via Cell2Sentence foundation models, agentic multi-omics reasoning across 30+ biomedical APIs, and patient stratification modeling.

Company Overview

Builds an agentic drug discovery platform that simulates human biology using foundation models (Cell2Sentence) and autonomous AI agents to predict drug effects, identify targets, and run virtual clinical trials, replacing traditional preclinical models.

Product Roadmap & Public Announcements

Open-source celltype-cli (MIT license) powered by Claude for autonomous drug discovery workflows. Integration with 30+ biomedical APIs (PubMed, ChEMBL, UniProt). Cell2Sentence presented at ICML. Enterprise on-premise deployment and GPU-accelerated models planned.

Signals & Private Analysis

Rapid iteration on multi-dataset integration. Expansion to new therapeutic areas beyond oncology. 'All deals are inbound from top 10 pharma' hints at undisclosed pilots. Land-and-expand: give away research agent, monetize virtual human simulation engine. Likely Series A given pharma traction.

CellType

Machine Learning Use Cases

Virtual biology simulation
For
Cost Reduction
Product

<p>Simulates drug effects across human cell types and tissues in silico to predict efficacy and toxicity before any wet-lab experiment.</p>

Layman's Explanation

Instead of testing drugs on mice and hoping they work in humans, CellType builds a digital twin of human biology and tests drugs on that first.

Use Case Details

CellType's Virtual Human platform uses the Cell2Sentence foundation model to construct a comprehensive computational representation of human biology at single-cell resolution. By converting gene expression profiles into LLM-compatible token sequences, the model learns cellular states, tissue microenvironments, and drug-cell interaction dynamics from massive single-cell RNA-seq datasets. When a candidate compound is introduced virtually, the system predicts downstream transcriptomic changes across multiple cell types simultaneously, flagging potential efficacy signals in target tissues and toxicity risks in off-target organs. This whole-system approach addresses the fundamental limitation of traditional preclinical models—species-specific biology—by grounding every prediction in actual human data. The platform has demonstrated validated predictions of novel drug hits in immuno-oncology, with results confirmed through in vitro and ex vivo experiments. Pharma partners use the platform to de-risk pipeline decisions before committing to expensive animal studies and Phase I trials.

Analogy

It's like crash-testing a car in a hyper-realistic video game before ever bending real metal—except the car is a drug and the game is a perfect simulation of your body.

Agentic multi-omics reasoning
For
Decision Quality
Engineering

<p>Deploys autonomous AI agents that identify and validate novel drug targets by reasoning across multi-omics data and biomedical literature.</p>

Layman's Explanation

CellType's AI agents act like tireless PhD researchers that read every paper, analyze every dataset, and propose the best drug targets—all before your morning coffee.

Use Case Details

CellType's celltype-cli orchestrates autonomous research agents powered by Claude that execute end-to-end target discovery workflows without manual intervention. An agent receives a high-level therapeutic hypothesis (e.g., "find novel immunotherapy targets in triple-negative breast cancer"), then autonomously queries DepMap dependency screens, PRISM drug sensitivity data, L1000 perturbation signatures, and proteomic datasets to identify genes essential for tumor survival but dispensable in normal tissues. Simultaneously, the agent mines PubMed, ChEMBL, and UniProt via API to assess druggability, existing IP landscape, and clinical precedent. It performs pathway enrichment analysis, builds interaction networks, and ranks candidates using a multi-criteria scoring framework that balances novelty, tractability, and safety. The entire pipeline—from hypothesis to prioritized target list with supporting evidence—runs in hours rather than the weeks or months typical of manual bioinformatics. Human scientists review the agent's reasoning chain and evidence at each decision node, maintaining scientific rigor while dramatically accelerating throughput. This approach has enabled CellType's pharma partners to populate early-stage pipelines with higher-confidence targets at unprecedented speed.

Analogy

It's like having a research assistant who has memorized every biology paper ever written, can run every analysis tool simultaneously, and never needs sleep—except it actually exists and works before lunch.

Patient stratification modeling
For
Risk Reduction
Strategy

<p>Uses foundation models to stratify virtual patient populations and predict clinical trial outcomes, enabling optimized trial design before recruiting a single patient.</p>

Layman's Explanation

CellType figures out which patients a drug will actually work for before the trial even starts, so pharma companies stop wasting billions testing drugs on the wrong people.

Use Case Details

CellType's virtual clinical trial capability leverages Cell2Sentence to model patient heterogeneity at single-cell resolution, creating in silico patient cohorts that reflect the biological diversity of real-world populations. By training on large-scale single-cell atlases spanning healthy and diseased tissues across diverse demographics, the model learns how genetic background, disease subtype, and microenvironment composition influence drug response. When a pharma partner inputs a candidate compound and target indication, the platform simulates treatment across thousands of virtual patients, predicting responders vs. non-responders based on transcriptomic signatures, pathway activation states, and immune contexture. The system outputs recommended biomarker-driven inclusion/exclusion criteria, optimal dosing strategies for different patient subgroups, and predicted effect sizes—enabling sponsors to design smaller, more targeted trials with higher probability of success. This addresses one of the most expensive failure modes in drug development: Phase II and III trials that fail not because the drug doesn't work, but because the trial enrolled the wrong patient population. By front-loading patient stratification computationally, CellType helps partners avoid costly late-stage failures and accelerate time-to-approval for precision therapies.

Analogy

It's like a dating app for drugs and patients—instead of hoping for a random match, the AI figures out exactly who's compatible before anyone commits to a very expensive first date.

Key Technical Team Members

  • David van Dijk, CEO & Founder
  • Ivan Vrkic, Co-founder

Founders invented Cell2Sentence at Yale, uniquely enabling LLMs to natively reason about cellular biology as language. First-mover in applying transformer architectures to whole-system drug discovery. 11,000+ citations and ICML/Cell/Nature publications.

CellType

Funding History

  • 2024: Cell2Sentence developed at Yale
  • 2025: CellType founded, Cell2Sentence presented at ICML
  • 2025: celltype-cli open-sourced (MIT)
  • 2026: Y Combinator W26 batch
  • 2026: Inbound top 10 pharma partnerships

CellType

Competitors

  • Protein-Centric AI: Isomorphic Labs, Recursion, Insilico Medicine
  • Foundation Model Bio: Genentech (scGPT), Microsoft (BioGPT), Owkin
  • Virtual Biology: Cellarity, Relation Therapeutics
  • Agentic Drug Discovery: FutureHouse (Robin), Emerald Cloud Lab
More

Companies
Get Every New ML Use Cases Directly to Your Inbox
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.