Focuses on visual realism and video generation rather than persistent, navigable 3D worlds; more consumer-oriented with lighter infrastructure requirements.
Enterprise digital twin platform with massive distribution but relies on explicit 3D authoring rather than generative AI for world creation.
Research-stage world models with access to enormous compute and data, but no standalone product or developer API, and constrained by Google's product prioritization.
The co-founders created ImageNet and NeRF, giving them proprietary intuition about 3D data curation and neural rendering that translates directly into training-data advantages and architectural decisions competitors have to rediscover independently.
World Labs builds multimodal world models that generate editable 3D environments from text, images, video, panoramas, and layouts, then export Gaussian splats, meshes, or video. That is a substantively different output surface from that of text-to-image or text-to-video peers.
Retrofit autonomy kits that convert excavators and heavy equipment into operator-less machines.
Ex-Waymo trucking leadership attacking a labor-starved $13T construction market with reversible retrofits instead of new OEM machines, a faster path to revenue than highway autonomy.
AI-first autonomy stack for driverless trucks and robotaxis, validated in generative simulation.
Simulation-first development with generative world models lets Waabi validate safety without the fleet burn that bankrupted earlier AV companies, arriving at driverless launch with less capital consumed.
End-to-end deep learning software for autonomous driving, licensed to OEMs and fleets.
OEM-licensable embodied AI with real production wins positions Wayve as the neutral AV software layer while Tesla and Waymo stay vertically integrated.
Builds the training data supply chain for humanoid robots from real-world human movement.
Humanoid robots need millions of hours of real, not synthetic, human movement data. Asimov pays workers to wear a phone on a headband during normal tasks, creating training data at a cost and diversity that motion capture suits and sim environments cannot match.