World Labs

Roadmap & Position in Spatial AI

Generates navigable 3D worlds from text and images using large world models.

Company Overview

World Labs is a spatial intelligence company that trains large world models to generate, edit, and render photorealistic 3D environments from multimodal inputs. Customers span gaming, VFX, robotics simulation, and AR/VR, with integrations targeting Unity, Unreal, and Blender users.

What They're Building

The company's public product roadmap & what they're committed to building.

Marble

A multimodal world model for generating and editing 3D Gaussian Splat scenes from text, images, video, or 3D layouts.

World API

A developer API for programmable 3D world generation, accepting text, images, panoramas, and video as inputs.

Spark 2.0

An open-source 3D Gaussian Splatting renderer built on THREE.js, WebGL2, and Rust/WASM for browser-based visualization.

RTFM

A real-time generative world model research preview for interactive, frame-by-frame world generation on GPU.

Latest Intelligence

Zeitgeist tracks private signals to determine where the company is heading strategically.

Competitors

Luma AI:

Focuses on visual realism and video generation rather than persistent, navigable 3D worlds; more consumer-oriented with lighter infrastructure requirements.

NVIDIA Omniverse:

Enterprise digital twin platform with massive distribution but relies on explicit 3D authoring rather than generative AI for world creation. |

Google DeepMind (Genie):

Research-stage world models with access to enormous compute and data, but no standalone product or developer API, and constrained by Google's product prioritization.

World Labs

's Moat:

The co-founders created ImageNet and NeRF, giving them proprietary intuition about 3D data curation and neural rendering that translates directly into training-data advantages and architectural decisions competitors have to rediscover independently.

How They're Leveraging AI

AI Use Overview:

World Labs builds multimodal world models that generate editable 3D environments from text, images, video, panoramas, and layouts, then export Gaussian splats, meshes, or video, which is a substantively different output surface than text-to-image or text-to-video peers.

More Similar Companies

Bedrock Robotics

Retrofit autonomy kits that convert excavators and heavy equipment into operator-less machines.

Ex-Waymo trucking leadership attacking a labor-starved $13T construction market with reversible retrofits instead of new OEM machines, a faster path to revenue than highway autonomy.

Waabi

AI-first autonomy stack for driverless trucks and robotaxis, validated in generative simulation.

Simulation-first development with generative world models lets Waabi validate safety without the fleet burn that bankrupted earlier AV companies, arriving at driverless launch with less capital consumed.

Wayve

End-to-end deep learning software for autonomous driving, licensed to OEMs and fleets

OEM-licensable embodied AI with real production wins positions Wayve as the neutral AV software layer while Tesla and Waymo stay vertically integrated.

Asimov

Builds the training data supply chain for humanoid robots from real-world human movement.

Humanoid robots need millions of hours of real human movement data, not synthetic. Asimov pays workers to wear a phone on a headband during normal tasks, creating training data at a cost and diversity that motion capture suits and sim environments cannot match.