Weaviate

Roadmap & Position in Vector Database

Open-source vector database for AI search, RAG, and agents.

Company Overview

Weaviate is an open-source AI database that stores, searches, and retrieves vectorized data for RAG and agent workflows. It serves developers and enterprises building AI search, support automation, knowledge tools, and personalization.

What They're Building

The company's public product roadmap & what they're committed to building.

Query Agent GA

Weaviate made Query Agent generally available with search mode, multi-turn context, user-defined filters, multi-tenant support, streaming, and a TypeScript client.

Disk-Based Vector Scale

Version 1.36 introduced HFresh preview, a disk-based vector index for larger datasets where memory cost becomes the scaling limit.

Managed Agent Memory

Engram extends Weaviate into persistent memory for agents through asynchronous extraction, reconciliation, and storage.

MCP Server Preview

Version 1.37 added a built-in MCP Server preview so LLMs and AI coding tools can interact with Weaviate directly.

Database Hardening

Recent releases moved server-side batching, object TTL, async replication improvements, and backup restoration cancellation to general availability.

Latest Intelligence

Zeitgeist tracks private signals to determine where the company is heading strategically.

Competitors

Pinecone

Managed vector database competitor, generally more cloud-first while Weaviate keeps a strong open-source and self-hosted position.

Qdrant

Open-source vector database competitor with a performance-oriented developer following.

Milvus / Zilliz

Vector database platform often used for large-scale similarity search workloads.

Chroma

Developer-friendly vector database often used in early RAG application prototypes.

pgvector

Postgres extension that competes when teams prefer adding vector search to an existing database stack.

Weaviate

's Moat:

Technical infrastructure is the near-term moat: open-source adoption, production database depth, and managed agent services can create switching costs if workloads move into Weaviate Cloud.

How They're Leveraging AI

AI Use Overview:

Weaviate uses vector search, hybrid retrieval, managed embeddings, and agent services to make enterprise data searchable and usable in AI applications.

More
Data Infrastructure and Analytics

Byteport

Makes massive file transfers 10x faster so teams stop deleting data they can't afford to move.

Robotics teams delete 96% of their sensor data because they cannot move it fast enough. Byteport's DART protocol achieves 1500x faster transfer than TCP for large files, which turns a data bottleneck into a data asset for any team that generates more than it can ship.

Captain

Delivers 95%+ accurate knowledge search across unstructured enterprise data, beating standard RAG.

RAG accuracy plateaus around 80% for most implementations. Captain claims 95%+ by running parallel LLM queries across document chunks and aggregating results, which is a brute-force approach that works if the orchestration is fast enough. SOC 2 certified.

EigenPal

Automates enterprise document workflows with 93% straight-through processing from just 3-5 samples.

Most document AI requires hundreds of labeled examples. EigenPal reaches 93% straight-through automation from 3-5 samples, which means regulated enterprises (banks, insurers) can deploy on new document types in hours instead of months.

Human Archive

Captures 8,000 hours/day of multimodal human activity data to train the next generation of robots.

Robotics foundation models are data-starved. Human Archive has 50,000+ contributors wearing custom sensor rigs across homes, restaurants, hotels, and construction sites, capturing 8,000 hours/day of synchronized video, depth, and tactile data. Scale AI for embodied AI.