Vector Databases

A vector database isn’t a normal database. It’s a meaning-matching engine. You search for “dogs” and it returns documents about “canines” and “hounds” — because it understands what you mean, not just what you typed.

Vector databases are specialized AI infrastructure built to store, manage, and query high-dimensional vector embeddings efficiently at scale. Unlike traditional relational databases that rely on structured schemas and exact keyword matches, vector databases are engineered to process AI-generated data and perform semantic search.

What Are Embeddings?

Vector embeddings are numerical representations of unstructured data (text, images, audio) mapped into a high-dimensional mathematical space. These embeddings capture the underlying meaning, context, and relationships within the data. Similar concepts cluster together mathematically.

How Semantic Search Works

Instead of looking for exact word matches, vector databases perform similarity searches by calculating the mathematical distance between vectors using algorithms like cosine similarity or Euclidean distance.

Example: A search for “refund rules” matches a document labeled “cancellation and return policy” because their vector embeddings are close in semantic space.

The RAG Connection

Vector databases are critical infrastructure for RAG pipelines:

Large documents are chunked and converted into embeddings
Embeddings are stored in the vector database
User queries are also converted into embeddings
The database rapidly searches for the most semantically relevant chunks
Retrieved context is injected into the LLM’s prompt

Performance at Scale

Calculating exact similarity between a query and every entity in a massive database is computationally expensive (O(N) time complexity). To achieve real-time performance, vector databases use Approximate Nearest Neighbor (ANN) search algorithms like HNSW (Hierarchical Navigable Small World) or IVF (Inverted File Index).

ANN trades a microscopic amount of accuracy for massive gains in query speed, allowing the database to search millions of records in milliseconds.

Do You Need a Vector Database?

Not always.

Signal	Lightweight Store OK	Vector DB Likely Needed
Corpus size	≤ 50k chunks, ≤ 1 GB embeddings	≥ 200k chunks, multi-GB embeddings
Update frequency	Daily batch adds	Continuous upserts or deletes
Latency target	P95 ≤ 500 ms acceptable	P95 ≤ 150-250 ms required
Filters	Few metadata filters	Complex filters, multi-tenant scopes
Traffic	≤ 10 QPS peaks	≥ 50-100 QPS sustained

Popular Vector Databases

Database	Best For	Self-Hosted Option
Pinecone	Managed, fast startup	No
Qdrant	Self-hosted, flexible	Yes
Milvus	Large-scale, enterprise	Yes
Chroma	Lightweight, prototyping	Yes
Weaviate	Semantic, multimodal	Yes

The Failure-First Angle

Vector databases are invisible infrastructure — until they fail. When a RAG system produces wrong answers, the vector database is often the culprit: stale embeddings, wrong chunk size, or relevance thresholds set too low. But because the failure is upstream, teams blame the model instead.

The Cost Transparency Angle

Vector databases add infrastructure cost. At small scale (≤ 50k chunks), a SQLite database with embeddings is free. At large scale (≥ 200k chunks), a managed vector database adds $200-500/month. The cost is invisible until you need it.

RAG — Where vector databases are used
LLM Drift — When embeddings shift
Data Layer — Where vector databases live
Knowledge Base Decay — When embeddings become stale
Embeddings — The vector representations stored and searched here
Self-Hosted AI — Qdrant, Milvus, Chroma run on your own infrastructure
Ollama — Runs embedding models locally for air-gapped indexing pipelines

WyrdWerk Deployment Wiki

Explorer

Vector Databases

What Are Embeddings?

How Semantic Search Works

The RAG Connection

Performance at Scale

Do You Need a Vector Database?

Popular Vector Databases

The Failure-First Angle

The Cost Transparency Angle

Graph View

Table of Contents

Backlinks

WyrdWerk Deployment Wiki

Explorer

Vector Databases

What Are Embeddings?

How Semantic Search Works

The RAG Connection

Performance at Scale

Do You Need a Vector Database?

Popular Vector Databases

The Failure-First Angle

The Cost Transparency Angle

Related

Graph View

Table of Contents

Backlinks