Product LaunchAlpha ReleaseVector DatabaseRAG

Introducing DotVectorDB: Serverless Vector Database for AI Applications

Build semantic search and RAG applications with distributed blobs, 100+ AI models, and intelligent caching. Now in alpha.

October 30, 2025•8 min read•View Docs

Why We Built DotVectorDB

Every AI application needs two things: powerful models and relevant context. While we already provide access to 100+ AI models through our gateway, we saw developers struggling with the context part—building semantic search, managing embeddings, and implementing RAG (Retrieval Augmented Generation).

Existing vector databases are either too complex, too expensive, or require managing infrastructure. We wanted something different: a serverless vector database that just works.

DotVectorDB is our answer—a high-performance vector database with distributed blobs, intelligent caching, and seamless integration with our AI gateway.

What Makes It Different

15-40x Faster with Caching

Queries are automatically cached with intelligent invalidation. Same cost, dramatically faster responses. Perfect for production RAG applications.

First query: ~50ms

Cached query: ~2ms ⚡

Truly Serverless

No infrastructure to manage, no databases to configure. Just create a blob and start inserting vectors. Scale from zero to millions of vectors instantly.

Multi-Tenant by Default

Complete data isolation between tenants. Perfect for SaaS applications where each customer needs their own vector space. Secure by design.

Integrated with 100+ AI Models

Seamlessly combine vector search with our AI gateway. Build complete RAG applications with GPT-4, Claude, Gemini, and more through a single platform.

DotVectorDB vs Turbopuffer

Both are serverless, but DotVectorDB is built for knowledge graphs. While Turbopuffer does vector search, DotVectorDB supports KGE (Knowledge Graph Embeddings) for relationship traversal and complex queries.

Feature	DotVectorDB	Turbopuffer
Knowledge Graph Support	KGE (entities, relations, triples)	Vector-only
Relationship Traversal	Multi-hop queries, graph traversal	Not supported
Pricing (FREE tier)	50K vectors, 1M queries	No free tier
Query Caching	15-40x faster, automatic	Not available
AI Models Included	100+ models (GPT-4, Claude, Gemini)	Vector search only
Full-Text Search	Coming soon	Available
Best For	Knowledge graphs + RAG with relationships	Pure vector similarity search

Choose DotVectorDB if: You need knowledge graph support with relationship traversal, or you're building RAG apps that require understanding connections between entities.

Choose Turbopuffer if: You only need vector similarity search with full-text and prefer pay-as-you-go pricing.

Get Started in Minutes

Here's how simple it is to build a RAG application:

1. Create a Tenant & Blob

curl -X POST https://api.infraxa.ai/api/vector/tenants \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{"tenant_id": "my-app"}'

curl -X POST https://api.infraxa.ai/api/vector/tenants/my-app/blobs \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "blob_id": "docs",
    "blob_type": "dense",
    "dimension": 768
  }'

2. Insert Vectors

// Generate embeddings with Gemini
const embedding = await generateEmbedding(text);

// Insert into DotVectorDB
await fetch('https://api.infraxa.ai/api/vector/tenants/my-app/blobs/docs/vectors', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    vectors: [embedding],
    attributes: [{ text, title, url }],
  }),
});

3. Search & Generate

// Search for context
const results = await searchVectors(query);
const context = results.attributes.map(a => a.text).join('\n\n');

// Generate AI response with context
const response = await streamText({
  model: infraxa("z-ai/glm-4.5-flash"), // FREE!
  messages: [{ role: "user", content: query }],
  system: `Use this context:\n\n${context}`,
});

Get Started View Docs

Build AI Agents That Can Pay for Data

DotVectorDB + x402 enables truly autonomous AI agents. Your agents can search premium knowledge bases, access paid APIs, and retrieve specialized data—all without human intervention.

How It Works

Agent Needs Data

Your AI agent encounters a task requiring premium knowledge

Discovers Vector DB

Agent finds DotVectorDB with relevant data via MCP tools

Pays Automatically

x402 handles payment (2s settlement) - no human approval needed

✓

Gets Results

Agent receives premium data and completes its task

Why Agents Need x402

Autonomous payments - Agents pay for premium data automatically
Access premium knowledge - Search specialized vector databases that require payment
2-second settlement - No waiting, instant access to data
MCP integration - Works with Claude Desktop, Cline, and other MCP clients

Example: Agent Searching Premium Data

import { Agent } from "agents";
import { withX402Client } from "agents/x402";

export class ResearchAgent extends Agent {
  async searchPremiumKnowledge(query: string) {
    // Build x402 client with agent's wallet
    const x402Client = withX402Client(
      this.mcpClient,
      { network: "base", account: this.account }
    );

    // Agent automatically pays to access premium vector DB
    const results = await x402Client.callTool(
      null, // Auto-approve payments
      {
        name: "search_vectors",
        arguments: { 
          query,
          database: "premium-research-papers"
        }
      }
    );

    // Agent now has access to premium research data
    return results;
  }
}

Use It Internally Too!

Not building a marketplace? Use DotVectorDB + x402 for your own internal agents:

🤖 Internal Knowledge Base

Store company docs, code, and data. Your agents search it autonomously.

💰 Cost Tracking

Track which agents use what data. Perfect for internal billing.

🔐 Access Control

Different agents, different wallets, different permissions.

⚡ No Rate Limits

Agents pay per query. No API keys, no rate limiting headaches.

External Use Cases: Research agents accessing academic papers, trading bots buying market data, customer service agents querying proprietary knowledge bases, code assistants accessing premium documentation.

Learn About x402