Product LaunchAlpha ReleaseVector DatabaseRAG

Introducing DotVectorDB: Serverless Vector Database for AI Applications

Build semantic search and RAG applications with distributed blobs, 100+ AI models, and intelligent caching. Now in alpha.

October 30, 20258 min readView Docs

Why We Built DotVectorDB

Every AI application needs two things: powerful models and relevant context. While we already provide access to 100+ AI models through our gateway, we saw developers struggling with the context part—building semantic search, managing embeddings, and implementing RAG (Retrieval Augmented Generation).

Existing vector databases are either too complex, too expensive, or require managing infrastructure. We wanted something different: a serverless vector database that just works.

DotVectorDB is our answer—a high-performance vector database with distributed blobs, intelligent caching, and seamless integration with our AI gateway.

What Makes It Different

15-40x Faster with Caching

Queries are automatically cached with intelligent invalidation. Same cost, dramatically faster responses. Perfect for production RAG applications.

First query: ~50ms
Cached query: ~2ms ⚡
Truly Serverless

No infrastructure to manage, no databases to configure. Just create a blob and start inserting vectors. Scale from zero to millions of vectors instantly.

Multi-Tenant by Default

Complete data isolation between tenants. Perfect for SaaS applications where each customer needs their own vector space. Secure by design.

Integrated with 100+ AI Models

Seamlessly combine vector search with our AI gateway. Build complete RAG applications with GPT-4, Claude, Gemini, and more through a single platform.

DotVectorDB vs Turbopuffer

Both are serverless, but DotVectorDB is built for knowledge graphs. While Turbopuffer does vector search, DotVectorDB supports KGE (Knowledge Graph Embeddings) for relationship traversal and complex queries.

FeatureDotVectorDBTurbopuffer
Knowledge Graph Support
KGE (entities, relations, triples)
Vector-only
Relationship Traversal
Multi-hop queries, graph traversal
Not supported
Pricing (FREE tier)
50K vectors, 1M queries
No free tier
Query Caching
15-40x faster, automatic
Not available
AI Models Included
100+ models (GPT-4, Claude, Gemini)
Vector search only
Full-Text SearchComing soon
Available
Best ForKnowledge graphs + RAG with relationshipsPure vector similarity search

Choose DotVectorDB if: You need knowledge graph support with relationship traversal, or you're building RAG apps that require understanding connections between entities.

Choose Turbopuffer if: You only need vector similarity search with full-text and prefer pay-as-you-go pricing.

Get Started in Minutes

Here's how simple it is to build a RAG application:

1. Create a Tenant & Blob
curl -X POST https://api.infraxa.ai/api/vector/tenants \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{"tenant_id": "my-app"}'

curl -X POST https://api.infraxa.ai/api/vector/tenants/my-app/blobs \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "blob_id": "docs",
    "blob_type": "dense",
    "dimension": 768
  }'
2. Insert Vectors
// Generate embeddings with Gemini
const embedding = await generateEmbedding(text);

// Insert into DotVectorDB
await fetch('https://api.infraxa.ai/api/vector/tenants/my-app/blobs/docs/vectors', {
  method: 'POST',
  headers: {
    'Authorization': `Bearer ${API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    vectors: [embedding],
    attributes: [{ text, title, url }],
  }),
});
3. Search & Generate
// Search for context
const results = await searchVectors(query);
const context = results.attributes.map(a => a.text).join('\n\n');

// Generate AI response with context
const response = await streamText({
  model: infraxa("z-ai/glm-4.5-flash"), // FREE!
  messages: [{ role: "user", content: query }],
  system: `Use this context:\n\n${context}`,
});

Build AI Agents That Can Pay for Data

DotVectorDB + x402 enables truly autonomous AI agents. Your agents can search premium knowledge bases, access paid APIs, and retrieve specialized data—all without human intervention.

How It Works
1

Agent Needs Data

Your AI agent encounters a task requiring premium knowledge

2

Discovers Vector DB

Agent finds DotVectorDB with relevant data via MCP tools

3

Pays Automatically

x402 handles payment (2s settlement) - no human approval needed

Gets Results

Agent receives premium data and completes its task

Why Agents Need x402
  • Autonomous payments - Agents pay for premium data automatically
  • Access premium knowledge - Search specialized vector databases that require payment
  • 2-second settlement - No waiting, instant access to data
  • MCP integration - Works with Claude Desktop, Cline, and other MCP clients
Example: Agent Searching Premium Data
import { Agent } from "agents";
import { withX402Client } from "agents/x402";

export class ResearchAgent extends Agent {
  async searchPremiumKnowledge(query: string) {
    // Build x402 client with agent's wallet
    const x402Client = withX402Client(
      this.mcpClient,
      { network: "base", account: this.account }
    );

    // Agent automatically pays to access premium vector DB
    const results = await x402Client.callTool(
      null, // Auto-approve payments
      {
        name: "search_vectors",
        arguments: { 
          query,
          database: "premium-research-papers"
        }
      }
    );

    // Agent now has access to premium research data
    return results;
  }
}
Use It Internally Too!

Not building a marketplace? Use DotVectorDB + x402 for your own internal agents:

🤖 Internal Knowledge Base

Store company docs, code, and data. Your agents search it autonomously.

💰 Cost Tracking

Track which agents use what data. Perfect for internal billing.

🔐 Access Control

Different agents, different wallets, different permissions.

⚡ No Rate Limits

Agents pay per query. No API keys, no rate limiting headaches.

External Use Cases: Research agents accessing academic papers, trading bots buying market data, customer service agents querying proprietary knowledge bases, code assistants accessing premium documentation.

Learn About x402

Transparent Pricing

Start free, scale as you grow. No hidden fees, no surprises.

FREE
$0/month
  • 50K vectors
  • 1M queries/month
  • 3 blobs
  • 2KB attributes
STARTER
$49/month
  • 250K vectors
  • 10M queries/month
  • 10 blobs
  • 4KB attributes
View All Tiers

Alpha Release & Roadmap

DotVectorDB is currently in alpha. We're actively developing new features and would love your feedback.

Coming Soon
  • Hybrid search (vector + keyword)
  • Advanced filtering and metadata queries
  • Real-time vector updates
  • SDKs for Python, TypeScript, and Go
  • Vector analytics and insights

Join our alpha program and help shape the future of DotVectorDB. Your feedback directly influences our roadmap.

Start Building Today

Get started with DotVectorDB in minutes. Free tier includes everything you need to build and test your RAG application.