Qdrant Backend

Qdrant stores embeddings and response payloads for semantic caching.

Ports

AI Cost Firewall connects to Qdrant over gRPC, which Qdrant serves on this port by default:

6334

The Qdrant REST API listens on this port by default:

6333

REST is useful for health checks and manual inspection.

Local Docker example

docker run -d --rm --name qdrant \
  -p 6333:6333 \
  -p 6334:6334 \
  qdrant/qdrant

Health check:

curl http://127.0.0.1:6333/healthz
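
A freshly started container may need a moment before it answers requests. The following is a minimal Python sketch that polls the same REST health endpoint until it responds. The function names, attempt count, and delay are illustrative assumptions, not part of AI Cost Firewall or Qdrant:

```python
import time
import urllib.error
import urllib.request


def healthz_ok(base_url: str, timeout: float = 2.0) -> bool:
    """Return True if GET {base_url}/healthz answers with HTTP 200."""
    try:
        with urllib.request.urlopen(f"{base_url}/healthz", timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or a non-2xx response: not ready yet.
        return False


def wait_until_healthy(base_url: str, attempts: int = 30, delay: float = 1.0,
                       probe=healthz_ok) -> bool:
    """Call probe(base_url) up to `attempts` times, sleeping `delay` seconds
    after each failed try. Returns True as soon as one probe succeeds."""
    for _ in range(attempts):
        if probe(base_url):
            return True
        time.sleep(delay)
    return False


# Example call (assumes the REST port mapping from the Docker command above):
# wait_until_healthy("http://127.0.0.1:6333")
```

The `probe` parameter exists only so the loop can be exercised without a running Qdrant instance.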

Firewall config:

qdrant_url http://127.0.0.1:6334;
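
Because the firewall speaks gRPC, a `qdrant_url` that points at 6333 would be targeting the REST listener under the default port assignments. A small sketch of a sanity check for that case follows. The helper name is hypothetical, and it assumes Qdrant runs on its default ports; a deployment with remapped ports would yield "unknown":

```python
from urllib.parse import urlparse

# Default Qdrant ports, as listed in the Ports section.
DEFAULT_REST_PORT = 6333
DEFAULT_GRPC_PORT = 6334


def classify_qdrant_url(url: str) -> str:
    """Classify a URL as "grpc", "rest", or "unknown" by its port alone.

    This only compares against the default ports; it does not contact Qdrant.
    """
    port = urlparse(url).port
    if port == DEFAULT_GRPC_PORT:
        return "grpc"
    if port == DEFAULT_REST_PORT:
        return "rest"
    return "unknown"


# Example: the value from the config line above is classified as "grpc".
kind = classify_qdrant_url("http://127.0.0.1:6334")
```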

Vector size

qdrant_vector_size 1536;

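This value must match the output dimension of the embedding model. For example, OpenAI's text-embedding-3-small produces 1536-dimensional vectors by default. Existing collections are validated at startup.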
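
To check an existing collection by hand, Qdrant's REST API reports the vector size in the response to `GET /collections/{collection_name}`. The sketch below reads that field and compares it with the configured value. It is not the firewall's own startup validation, it assumes a collection with a single unnamed vector, and the collection name `semantic_cache` is a hypothetical placeholder:

```python
import json
import urllib.request


def fetch_collection_info(base_url: str, collection: str) -> dict:
    """GET {base_url}/collections/{collection} and return the parsed JSON body."""
    with urllib.request.urlopen(f"{base_url}/collections/{collection}", timeout=5) as resp:
        return json.load(resp)


def collection_vector_size(info: dict) -> int:
    """Extract the vector size from a collection info response.

    Assumes a single unnamed vector config; collections with named vectors
    carry one size per name and are rejected here.
    """
    vectors = info["result"]["config"]["params"]["vectors"]
    if "size" not in vectors:
        raise ValueError("collection uses named vectors; inspect each entry separately")
    return int(vectors["size"])


def matches_configured_size(info: dict, configured_size: int) -> bool:
    """True if the collection's vector size equals qdrant_vector_size."""
    return collection_vector_size(info) == configured_size


# Example call (hypothetical collection name, REST port from the Ports section):
# info = fetch_collection_info("http://127.0.0.1:6333", "semantic_cache")
# matches_configured_size(info, 1536)
```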