Qdrant Backend
Qdrant stores embeddings and response payloads for semantic caching.
Ports
AI Cost Firewall uses Qdrant gRPC:
6334
Qdrant REST usually runs on:
6333
REST is useful for health checks and manual inspection.
Local Docker example
docker run -d --rm --name qdrant \
-p 6333:6333 \
-p 6334:6334 \
qdrant/qdrant
Health check:
curl http://127.0.0.1:6333/healthz
Firewall config:
qdrant_url http://127.0.0.1:6334;
Vector size
qdrant_vector_size 1536;
This must match the embedding model. Existing collections are validated at startup.