Runtime Overview
AI Cost Firewall supports:
- health and readiness endpoints
- graceful shutdown
- request draining
- upstream timeout handling
- hot reload through
SIGHUP - runtime metrics
- semantic cache fail-open behavior
Startup dependencies
Redis is required for exact caching.
Qdrant is required when:
semantic_cache_enabled true;
semantic_cache_fail_open applies to runtime semantic lookup failures only.