Skip to main content

Runtime Overview

AI Cost Firewall supports:

  • health and readiness endpoints
  • graceful shutdown
  • request draining
  • upstream timeout handling
  • hot reload through SIGHUP
  • runtime metrics
  • semantic cache fail-open behavior

Startup dependencies

Redis is required for exact caching.

Qdrant is required when:

semantic_cache_enabled true;

semantic_cache_fail_open applies to runtime semantic lookup failures only.