Service Health
Real-time health posture of all production services with uptime sparklines and reliability metrics.
Operational
of 11 services
Degraded
performance impacted
Partial Outage
requires attention
Avg Health Score
Health Matrix
At-a-glance color-coded status for every service. Hover for details.
api-gateway
Primary public API gateway. Routes all customer traffic, terminates TLS, applies rate limits and auth.
billing-service
Subscription billing, invoicing, tax calculation. Integrates with Stripe.
auth-service
OAuth2 / OIDC identity provider, session management, MFA verification.
postgres-primary
Primary OLTP PostgreSQL cluster, multi-AZ synchronous replication.
redis-cluster
Redis 7 cluster for session cache, rate limiting, and pub/sub.
checkout-api
Payment checkout flow orchestration. Handles 3DS, retries, idempotency.
kafka-bus
Kafka event bus for cross-service async communication.
web-app
Customer-facing Next.js web application. SSR + ISR, multi-region.
vault
HashiCorp Vault for secrets management and dynamic credentials.
authz-engine
Policy evaluation engine for RBAC/ABAC. Open Policy Agent based.
payment-ledger
Immutable ledger of all payment events. Append-only, double-entry.
Composite Health Score Trend
Average health score across all services (30d)