Service Health History

90-day availability heatmaps for every production service. Spot chronic issues and degradation patterns.

Avg Uptime (90d)

99.965%

Across all filtered services

Total Downtime

497 min

Last 90 days

Incident Days

160

Days with at least one incident

Services Tracked

11

11 total
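
The uptime and downtime figures on each service card appear to be two views of the same measurement. As a rough sketch, assuming uptime is simply the share of minutes in a 90-day window without downtime (the dashboard's actual formula is not stated on this page, and the function name `uptime_percent` is hypothetical), the conversion looks like this:

```python
WINDOW_DAYS = 90          # reporting window shown on the dashboard
MINUTES_PER_DAY = 24 * 60


def uptime_percent(downtime_minutes: float, window_days: int = WINDOW_DAYS) -> float:
    """Return availability over the window as a percentage.

    Assumption: uptime = 1 - downtime / total minutes in the window.
    """
    window_minutes = window_days * MINUTES_PER_DAY
    return 100.0 * (1.0 - downtime_minutes / window_minutes)


# Downtime minutes copied from a few of the service cards on this page.
downtime_by_service = {
    "api-gateway": 25,
    "billing-service": 57,
    "checkout-api": 12,
    "vault": 68,
}

for name, minutes in downtime_by_service.items():
    print(f"{name}: {uptime_percent(minutes):.3f}%")
```

Under this assumption the computed values agree with the displayed per-service uptime to within about 0.001 percentage points. Small differences are expected because the cards show downtime rounded to whole minutes.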

11 of 11 services
api-gateway - Tier 1 - Operational

Primary public API gateway. Routes all customer traffic, terminates TLS, applies rate limits and auth.

Avg Uptime

99.981%

Downtime

25m

Incidents

13

api-gateway - 90-Day Availability (heatmap of daily uptime %)

billing-service - Tier 1 - Degraded

Subscription billing, invoicing, tax calculation. Integrates with Stripe.

Avg Uptime

99.956%

Downtime

57m

Incidents

2

billing-service - 90-Day Availability (heatmap of daily uptime %)

auth-service - Tier 1 - Operational

OAuth2 / OIDC identity provider, session management, MFA verification.

Avg Uptime

99.966%

Downtime

45m

Incidents

1

auth-service - 90-Day Availability (heatmap of daily uptime %)

postgres-primary - Tier 1 - Operational

Primary OLTP PostgreSQL cluster, multi-AZ synchronous replication.

Avg Uptime

99.960%

Downtime

51m

Incidents

16

postgres-primary - 90-Day Availability (heatmap of daily uptime %)

redis-cluster - Tier 2 - Operational

Redis 7 cluster for session cache, rate limiting, and pub/sub.

Avg Uptime

99.952%

Downtime

62m

Incidents

20

redis-cluster - 90-Day Availability (heatmap of daily uptime %)

checkout-api - Tier 1 - Partial Outage

Payment checkout flow orchestration. Handles 3DS, retries, idempotency.

Avg Uptime

99.991%

Downtime

12m

Incidents

1

checkout-api - 90-Day Availability (heatmap of daily uptime %)

kafka-bus - Tier 2 - Operational

Kafka event bus for cross-service async communication.

Avg Uptime

99.958%

Downtime

55m

Incidents

18

kafka-bus - 90-Day Availability (heatmap of daily uptime %)

web-app - Tier 2 - Operational

Customer-facing Next.js web application. SSR + ISR, multi-region.

Avg Uptime

99.951%

Downtime

63m

Incidents

1

web-app - 90-Day Availability (heatmap of daily uptime %)

vault - Tier 1 - Operational

HashiCorp Vault for secrets management and dynamic credentials.

Avg Uptime

99.948%

Downtime

68m

Incidents

20

vault - 90-Day Availability (heatmap of daily uptime %)

authz-engine - Tier 2 - Operational

Policy evaluation engine for RBAC/ABAC. Open Policy Agent based.

Avg Uptime

99.959%

Downtime

53m

Incidents

13

authz-engine - 90-Day Availability (heatmap of daily uptime %)

payment-ledger - Tier 1 - Operational

Immutable ledger of all payment events. Append-only, double-entry.

Avg Uptime

99.951%

Downtime

63m

Incidents

18

payment-ledger - 90-Day Availability (heatmap of daily uptime %)
