Services Catalog
Full inventory of production services with reliability, ownership, and alert posture.
Total Services (Production)
Operational (82% of fleet)
Degraded / Outage (Needs attention)
Avg Health Score (Fleet-wide weighted)
Service Catalog
Click a row to view full service details, SLOs, dependencies, and recent activity.
| Service | Tier | Description | Health | Team | Owner | p95 | Error Rate | Uptime 30d | Alerts | Incidents |
|---|---|---|---|---|---|---|---|---|---|---|
| checkout-api | Tier 1 | Payment checkout flow orchestration. Handles 3DS, retries, idempotency. | Partial Outage | Payments Engineering | | 312ms | 1.400% | 99.620% | 8 | 1 |
| billing-service | Tier 1 | Subscription billing, invoicing, tax calculation. Integrates with Stripe. | Degraded | Platform Engineering | | 124ms | 0.210% | 99.860% | 5 | 1 |
| payment-ledger | Tier 1 | Immutable ledger of all payment events. Append-only, double-entry. | Operational | Payments Engineering | | 84ms | 0.050% | 99.940% | 2 | 0 |
| web-app | Tier 2 | Customer-facing Next.js web application. SSR + ISR, multi-region. | Operational | Web Application | | 220ms | 0.120% | 99.910% | 3 | 0 |
| redis-cluster | Tier 2 | Redis 7 cluster for session cache, rate limiting, and pub/sub. | Operational | Reliability Engineering | | 1ms | 0.000% | 99.980% | 1 | 0 |
| auth-service | Tier 1 | OAuth2 / OIDC identity provider, session management, MFA verification. | Operational | Reliability Engineering | | 52ms | 0.040% | 99.970% | 1 | 0 |
| authz-engine | Tier 2 | Policy evaluation engine for RBAC/ABAC. Open Policy Agent based. | Operational | Security Operations | | 3ms | 0.001% | 99.980% | 1 | 0 |
| kafka-bus | Tier 2 | Kafka event bus for cross-service async communication. | Operational | Platform Engineering | | 8ms | 0.000% | 99.960% | 2 | 0 |
| api-gateway | Tier 1 | Primary public API gateway. Routes all customer traffic, terminates TLS, applies rate limits and auth. | Operational | Platform Engineering | | 38ms | 0.018% | 99.992% | 2 | 0 |
| postgres-primary | Tier 1 | Primary OLTP PostgreSQL cluster, multi-AZ synchronous replication. | Operational | Reliability Engineering | | 4ms | 0.000% | 99.999% | 0 | 0 |
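
To make the Uptime 30d column easier to compare, the percentages can be converted into approximate downtime. The following is a minimal sketch, not the dashboard's own calculation: it assumes uptime is measured as a fraction of a full 30-day window (43,200 minutes), which the catalog does not state. The function name `downtime_minutes` and the `UPTIME_30D` mapping are hypothetical; the values are copied from the table.

```python
# Assumption: "Uptime 30d" is the share of a full 30-day window spent up.
WINDOW_MINUTES = 30 * 24 * 60  # 43,200 minutes in 30 days

# 30-day uptime percentages copied from the catalog table.
UPTIME_30D = {
    "checkout-api": 99.620,
    "billing-service": 99.860,
    "payment-ledger": 99.940,
    "web-app": 99.910,
    "redis-cluster": 99.980,
    "auth-service": 99.970,
    "authz-engine": 99.980,
    "kafka-bus": 99.960,
    "api-gateway": 99.992,
    "postgres-primary": 99.999,
}


def downtime_minutes(uptime_pct: float, window_minutes: int = WINDOW_MINUTES) -> float:
    """Convert an uptime percentage into minutes of downtime over the window."""
    return (100.0 - uptime_pct) / 100.0 * window_minutes


if __name__ == "__main__":
    # List services from least to most available.
    for name, pct in sorted(UPTIME_30D.items(), key=lambda kv: kv[1]):
        print(f"{name:18s} {pct:7.3f}%  ~{downtime_minutes(pct):7.2f} min down")
```

Under this assumption, checkout-api's 99.620% corresponds to roughly 164 minutes of downtime over the window, while postgres-primary's 99.999% corresponds to under half a minute.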