All systems operational.
Status updates land every 60s. RSS feed, email digest, and Slack webhook subscriptions are all live.
Subscribe via RSS for raw events, email for human-readable digests, or write directly to status@exai.cloud to be added to the on-call broadcast list. The page tells you, first, whether it’s us or you.
7 of 8 services green. Audit-log streaming on watch.
Service-by-service.
Reading the same dashboard SRE reads.
Eight surfaces. Each row reports current month uptime, status, and the last filed incident date. Same numbers our on-call engineer sees on the wall.
What’s on fire now.
And what just was.
One open incident, owner-attributed. Plus the four most recent filings inside the 90-day window — every entry links to its annotated post-mortem.
Audit log streaming · Splunk shipping lag (eu-west-1)
Customer-managed Splunk HEC endpoint in eu-west-1 returning backpressure. Audit events are buffering and being shipped via the S3 fallback path. No event loss; delivery delay is currently under 90 seconds.
Five regions. Five realities.
Latency, prebuild warm time, and cold-start are measured per region from in-region synthetic probes — refreshed every 60s. Model provider availability rolls up Anthropic, OpenAI, and Google upstream status.
Scheduled change.
No surprises.
Maintenance windows are advanced 14 days — signed by SRE on duty and broadcast on every subscription channel before they open.
- 01May 12 · 02:00–02:45 UTC·eu-west-1Postgres minor upgrade
- 02May 19 · 03:00–04:00 UTC·globalKMS rotation (90-day cadence)
- 03Jun 02 · 02:00–03:30 UTC·ap-southeast-1Firecracker host refresh
Get told first. Quietly.
Three pipes for status events: a raw RSS feed for the on-call channel, a daily email digest for platform owners, and a Slack webhook for team-room broadcasts. Subscribe to one — or all three.