§ 01 / 06

CustomersFortune 50 → 18-person teamsOperator memory · receipts · reference calls available.

cohort · n=40 · Q1 2026

exAI · what 40 platform teams actually measured

Operator memory.
Real numbers.

Forty platform teams, eighteen industries, one agentic runtime — measured at ninety days, not at the demo.

These pages are not testimonials. They are receipts. Onboarding drops, AI-PR throughput, cold-start P50, contractual uptime, and dollar-savings per seat — collected from the same telemetry the customer ships to their own SIEM. Pick one. We will put you on the phone with the engineer who lived it.

Request reference call Download case study

Aggregate · n=40 cohort

90-day rolling

Q1 2026 · raw telemetry

Median across the cohort. Outliers redacted at request of customer counsel — full datasets available under NDA at /benchmarks.

Engineers active / mo

Cold-start · P50

0 ms

AI-PRs / eng / mo

Uptime · SLA

0.00%

Fig. 01 · cohort aggregateMethodology · /benchmarks

§ 02 / 06Deployed by

Fortune 50 fintech, public healthcare, commerce, defense, mobility, biotech, telecom — twelve named, twenty-eight under NDA. Reference calls arranged within five business days.

Acme PayNorthwindCoalfieldWestwind HealthGlobex MarketsTributary EnergyAquila TelecomStandard FederalHelix MobilityLattice BioPolaris TradingAegis DefenseAcme PayNorthwindCoalfieldWestwind HealthGlobex MarketsTributary EnergyAquila TelecomStandard FederalHelix MobilityLattice BioPolaris TradingAegis Defense

§ 03 / 06

Operator memory · Fortune 50 fintech

We retired a forked VSCode, two point tools, and an internal Gitpod. Ramp-up dropped from six weeks to three days, and our platform team stopped maintaining the IDE.

Karim Mori

Head of Developer Platform · Fortune 50 fintech

Engineers

4,812

Onboarding

−92%

PRs merged

4.8×

Saved / year

$1.2M

Replaced01 / 03

A forked VSCode distribution maintained by a five-person platform team, two point-tool subscriptions for AI completions and review, and an internally hosted Gitpod cluster running on EKS.

Kept02 / 03

GitHub Enterprise, Datadog, Buildkite, the SOC 2 control catalog, the existing Okta-fronted SSO, and the Friday demo cadence — exAI dropped in beneath them, not on top.

Surprised them03 / 03

Reviewer trust climbed before throughput did. The platform team had budgeted six months for cultural adoption — Composer's plan-first surface earned the room in three weeks.

Karim leads developer platform for a Fortune 50 fintech with 4,812 engineers across nine product divisions. The cohort he represents is the hardest one we sell into — incumbent tooling, entrenched vendors, an internal IDE team with twelve years of accumulated culture. He took the reference call on the record.

§ 04 / 06

Three operators · three numbers

Reviewer trust.
Container root.
Migration scale.

Three different sized organizations, three different shaped problems, the same agentic runtime underneath. Each operator picked the receipt that mattered to their seat.

on the record

“Composer's plan step is the first AI coding feature I've seen that earns reviewer trust.”

Jules Reeves

Principal Engineer · Infrastructure SaaS · $8B ARR

Time-to-first-PR

1 day

PRs / week

412

Silent merges

Reviewer ack

100%

on the record

“Firecracker per workspace means I stopped having the 'why does every dev have Docker root' conversation.”

Marisa Liao

CISO · Public healthcare platform · 18K employees

vCPU floor

VM escapes

SOC 2 controls

92 / 92

KMS regions

on the record

“The Orchestrator ran a 30-hour Next.js 14→15 migration across 184 apps. Two human gates. One PR per app. Zero rollbacks.”

Daniel Hsu

Staff Engineer · Commerce platform · 2,200 engineers

Apps migrated

184

Run time

30h 12m

Human gates

Rollbacks

§ 05 / 06

Aggregate impact

What 40 platform teams
actually measured.

Pulled from the customer's own telemetry, not from a survey. Ninety-day rolling windows. Outliers trimmed at p2 / p98 before medians were computed. The full cohort dataset is published, anonymized, at /benchmarks.

Onboarding timemedian

−0%

Range −62 → −94 across cohort.

AI-PRs / eng / momedian

Median. Top decile 41.

TCO · 500-seat / yrmedian

$0K

Down from $1.84M legacy stack.

Uptime · contractualmedian

0.00%

8 quarters · 0 SLA breaches.

MethodologyQ1 2026 cohortn=4090-day rollingraw datasets at /benchmarks

Download full cohort report ↗

§ 06 / 06

Reference calls · within five business days

Talk to one of them.
On the record.

Pick the seat closest to yours — CISO, principal engineer, head of platform — and we will route the call. No pre-screening, no vendor-side talking points, no NDA on the conversation. The numbers above are the ones they will recite.

Request reference call Join the waitlist

SOC 2 Type IIISO 27001HIPAA-readyGDPR · DPFPCI DSS 4.0

Operator memory.Real numbers.

Reviewer trust.Container root.Migration scale.

What 40 platform teamsactually measured.

Talk to one of them.On the record.

Operator memory.
Real numbers.

Reviewer trust.
Container root.
Migration scale.

What 40 platform teams
actually measured.

Talk to one of them.
On the record.