SDS · Field notes

Cloud waste hasn't moved in seven years. SMBs have nowhere to go.

27 percent waste, $182 billion globally, and a FinOps ecosystem priced for enterprise.

Type
Field note
Date
14 May 2026
Audience
Founders, CTOs, and operators running cloud workloads at small and mid-sized companies

The State of Cloud Waste 2026 puts the global cloud waste rate at 27 percent of cloud spend, against a global cloud bill of roughly $675 billion in 2025. Gross waste lands at $182 billion; the conservative estimate is $101 billion. The waste rate has not moved since 2019.

27% Waste rate, unchanged since 2019

30-40% Startup / SMB waste rate

23% Average GPU utilization

Cloud cost optimization has been the number-one stated cloud initiative across the major industry surveys for five consecutive years. Eighty-five percent of organizations report doing FinOps in some form, up from 72 percent a year ago. Twenty-eight percent of them are mature at it. The other 72 percent are "doing FinOps" the way a gym membership is "doing exercise."

The breakdown of where the waste actually sits is unsurprising once you see it. Thirty-five percent is idle compute. Twenty-five percent is overprovisioned instances. Fifteen percent is unattached storage. Ten percent is orphaned snapshots. Ten percent is data transfer. Five percent is unused licenses. Across providers, GPU utilization averages 23 percent.

The shape of the gap is the part most coverage misses. Startups waste 30 to 40 percent of cloud spend. Enterprises waste 18 to 25 percent. Small companies are worse at this, not better, and the ecosystem that has emerged around cloud cost discipline is built for enterprises. FinOps platforms are priced for enterprise procurement. FinOps content assumes a dedicated FinOps team. The result: the businesses with the worst waste rate have the fewest paths to fix it.

The wrong frame: "we need a FinOps platform"

The natural reflex inside an SMB looking at this is to assume that the fix is tooling. Buy CloudHealth or Cloudability or Apptio or one of the half-dozen platforms competing for the FinOps spend. The vendors all show the same screenshot: a dashboard with utilization heat maps, a forecast curve, a savings opportunity widget showing $X,XXX in monthly recoverable spend.

For a company spending $50,000 a month on cloud, that is the right call. The platform pays for itself. For a company spending $5,000 a month, the platform costs more than the waste it would identify, and the dashboard becomes a thing the operator means to look at and does not. The waste rate stays at 30 to 40 percent because the tool is not the bottleneck. The discipline is.

There is a second frame that needs killing. "We will tighten that up later, once we have product-market fit." This was reasonable advice in 2018 when cloud spend was small relative to engineering payroll. In 2026, with AI workloads pushing GPU spend into the same order of magnitude as a junior engineer's salary, the "we will tighten it later" path takes companies past the point where the waste is a real line item before anyone has built the muscle to address it.

The right frame: cost discipline is an architecture decision, not a tooling decision

The State of Cloud Waste 2026 report names four structural reasons waste persists: no ownership (only 44 percent of organizations use chargeback or showback), fear of breaking things (58 percent cite production-impact concerns), lack of visibility (61 percent cannot attribute more than 80 percent of cost to specific teams), and vendor incentives misaligned with customer outcomes.

Three of those four are downstream of architecture choices made early. Ownership lives in tagging discipline. Visibility lives in IaC organization. Fear of breaking things lives in whether the dev environment can be safely torn down, which is an environment-isolation question. The fourth (vendor incentives) is real, but it is the thing you compensate for, not the thing you fix.

We have applied the same posture to AI cost and to cloud security in prior pieces: the control surface that matters lives in the architecture, not in the tooling layer above it. Four moves separate an account that wastes 30 percent from one that wastes 5 percent.

Right-size at launch, not after the bill arrives

The default behavior across most teams is to provision conservatively at launch and then "rightsize later." Later does not come, because nothing forces it to. The dev environment that was sized for "we might need to load-test next quarter" runs at 12 percent CPU for eighteen months and bills $400 a month in spite of doing approximately nothing.

We ship every cloud environment sized to its actual expected load, not its hypothetical worst case. Production gets headroom; dev and staging do not. Autoscaling is configured at launch, with the scale-in side enabled. If the environment can run on t-class instances, it does. The same engineering review that catches dependency vulnerabilities flags resources sized two tiers above what their utilization curve justifies.

For GPU workloads specifically: no provisioned GPU instance lives without a defined utilization target and a corresponding spend ceiling. If the workload cannot keep the GPU above some named utilization floor, the workload runs on a smaller GPU or moves to spot capacity. Twenty-three percent average utilization across the industry is a tooling failure, not a workload failure; teams know how to use GPUs, they just have not built the muscle to right-size them.

Tagging discipline encoded in IaC, not in policy documents

The "no ownership" finding (only 44 percent of organizations do chargeback or showback) is a tagging problem, and tagging is solved at the IaC layer or not at all. A tagging policy written into a Confluence page and reviewed quarterly does not produce tagged resources. A tagging policy encoded as a required input to every IaC module produces them automatically.

Every resource we provision in an engagement carries owner, environment, project, and cost-center tags by IaC contract. The module refuses to apply if the tags are not set. AWS Cost Allocation tags are activated as part of account setup, not as an afterthought. The result is that cost attribution above 80 percent is the default state, not the aspiration.

For SMBs, the practical implication is that the tagging conversation happens once, at the start of an engagement, and the discipline runs itself from there. No FinOps platform required; the native cloud cost reporting against well-tagged resources is enough for most companies under $20,000 a month in cloud spend.

Auto-stop and lifecycle policy as default, not exception

The 35 percent of waste that is idle compute, and the 15 percent that is unattached storage, are both lifecycle problems. The resource was created for a reason, the reason expired, and nothing reaped it. The reasonable industry response would be to default the cloud to reaping idle resources; that has not happened, because the cloud providers profit from idle resources existing. So we encode it in the engineering posture instead.

Dev and staging environments shut down outside business hours by default. Production preserves uptime; everything else does not. EBS volumes that detach for more than seven days get flagged and deleted on review. Snapshots older than 30 days outside an explicit retention policy get reaped. S3 lifecycle rules age objects into Infrequent Access and Glacier on schedules that fit the data, not on the operator remembering to set them.

These are not exotic patterns. They are AWS Instance Scheduler, EventBridge plus Lambda, S3 Lifecycle, and a small amount of IaC discipline. The reason they are not the default at most SMBs is not that they are hard; it is that "set up auto-stop on the dev environment" never gets prioritized against shipping the next feature.

Daily cost reconciliation in the application layer

The companion piece to this one (the AI cost discipline Field Notes) makes the case that AWS Cost Anomaly Detection does not cover charges billed through AWS Marketplace, and that cost observability needs to live in the application layer to catch the things the cloud's own tooling will miss. The same posture applies to general cloud spend, not just AI.

In every engagement, we ship a daily cost-reconciliation report. The application's own counters (requests, storage writes, data transfer, GPU hours) get logged in a form the team can compare against the cloud bill the next day. When the gap appears (Lambda invocation count doubled, data transfer out spiked, a forgotten dev environment kicked back on), the team sees it within 24 hours, not at the end of the month.

This is not a FinOps platform. It is a small Lambda or scheduled job that the engagement leaves behind, owned by the same engineers who own everything else. The output goes to the same channel that gets dependency-update alerts. For SMBs without a dedicated ops team, this is the difference between a cost surprise that takes a month to surface and one that takes a day.

What this article is not

Not a critique of the FinOps Foundation or any specific platform vendor. The discipline is real; the tooling works for the enterprises it was built for. The gap is that the SMB segment is not served by the same products and is rarely written about in the same content.
Not a claim that SDS ships a productized "FinOps for SMBs" SKU. We do not. Cloud cost discipline is part of how we run engineering engagements, and the patterns above are what we leave behind when we hand a project off.
Not a one-size-fits-all blueprint. The right auto-stop schedule for a healthcare workload with batch processing windows is different from the right schedule for a marketing site. The principles transfer; the schedule does not.
Not a recommendation to cut aggressively at the expense of reliability. Production gets headroom; dev does not. The 5 to 8 percent waste target at SMB scale is achievable without breaking anything that matters, through architecture discipline, not through margin-shaving heroics.
Not a complete cloud cost curriculum. We did not cover reserved instances and savings plans, spot capacity strategy for non-time-sensitive workloads, or multi-cloud arbitrage. For most SMBs, the four moves above produce most of the available recovery before any of them become worth the operational complexity.

One-sentence takeaway

The 27 percent cloud waste rate has not budged in seven years because the discipline that fixes it has only been packaged for enterprises, and the SMBs with the worst waste rate need it as architecture, not as tooling.

Talk to us

If you are running cloud workloads at SMB scale (say, $1,000 to $30,000 a month) and the question "are we one of the companies wasting 30 percent" is open, the next move is a 30-minute conversation. Bring the rough shape of your cloud footprint (which provider, which services, where the spend concentrates), and where your cost reporting lives today. Within the call we will tell you where the highest-impact move sits, what we would tighten first, and whether a deeper SDS engagement is the right next step.

We do not take every engagement, and we will tell you on the call whether we are the right partner.

www.socialdolphinservices.com

Sources

Spendark, The State of Cloud Waste 2026: $100B+ in Unnecessary Spend (2026). Source for the 27 percent waste rate, the $182B gross / $101B conservative figures, the waste breakdown by category, the four structural causes of waste, and the 23 percent average GPU utilization figure.
Flexera, 2026 State of the Cloud Report (industry coverage, May 2026). Source for the 32 percent average waste figure, the 85 percent FinOps adoption / 28 percent maturity split, and the multi-cloud adoption numbers.
LeanOps, 2026 FinOps Playbook (2026). Context for the 30 to 50 percent cost recovery range achievable by mature FinOps practitioners.