The companies whose AI is paying back are not running better models. They are running better workflows.
Globalization Partners released its 2026 AI at Work Report this week, surveying 2,850 leaders across six global markets. The headline: 73% of executives say at least some of their AI investments fell short of expectations over the past 12 months. The share of leaders describing their organizations as "aggressively using AI to innovate" dropped from 60% to 42% in a single year. Nearly seven in ten say they are prepared to cut AI budgets if goals are not met this year.
Read alongside PwC's 2026 AI Performance Study, which finds that 20% of firms are capturing roughly three-quarters of AI's economic value, the picture is not "AI does not work." It is that a small minority of companies are getting outsized returns, and most of the field is stuck in pilot purgatory. The boardroom mood has shifted from "let's try things" to "show me what we got."
SDS exists for the 27% who are getting it right, and to move more companies into that group. This piece is how we think about the move.
The G-P report names a friction that does not show up in the AI vendor pitch deck: 69% of executives say the time employees spend monitoring, reviewing, and updating AI-generated work has gone up over the past year. The model produces output. A person checks it, rewrites parts of it, decides which parts to trust. The time saved on the first draft gets eaten by the time spent on the rework.
It gets worse. 88% of executives in the same study are concerned that employees are using AI to "perform productivity," with 47% very or extremely concerned this is already happening. And 82% of executives admit that AI has lowered the value they place on human employees, which is a quiet way of saying the social contract underneath the work is slipping.
Most AI programs are being scored on the wrong axis. The CFO asks: how many hours did this save us? The team reports a number. The number is small relative to what was promised, the program looks expensive, the budget gets cut.
The miss is that the early returns on AI are mostly not labor savings. They show up in decision quality, predictive power, cycle time on judgment-heavy work, and revenue protected by catching things humans would have missed. Katy George at Time put it sharply this week: leaders are measuring AI in the wrong places. Labor cost is a lagging indicator. Decision quality is the leading one. By the time the cost-savings number is big enough to make the spreadsheet land, the company is already three quarters behind whichever competitor scored their AI program on the right axis.
When we work with executives on this, the first move is almost never "build more." It is "define 1 to 3 outcomes the AI program is actually responsible for moving, and stop scoring it on everything else."
Most AI programs live in what we call "experiment land." The orgs in the 27% have crossed into "accountability land." The difference is not the budget or the model. It is the operating posture.
| Experiment land | Accountability land |
|---|---|
| Pilots with vague goals and tool-first experiments | Outcome-first, with a roadmap tied to concrete business metrics |
| Measured by usage and hours saved | Measured by revenue, risk, learning, and decision quality |
| Hidden human labor fixing AI outputs | Designed workflows where AI surfaces exceptions and the human reviews those |
| Human value quietly devalued, trust eroded | Human expertise explicitly elevated as the arbiter and trainer of the AI system |
| Stuck in pilot purgatory with no scale | Repeatable patterns that compound across teams and geographies |
Each row in that table is a design decision. None of them is a model upgrade. None of them requires you to bet the company on a new technology. They require someone to sit down and answer four questions for each AI use case in the portfolio, with the discipline to act on the answers.
This is the work, condensed.
The 27% of executives whose AI investments are paying back are not running better models, they are running better workflows: outcome-first goals, scoped tools, human expertise routed through approval queues, and leading indicators on decision quality instead of headcount math.
If you are reading this and the 73% number lands a little too close to home, the next move is a 30-minute conversation. Bring the list of AI initiatives you have in flight. We will walk through which ones are pointed at real outcomes, where the hidden rework tax is showing up in your team, and what would have to change for the program to land you in the 27%.
We do not take every engagement, and we will tell you on the call whether we are the right partner. Either way, you will leave the conversation with a sharper read on the portfolio than you came in with.