Research · Financial Press

Back to sweep

Research sweep · deep · 2025 – 2026

AI on Deterministic Rails

  • Claude Opus 4.8
  • financial
  • frontier
  • academic
  • vc
  • blogs
  • tech

Synthesised 2026-06-07

Narrative

The dominant financial-press theme from early 2025 through June 2026 is a sharpening gap between AI adoption breadth and value depth. McKinsey's 2025 State of AI survey found that 88 percent of organisations used AI in at least one function, yet only 39 percent reported any EBIT impact and just 23 percent were scaling agentic systems beyond one or two functions. BCG's parallel analysis found that 60 percent of enterprises generate no material value despite investment, and only 5 percent are creating substantial value at scale. The MIT-reported figure — widely cited in Fortune and Axios — that 95 percent of generative AI pilots are stalling reflects not model failure but governance absence: no clear owner, no workflow redesign, no production-grade data.

The token-cost shock of early 2026 crystallised the enterprise economics problem. Uber exhausted its full 2026 AI tools budget in four months after rolling out Anthropic's Claude Code to roughly 5,000 engineers; the company's Operations chief admitted there was no clear link between token consumption and useful consumer features. Microsoft cancelled most of its internal Claude Code licences for its Experiences and Devices division in May 2026, steering developers back to GitHub Copilot CLI. Goldman Sachs Research, in its May 2026 'Decoding the Agentic Economy' report, projected that agentic AI will drive a 24-fold increase in token consumption by 2030, reaching 120 quadrillion tokens per month, while also forecasting 60–70 percent annual declines in per-token compute cost. The divergence — falling unit costs, exploding total spend — is the central tension: agentic loops can consume 1,000 times the tokens of a single chatbot query, and budgets set before the Claude Code-driven adoption surge in late 2025 are now structurally obsolete.

Bloomberg Intelligence's June 2026 Generative AI 2026 Outlook, which raised its market forecast to $2.3 trillion by 2032 from $1.8 trillion in its March 2025 edition, identified the shift from training to inference as the primary demand driver and noted that coding agents alone are on track from $1 billion in 2024 to near $100 billion by 2032. The pricing model disruption is already underway: SAP CEO Christian Klein told Bloomberg in March 2026 that the company was shifting to consumption-based pricing; GitHub moved all Copilot plans to usage-based AI Credits billing from June 2026; Bloomberg estimated that subscription-based pricing could decline from 60 percent of software pricing models to 30 percent over the next decade as outcome-based models expand. Incumbent platforms with deep workflow integration — Microsoft, SAP, Workday, ServiceNow — appear structurally better positioned than point-solution SaaS vendors in Bloomberg Intelligence's disruption framework.

The open-weight model shift added a second cost-control lever alongside orchestration. DeepSeek V3, released in December 2024 at a claimed training cost of $6 million, offered inference at roughly 12.5 times the cost advantage over Claude 3.5 Sonnet according to IISS analysis. MindStudio's May 2026 survey of enterprise deployments found that open-weight models including DeepSeek V3, Qwen 3, and Llama 4 Maverick were matching closed frontier models on coding, classification, and structured data extraction tasks, with costs 5–20 times lower and the option of self-hosted private inference for data-sensitive workflows. The harness-over-model thesis gained traction in parallel: practitioners and Bloomberg Intelligence analysts converged on the view that orchestration design — context seeding, subagent delegation, deterministic control flow, error handling — now accounts for more variance in production outcomes than the choice of underlying model.


Sources

ID Title Outlet Date Significance
f1 Agentic AI 2026 Outlook — Bloomberg Intelligence Bloomberg Intelligence 2026-05 Bloomberg Intelligence's primary research report on how agentic AI is restructuring enterprise software pricing from seat-based to usage- and outcome-based models, directly relevant to the token-economics and SaaS disruption angles.
f2 Generative AI Market Poised to Reach $2.3 Trillion by 2032 as Agentic Systems Proliferate — Bloomberg Intelligence Bloomberg Intelligence 2026-06 Bloomberg Intelligence's June 2026 market-size forecast, including the shift of AI revenue from training to inference, and the projection that coding agents grow from $1 billion in 2024 to near $100 billion by 2032.
f3 Bloomberg Intelligence on AI Agents in the Enterprise (video) Bloomberg 2026-05 Bloomberg Intelligence senior software analyst discusses how AI agents are disrupting the enterprise software stack, offering independent Wall Street analytical perspective on the agentic transition.
f4 The AI Trainers Charging $25,000 a Day to Push Wall Street's Agentic Shift Bloomberg 2026-05 Bloomberg feature on the practical and financial challenge of enterprise agentic adoption in financial services, documenting the gap between capital commitment and realised workflow automation.
f5 How DeepSeek and Open-Source AI Models Are Disrupting Big Tech Bloomberg 2025-08 Bloomberg's primary news analysis of the open-weight model disruption, covering how DeepSeek and Chinese labs pushed competitive inference costs down and forced OpenAI to release its first open model in six years.
f6 OpenAI Releases Open-Weight Models After DeepSeek's Success Bloomberg 2025-08 Bloomberg's reporting on OpenAI's strategic response to open-weight competition, marking a structural shift in how frontier lab models are distributed and priced.
f7 AI Sticker Shock Hits Corporate America Axios 2026-05 Axios original reporting naming specific enterprise AI cost-management crises, including direct executive quotes on misallocated use cases and uncontrolled token spend, with commentary from a former Microsoft chief AI officer.
f8 AI Agents Forecast to Boost Tech Cash Flow as Usage Soars — Goldman Sachs Research Goldman Sachs Research 2026-05 Goldman Sachs primary research publication ('Decoding the Agentic Economy') forecasting a 24-fold increase in token consumption by 2030 and a coming margin inflection for hyperscalers, the key investment-bank framework for understanding agentic cost dynamics.
f9 AI Token Costs Force Rethink at Uber and Microsoft EE News Europe 2026-05 Synthesises the Goldman Sachs token-demand forecast against documented enterprise pullbacks at Uber and Microsoft, providing the sharpest single-article framing of the cost paradox: cheaper tokens, higher total bills.
f10 Token Shock Hits Silicon Valley's Biggest Spenders PYMNTS 2026-05 Documents Microsoft's Claude Code licence cancellations and Uber's full 2026 AI budget exhausted in four months, with specific financial figures including Uber's $3.4 billion R&D spend and the structural mismatch between usage-based billing and enterprise finance cycles.
f11 AI Agent Economics: Token Tax Locks Gross Margins 30 Points Below SaaS Baseline TechTimes 2026-06 Provides the clearest unit-economics framing of the token tax problem, citing ICONIQ Capital data showing AI-native product gross margins at 52 percent in 2026 versus 75–85 percent for mature SaaS, and documents the structural cost asymmetry between model-maker and API buyer.
f12 Agentic AI Enterprise Token Cost EY 2026-06 EY consulting framework coining 'Agent FinOps' as a necessary enterprise discipline, and documenting how a single customer-service interaction can inflate from $0.04 to $1.20 under agentic orchestration — practical TCO evidence missing from vendor claims.
f13 AI Costs Begin to Bite as Agents May Increase Token Demand by 24 Times — Uber and Microsoft Among Companies Feeling the Bite Tom's Hardware 2026-05 Documents Uber's admission that 80 percent of engineers used agentic AI and over 60 percent of code was AI-generated with no clear correlation to consumer product value, making it the most cited enterprise cost-ROI mismatch case study of 2026.
f14 The State of AI in 2025: Agents, Innovation, and Transformation McKinsey & Company 2025-11 McKinsey's annual survey of 1,993 organisations across 105 countries, finding that while 88 percent use AI in at least one function, only 23 percent are scaling agentic systems and just 39 percent report EBIT impact — the primary independent benchmark for enterprise AI adoption status.
f15 From PoC to Production: Why Enterprise AI Struggles for Trust Tech Journal UK 2025-12 Named practitioner testimony from NatWest Group's global AI architecture lead on the governance gap that kills pilots in production — a key primary-voice source on the PoC-to-production structural barrier.
f16 MIT Report: 95% of Generative AI Pilots at Companies Are Failing Fortune 2025-08 Fortune's coverage of MIT data showing that PoC stall rates are structural rather than accidental, with the MIT finding that back-office automation delivers the highest ROI but receives less than half of budget allocation.
f17 DeepSeek's Release of an Open-Weight Frontier AI Model International Institute for Strategic Studies (IISS) Strategic Comments 2025-04 IISS authoritative policy analysis establishing that DeepSeek V3 inference is 12.5 times cheaper than Claude 3.5 Sonnet and over 15 times cheaper than GPT-4o, providing the foundational cost-differential evidence for the open-weight right-sizing thesis.
f18 Open-Weight AI Models Are Catching Up: What It Means for Enterprise Automation MindStudio 2026-05 Practical enterprise analysis showing where open-weight models (DeepSeek V3, Qwen 3, Llama 4 Maverick) now match closed frontier models on coding and structured tasks while remaining 5–20 times cheaper, and identifying where the performance gap persists in long agentic workflows.
f19 SAP Shifts to AI Consumption Pricing as Agents Threaten SaaS Revenue Model ERP Today 2026-04 Documents CEO Christian Klein's March 2026 Bloomberg interview announcing SAP's shift from per-user to consumption-based pricing, the most concrete large-enterprise example of the seat-to-usage pricing transition and its predictability risks for buyers.
f20 Enterprise SaaS in the Agentic AI Era: Salesforce, ServiceNow, Workday VaaSBlock 2026-06 Independent financial analysis comparing how Salesforce, ServiceNow, and Workday are positioned against the agentic pricing shift, with market performance data showing compressed Salesforce valuation multiples as Agentforce revenue conversion lags seat revenue erosion.
f21 Klarna AI Customer Service: Replacing 700 Agents — A 2026 Case Study Perspective AI 2026-05 Forensic case study distinguishing Klarna's verified operational results from vendor-narrative inflation, documenting the May 2025 Bloomberg/Reuters-reported reversal as a scope correction rather than full walkback, and providing the architectural detail about authenticated context access that makes the deployment non-replicable generically.
f22 Artificial Intelligence Helps Klarna Double Revenues with Half the Staff Computer Weekly 2025-11 Documents Klarna's Q3 2025 financial results showing revenue of $903 million against a workforce reduced from 5,500 to below 3,000, providing the primary financial performance data point for the AI-enabled workforce contraction thesis.
f23 AI Prices Are Going Up, Up, Up — And What This Means for Enterprise AI Josh Bersin 2026-05 Names Uber and failed projects at Pizza Hut and Starbucks as cases where token burn preceded project failure, and documents Big 4 hyperscaler 2025 capex at $370–410 billion with Reuters-cited Bridgewater estimates, framing the macro investment context.
f24 Agentic AI 2024–2025 Retrospective: Shipped vs Walked Back AgentModeAI 2026-05 Provides a four-class evidence taxonomy distinguishing vendor-controlled wins, audited pilots, public walk-backs, and structural failure modes — the most rigorous independent sceptical framework for evaluating agentic ROI claims available in the date range.
f25 Products Over Models: Why the AI Harness Matters More Than Benchmarks in 2026 MindStudio 2026-05 Practitioner analysis establishing that harness design — orchestration logic, context management, tool integrations, output handling — accounts for more production performance variance than model selection, directly supporting the orchestration-over-model-capability thesis.

We use analytics cookies to understand site usage and improve the service. We do not use marketing cookies.