AI Isn’t Cheap to Run and Anthropic’s New Forecast Proves It

Anthropic just admitted what most AI companies avoid saying out loud: serving intelligence at scale is still very expensive.

Anthropic, the company that developed Claude, has made a novel move in the field of artificial intelligence. It admitted that running large language models at scale is still brutally expensive. 

Internally, the company trimmed its 2025 gross profit margin forecast to around 40%, roughly ten percentage points lower than earlier expectations. Neither because customers disappeared nor due to less demand. Only because the cloud is costing more than planned. Specifically, the price of running Anthropic’s models on Google Cloud and Amazon Web Services came in about 23% higher than forecast. 

It’s not a collapse, and it’s certainly not a crisis. It’s a reality check, albeit an uncommonly honest one, in an industry still enamored with perfect curves and frictionless growth stories.

The Reality Check Behind Anthropic’s Revised Forecast

Even at a 40% gross margin, Anthropic’s margins would still be significantly better than they were the previous year which means that the corporation is still heading in the right direction, but not as quickly or inexpensively as its spreadsheets originally predicted. That distinction matters, particularly given a market where AI companies are often talked about as if they print money the moment a Fortune 500 logo signs a contract. 

What’s unfolding in this instance is a lesson in how AI economics works soon after you leave the pitch deck and proceed to production.

Renting Intelligence is Expensive 

Although training a large model is costly, it is a one-time inconvenience. The meter runs every hour of the day when that model is served to real consumers at scale. Every prompt and every response, even every enterprise workflow discreetly consumes GPU resources. Anthropic is largely dependent on Google and Amazon‘s cloud infrastructure, and those costs have not gone down as effectively as anticipated. Inference is still persistently pricey despite model optimizations and increasing enterprise adoption. 

CFOs can’t sleep over a figure like the 23% cost overrun. Not because it devalues the business, but because it shifts the pace. Growth must put forth more effort to compensate when margins contract.

Context is important, though. An anticipated 40% gross margin in 2025 would put Anthropic far ahead of where it was a year ago. 

Predictable Usage Doesn’t Mean Predictable Costs

The company’s increasing unit economics are reflective of a broader trend among major AI vendors. Contracts grow in permanence and usage becomes more predictable as more enterprise customers use AI for document workflows, internal knowledge systems, customer support and coding assistance. This steadiness aids in maintaining revenue even when costs fluctuate. Anthropic continues to prepare for aggressive top line expansion for this reason. 

The margin trim does not imply retreat. It indicates setting priorities. Anthropic believes the long game will prevail in enterprise AI. Businesses tolerate fewer illusions and more reliability, but they also pay greater wages and experience less turnover.

Zooming out, this narrative reveals as much about the AI industry as it does about Anthropic.

The Infrastructure Reality Behind AI Margins

Despite all the rhetoric of endless scale and software margins, the present-day leading AI models operate more like infrastructure businesses than classic SaaS. They depend on hardware supply chains, energy prices and cloud pricing strategies controlled by a small number of hyperscalers. 

Margins will continue to be a fluctuating target until businesses like Anthropic own more of their own compute stack or dramatically reduce inference costs through novel architectures.

This fact is already influencing sector-wide strategy. Long-term cloud partnerships and conversations about custom chips, as well as the clandestine shift to smaller yet more effective models for corporate functions are all examples of it. Bigger is no longer necessarily better if it consumes your earnings faster than it generates revenue.

Anthropic’s decision to reduce its margin projection does not indicate a lack of competence. It is an indication of maturity. 

The Takeaway

Early AI narratives thrive on immaculate assumptions and infinite scale with margins that might miraculously behave like pure software. Businesses in the real world don’t get that luxury. They have to revise forecasts and absorb higher costs, all while they keep building anyway. The ones worth watching are not the loudest optimists, but the operators willing to admit, “This is harder than we thought, and we are still in.”

The fantasy is that AI profitability arrives fully formed. The reality is more disorganized and shaped by hardware and infrastructure constraints that refuse to go away. This does not diminish the impact of the AI revolution. It only gives it reality.

Seldom are revolutions low-cost when the wirings are still exposed. 

We are the Agency for Influence

Discover new ways to drive revenue and build reputation for your brand.

,

More from Avenue Z

Recommended reads

Connect With Us

Stay in touch. Discuss your needs with us and see how we can help.