Cheap cloud AI was never going to last. The off-ramp wasn't built for everyone
The era of cheap cloud AI is coming to an end as companies shift to usage-based billing, leaving many users unprepared for the increased costs. Heavy users have been consuming significantly more compute than their subscription fees cover, leading to a bifurcation between enterprise and consumer users. The transition to self-hosted open-source models presents a viable alternative, but requires technical skills that many users lack.
- ▪GitHub Copilot will move to usage-based billing starting June 1, 2026.
- ▪Anthropic is capping Claude Code and splitting it from the subscription model on June 15, 2026.
- ▪Heavy users have been consuming about $8 of compute for every $1 of subscription revenue, which is no longer sustainable for the labs.
Opening excerpt (first ~120 words) tap to expand
← All posts19 May 2026·5 min read·AIPricingAnthropicSelf-hostingCheap cloud AI was never going to last. The off-ramp wasn't built for everyone.On June 1, GitHub Copilot moves to usage-based billing. On June 15, Anthropic caps Claude Code. The flat-fee era is ending in a two-week window. Heavy users have been consuming roughly $8 of compute per $1 of subscription revenue; the labs aren't covering that anymore. The off-ramp – self-hosting open-source models – exists, but charges a skill toll that excludes the user base the labs onboarded. Who gets left where, and was the door ever meant to stay open? Anthropic just announced self-hosted sandboxes and MCP tunnels for Managed Agents.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at Hacker News (AI / LLM).