[bmdpat]
All writing
4 min read

When JPMorgan's AI bill goes up, who controls it?

JPMorgan turned on AI for 250k people. The quiet line is that the usage racks up fees. Here is how to control the bill before it arrives.

Share LinkedIn

JPMorgan turned on AI for its entire bank. Around 250,000 employees now have access. CEO Jamie Dimon put it plainly: "more AI people and fewer bankers." This is the biggest bank on earth treating AI as default infrastructure, not a pilot.

The capability numbers are loud. A pitch deck that took four hours now takes about 30 seconds, per Microsoft. Goldman gave an AI assistant to more than 10,000 workers. Morgan Stanley's AskResearchGPT covers 70,000 research reports. A Citigroup study found 54% of financial jobs have high potential for automation. Standard Chartered plans to cut 8,000 jobs by 2030.

But the line that should stop a builder cold came from Bloomberg: bankers' Claude usage is racking up fees. That is the quiet part. When you hand AI to a quarter-million people, the bill stops being a footnote. It becomes a board-level number.

Capability is the easy part now

Here is the shift. For years the hard question was "can the model do this?" That question is mostly answered. The hard question now is "what does it cost to do it at scale, and who is watching that meter?"

Usage is trivial to turn on. One config flag, one rollout, and suddenly every employee can call a model a hundred times a day. Nobody calls it once. They retry. They re-run. They paste the whole document in again because it was faster than scrolling. Each call is cheap. The aggregate is not.

JPMorgan has a CFO and a procurement team to absorb that surprise. Most teams shipping AI features do not.

The same trap hits a two-person team

I build AI agents for a living. The failure mode is identical at every scale.

You ship a feature that calls a model. It works in the demo. You launch. A week later someone wires it into a loop, or a user finds a way to trigger it on every page load, or a retry path has no ceiling. The token count goes vertical. You find out when the invoice lands, not when it happens.

The bank version is 250,000 seats. The startup version is one runaway background job. The shape is the same: spend that grows faster than anyone is tracking, with no hard stop in the path.

Controlling the bill means a runtime ceiling

You cannot fix this with a dashboard you check on Mondays. By Monday the money is gone. Cost control for AI has to live in the request path, before the call goes out.

Three things actually move the number:

Budgets per goal. Decide what one completed task is worth and refuse to spend past it. If a job blows its budget, it stops and tells you, instead of grinding through 40 retries.

Per-key and per-user caps. One bad actor or one buggy loop should not be able to drain the account. Cap each key so the blast radius is one key.

Rate limits at the agent level. Not just the provider's limit. Your own limit, enforced in your code, so a runaway process hits your wall first.

None of this is exotic. It is the same discipline you already apply to database connections and outbound API calls. AI spend just did not have it baked in yet.

Why the spend will keep climbing

The headcount numbers tell you the spend is not a one-time spike. A Citigroup study found 54% of financial jobs have high potential for automation. Standard Chartered plans to cut 8,000 jobs by 2030. Dimon's "more AI people and fewer bankers" is not a slogan. It is a budget shift from salaries to model usage.

When work moves from a salaried person to a metered API call, your cost stops being fixed and starts being variable. Variable cost scales with usage, and usage at a bank only goes up. A task that used to cost a flat salary slice now costs whatever the agents decide to spend that day.

That is the trade every team is quietly making when it ships AI. You swap a predictable cost for a flexible one. Flexible is good for capability and bad for forecasting, unless you put your own ceiling on it. The ceiling is the thing that turns a variable cost back into a number you can plan around.

The takeaway

JPMorgan's rollout is the signal, not the story. Enterprise AI spend is going vertical, and "the usage racks up fees" is now a headline at the biggest bank in the world. The teams that survive the next round of margin pressure will not be the ones with the fanciest models. They will be the ones who put a ceiling on spend before the bill arrived.

If you are shipping AI features and you do not have a hard cap in the request path, you are running JPMorgan's experiment without JPMorgan's balance sheet.

I built AgentGuard for exactly this: a runtime budget, token, and rate limiter for AI agents. It sets a hard ceiling so a runaway agent stops instead of draining your account. Check it out at https://bmdpat.com/tools/agentguard.

Want more like this?

AI agent builds, real costs, what works. M-F only when there is something worth sending. No fluff.

PH

Patrick Hughes

Building BMD HODL — a one-person AI-operated holding company. Nashville, Tennessee. Twenty-Two agents.

More writing