
There’s a debate happening inside every financial institution right now, and it comes down to a single question: do you use AI to do what you already do, only cheaper — or do you use it to build something your customers never imagined? Forbes put it plainly: the fork in the road is between cost reduction and reinvention. Both paths require significant AI infrastructure investment. But the economics of that investment look very different depending on which path you choose.
Bank of America’s virtual assistant Erica handles roughly 2 million customer conversations per day. JPMorgan Chase is embedding large language models into payment screening and authentication workflows. Boston Consulting Group projects that AI could unlock over $370 billion in annual profit for retail banks by 2030. The scale of adoption is no longer in question — what’s in question is whether the return on that adoption justifies the infrastructure cost, and whether the cost structure is sustainable as AI workloads scale.
The Cost Reduction Path: Efficiency Has a Ceiling
The most common AI strategy in banking is straightforward: take an existing process that’s expensive and make it cheaper. A call centre handling 50,000 inquiries a day becomes a call centre where AI handles 40,000 and humans handle 10,000. The interaction is faster and the cost per unit drops. But the customer experience is recognisably the same — call, wait, talk, hang up. Sometimes the quality drops, as Klarna discovered when it cut nearly half its workforce and faced customer backlash.
The efficiency path has a ceiling. Once you’ve automated the routine queries and reduced headcount, the marginal gains diminish. You’ve made the existing business cheaper, but you haven’t created anything new. And the AI infrastructure costs — the cloud compute, the API calls, the model hosting — don’t disappear when you hit that ceiling. They become a permanent fixture on the balance sheet.
The Reinvention Path: Build What Wasn’t Possible Before
The alternative approach — the one that early-stage fintech companies are taking — starts from the customer’s actual need and builds the experience from scratch around what AI makes newly possible. A client wondering whether a Roth conversion makes sense doesn’t schedule a meeting and wait three weeks. An AI-native platform pulls their financial and tax data, models multiple scenarios, factors in current law, and presents a recommendation in seconds — with the reasoning documented for compliance review.
This isn’t a chatbot answering FAQ questions. It’s a system that combines financial knowledge bases, live data connections, quantitative modelling, conversation history, compliance verification, and multimodal presentation — text, visualisation, voice — into a single interaction. The architecture is fundamentally different from bolting AI onto a legacy call centre. And the compute requirements are fundamentally different too.
The Infrastructure Bill Nobody Forecasted
Both paths lead to the same place: a growing cloud and AI infrastructure bill. The efficiency path runs AI models to classify, route, and respond to customer queries at scale. The reinvention path runs more complex models to analyse financial data, generate personalised recommendations, and power multimodal interfaces. Either way, the compute costs are usage-based, unpredictable, and scaling faster than anyone budgeted for.
The financial institutions handling this well share a few practices. They’re separating AI inference costs from traditional IT infrastructure in their budgets — because the growth curves are completely different. They’re routing simple tasks to cheaper models and reserving expensive frontier models for complex analysis. They’re batching non-real-time workloads (overnight compliance scans, weekly portfolio analysis) to take advantage of lower batch API pricing. And they’re auditing their cloud commitments regularly.
That last point matters more than most finance teams realise. Many institutions — especially fintechs that went through accelerator programmes or signed early cloud commitments — are sitting on unused cloud and AI API credits that are quietly approaching their expiry dates. At the same time, other companies are actively looking for cheaper compute. A growing cloud credits marketplace has emerged to connect the two sides — allowing sellers to recover cash from credits they won’t use and buyers to access compute at below-retail pricing. For any financial institution running AI workloads at scale, it’s worth checking whether you’re a buyer, a seller, or both.
The Strategic Question for Every Financial Institution
The two approaches — cut costs or reinvent the experience — are not mutually exclusive. But they do reflect fundamentally different theories of value. Cost reduction improves margins on existing revenue streams. Reinvention creates new revenue streams by serving customers who were previously underserved because the economics didn’t work.
Large banks investing billions in AI-powered efficiency will capture real savings. Fintechs using AI to reimagine the customer experience from scratch will capture the next generation of clients. But both need to get the infrastructure economics right. The AI strategy that delivers the best customer outcome at a sustainable cost-per-interaction is the one that wins — regardless of which path it takes.
The Bottom Line for Financial Institutions
Whether you’re using AI to cut costs or reinvent the experience, the infrastructure bill is real and growing. Separate AI spend from traditional IT budgets. Route tasks by complexity. Batch what doesn’t need real-time. Audit your cloud commitments. And don’t let unused credits expire when there’s a market for them. The institutions that manage AI infrastructure as a strategic asset — not an unmonitored utility — will have a structural advantage over those that don’t.
