Working thesis · Agent payments

Financial controls for AI agents.

A working thesis on the control gap in agent-driven payments — and the runtime authorization layer the agent economy will need before money moves, not after.

Pressure-test it with me → Proposed solution

The shift

AI agents are beginning to move money on behalf of users and companies. They buy services, pay APIs, book travel, procure resources, and trigger workflows — without a human approving every transaction. That changes the control problem.

The actor is ephemeral

Nothing persistent to hold accountable.

Most financial systems assume the actor behind a transaction is persistent — a person, company, account, cardholder, or merchant that can be investigated after the fact. An agent may be an ephemeral execution context: it can complete a task, initiate a transaction, and disappear.

The model can't be the control

Enforcement has to sit outside the agent.

Instructions inside the model are useful, but they can't be the final control layer for money movement. A probabilistic system can misinterpret, overreach, or act outside its intended scope. For high-consequence actions, financial policy has to be enforced outside the agent itself.

Signals from the market Tailwind

Mar 2026

Over 90% of enterprise leaders expect AI agents to influence 20%+ of online orders by 2027 — and a third think agents could drive over half of all transactions.
Logicbroker · State of Agentic Commerce →
May 2026

Bessemer projects $1T+ in U.S. orchestrated retail revenue by 2030 (up to $3–5T globally) — while flagging payments and post-transaction liability as still-unsolved plumbing.
Bessemer · The Delegated Buyer →
Jun 2026

Recent ecosystem maps of agentic commerce catalog payments, trust, and risk across the stack — but show no authorization or financial-control layer underneath.
Agentic Commerce Frontier · Ecosystem Map →

Evidence the tailwind is real Added as I find it

Why today's controls don't hold

Most financial control is retrospective: money moves, then systems reconcile, audit, dispute, or remediate afterward. That works when mistakes are recoverable and there's a persistent actor to hold accountable.

Agentic payments put pressure on both assumptions. On instant rails, settlement is fast and often irreversible; by the time a problem surfaces, the funds may be gone and the agent that initiated the transaction may no longer exist.

Control has to happen before money moves, not after.

The missing primitive

Issuers and agent-payment networks can approve a single transaction against a spending limit. But a per-transaction limit is a static gate. It doesn't track cumulative authorization — the running answer to how much of an agent's mandate has already been consumed.

What no single participant clearly owns is the state across transactions:

What an agent is allowed to do
Who it's acting for
The time window on that authority
The budget it's consuming
What it has already done
How much authorization remains

That live authorization state is the missing primitive.

The shape of a solution

A runtime financial-control layer between agent decision-making and payment execution. Before money moves, that layer resolves five questions:

01Is the agent who it claims to be?
02What principal, user, or company is it acting for?
03Is it authorized to take this kind of action?
04Is it authorized to act now, under this mandate?
05Does sufficient budget remain after prior actions?

Transactions that fail those checks are blocked before execution. Transactions that pass commit the authorization consumed, budget state, and audit record as one atomic event. Policy is enforced by construction rather than reconstructed later through logs and reconciliation.

At machine scale, control, accounting, and audit stop being separate systems and become one operation.

Why neutrality matters

This state has to stay coherent across agents, issuers, wallets, networks, merchants, and tools — which makes it hard for any single participant to hold alone. Each sees only its own slice:

Card issuer

Sees its own authorizations.

Payment network

Sees activity on its own rails.

Spend platform

Governs what flows through its own product.

Agent framework

Enforces policy inside its own runtime.

None of them necessarily sees the full authorization state once an agent operates across multiple tools, rails, issuers, and payment methods. So the control point likely needs to sit below individual rails — or be implemented as infrastructure that platforms embed when they orchestrate agentic transactions on behalf of users and enterprises.

Where this matters first

The first pressure points are likely platforms that let agents initiate transactions or high-consequence actions for end users or companies:

AI-native fintechs
Spend-management and procurement tools
Corporate travel platforms
Payment infrastructure providers
Banks building agentic-payment products
Enterprise workflow systems with transactional agents

Each will need to prove not just that a transaction settled, but that the agent was authorized to initiate it in the first place.

Questions I'm trying to validate

I'm trying to understand where this problem is already real, where it's still premature, and how teams building agentic products are handling it today.

01When agents initiate transactions or external actions, where is authority enforced today?
02Is authorization state managed inside the agent framework, the application, the payment provider, or a separate control layer?
03Who owns cumulative budget state when an agent spans multiple tools, merchants, issuers, or rails?
04What's the minimum useful primitive — mandates, budget controls, policy checks, audit events, dispute trails, or something else?
05My guess is travel, procurement, and fintech feel this first — does that match, or is it ecommerce, healthcare admin, or enterprise workflow?

Where this comes from

I'm a venture-backed startup CFO who spent about a decade building the controls, governance, and audit systems this thesis is about — from seed through exit. I have an engineering team ready to start work on it. Right now I'm pressure-testing the thesis with people close to the ecosystem: where the gap is real, where it's already being solved, and what agent builders actually need from a control layer.

Close to this problem?

If you're building agentic products — or financing the teams that are — I'd like to compare notes on where the control gap is real today.

Book a call → More writing