The Copilot Fallacy · Sovereign Alpha

The Fallacy

Most institutions I advise are running a first-order strategy for a third-order problem.

Someone signs a Copilot or ChatGPT Enterprise contract. People start using it. Email drafts appear faster, slide decks assemble themselves, research notes take less of the day. Efficiency dashboards tick up. The transformation is declared under way.

This is the Copilot Fallacy: the systematic mistake of treating AI as software to license rather than as intelligence that demands reorganising the work around it. The name is not a critique of any specific product. Swap Copilot for Claude, Gemini, or an in-house model and the fallacy is unchanged, because it lives in the posture, not the tool. The reflex is to buy, deploy, measure per-seat productivity, and stop there.

That reflex is not irrational. Per-seat productivity is what IT procurement is built to defend, and vendor marketing materials are engineered to fit exactly that frame. But it is only the first of three orders of effect. The second and third orders are where competitive outcomes are actually decided, and they are where almost no institution I see is looking.

The rest of this essay is about those three orders: what each one is, why stopping at the first one is a strategic error, and what it takes to run all three.

First-order effects are real. They are also not strategic. Every institution that licenses the same tool captures the same lift, give or take a few points of usage discipline.

1st

Productivity Lift

Individual tasks done faster with AI assistance

Easy to measure, easy to market, available to every competitor who signs the same contract. Real but undifferentiating.

2nd

Role Restructuring

The Capacity Flip: value shifts from producing output to curating it

The nature of the job changes. Without deliberate redesign, the freed capacity is absorbed by noise. With it, the role is reorganised around judgment.

3rd

Architectural Parity

Model access is commoditised. Advantage lives in encoded institutional knowledge, hosted on infrastructure the institution controls.

The only order where durable competitive advantage exists. Also the only one that cannot be procured: it has to be built.

Three orders of AI effect in institutional settings. First-order optimisation produces visible metrics and no strategic position. Second- and third-order work produces strategic position and, in the short term, no metrics leadership knows how to read.

How I Arrived at This View

The argument above is my own, but it is not armchair speculation. It rests on two pieces of work that I ran in parallel through late 2025 and early 2026: a systematic literature review of AI adoption in financial institutions, and a field study that I report only at the aggregate, methodological level here. The purpose of this section is to make the empirical scaffolding visible, not to relitigate the findings.

Systematic Literature Review (PRISMA 2020)

A PRISMA 2020 protocol guided the review across four databases (Semantic Scholar, OpenAlex, CrossRef, Google Scholar) and six specialist journals in financial innovation, digital banking, and electronic markets, covering the period 2019 through 2026. The dual-pathway design was deliberate: a database-only search systematically underrepresents practitioner-oriented contributions that show up in specialist venues the major indexes do not cover well.

Pathway 1 · Database Search

Records Identified

Semantic Scholar, OpenAlex, CrossRef, Google Scholar · ~422 records

↓

Duplicate Removal

DOI matching + title similarity >90% · Output: ~198 unique records

↓

Title / Abstract Screening

6 inclusion + 6 exclusion criteria · 121 excluded · Output: 77 pass

↓

Eligibility Assessment

Abstracts, metadata, domain expertise · 31 excluded · Output: 46 included

Pathway 2 · Targeted Hand-Search

Journals Identified

Financial Innovation, Electronic Markets, Digital Finance, IJBM, JFSM, Journal of Digital Banking · ~86 titles

↓

Duplicate Removal

Deduplication · Output: ~84 unique records

↓

Title / Abstract Screening

~40 excluded · Output: 44 pass

↓

Eligibility Assessment

20 excluded · Output: 24 included

Combined Qualitative Synthesis

Total studies included · Database: 46 · Hand-search: 24

Fig. 1 — PRISMA 2020 dual-pathway flow diagram. Database search (n = 46) and targeted hand-search across six specialist journals (n = 24). Total: 70 studies in qualitative synthesis.

Field Study and Analytical Structure

In parallel with the literature work I ran a field study with professionals across equity research, corporate finance, sales and trading, buy-side investment management, and adjacent functions, between late 2025 and early 2026. The specific content of those conversations stays confidential: nothing in this essay is attributable to any individual or institution. What I do surface here is the analytical structure that emerged from coding the material using a standard Gioia approach.

That structure produced five aggregate dimensions. Three of them map cleanly onto the three orders of effect and become the three pillars discussed later in this essay. Two emerged inductively and function as boundary conditions on the transformation sequence rather than additional pillars.

1st Order

Education / Creator-to-Curator Transition

Role Identity Shift · AI Competency Development · Redefinition of Analyst Value

How the cognitive role of the professional changes when AI absorbs the production layer, and what has to be learned for the curator role to be performed competently.

2nd Order

Process Redesign / Capacity Flip

Routine Task Burden · Strategic Time Liberation · Economic Urgency

How workflows have to be restructured so that the capacity freed by AI ends up in the higher-value half of the role rather than in additional routine throughput.

3rd Order

Encoded Judgment / Sovereign Knowledge Architecture

Foundation Model Commoditisation · Proprietary Knowledge Encoding · Sovereign Infrastructure

How competitive differentiation has to be rebuilt around the model in a world where model access itself is commoditised, and why the architecture layer is the only durable one.

Boundary

Institutional Embedding

Human Judgment Boundaries · Regulatory Integration · Organisational Legitimacy

How regulatory constraints (BaFin, MiFID II, EU AI Act), legacy infrastructure, and compliance workflows determine the pace and shape of any of the three orders.

Boundary

Relational Infrastructure

Client Trust Architecture · Human-AI Collaboration Design · Market Sentiment

How trust-based client relationships and tacit market sentiment set a ceiling on what can be encoded, and redirect freed capacity toward relational rather than productive activity.

Fig. 2 — Aggregate dimensions from the Gioia analysis. Three map onto the three orders of effect; two emerged inductively as boundary conditions. First-order codes and direct attributions are withheld here to protect the confidentiality of the field study.

The two boundary conditions matter because they set the bounds inside which the three orders have to be navigated. Institutional embedding (regulation, legacy infrastructure, approval topology) sets the pace. Relational infrastructure (client trust, tacit market feel) sets a ceiling on what can be encoded and a direction for where freed capacity is most productively redeployed. Neither is an additional pillar. Both condition how the three-pillar sequence has to be adapted to each institution's situation.

1stFirst-Order Effects: The Productivity Lift

First-order effects are the ones you can put on a quarterly slide. An analyst drafts a note materially faster. A banker responds to mail in a fraction of the previous time. A developer accepts a meaningful share of Copilot suggestions. Every vendor in the AI stack publishes studies claiming some version of these numbers, and many of them are directionally right.

The uncomfortable fact is that first-order effects are not strategic. They are individual and additive. In a market where access to foundation models is priced by the API call and available to anyone with a credit card, a faster email is not a moat. It is a capability your competitor can license on the same day as you, by calling the same vendor, at the same price. The lift is real. The differentiation is not.

Worse, first-order metrics distract from the structural question. When leadership looks at the Copilot dashboard and sees adoption curves rising, the inference is: the programme is working. But the programme is only generating value of a kind every competitor can replicate by signing the same contract. The activity is real. The position is not.

The Copilot Fallacy, in its purest form, is the decision to optimise at the first order of effect only, treating the second and third orders as someone else's problem or a later phase that never comes. Once the procurement is in place and the dashboards are wired up, the institution has proven to itself that it is "doing AI," and the organisational energy for the harder work evaporates.

2ndSecond-Order Effects: Role Restructuring and the Capacity Flip

Second-order effects appear when enough people in a role use AI that the role itself starts to deform.

In knowledge work, the deformation has a direction. The routine, labour-intensive portion of a job, the data aggregation, the first-draft writing, the format-and-file cycle, collapses toward zero marginal time. What remains is what foundation models cannot reliably do without a human in the loop: interpretive judgment, contextual calibration, the recognition of when an output is wrong in a way that matters. Value redistributes from the ability to produce output toward the ability to curate and shape it.

I call this the Capacity Flip: the structural inversion of time allocation between routine and strategic work once AI is properly embedded. Before the flip, a financial analyst might spend roughly three quarters of the working day on routine production, a quarter on interpretive work. After the flip, the ratio inverts. The job title is unchanged. The job is not.

Before: Content Creator

Routine Production (~75%)

Strategic Work (~25%)

→

The
Capacity
Flip

After: Output Curator

Routine / AI-Supervised (~25%)

Strategic Work (~75%)

Value defined by the ability to…

Produce analysis
Aggregate data manually
Draft reports
Format presentations

Accountable for the process

Value defined by the ability to…

Transform AI output into insight
Apply interpretive judgment
Develop contrarian positions
Deepen client engagement

Accountable for the outcome

The Capacity Flip. Ratios are illustrative. The point is the direction of the inversion, not a specific percentage.

The flip does not happen automatically. It happens when the institution redesigns the workflow to capture the freed capacity and redirect it into the higher-value activity. Without that deliberate redesign, freed capacity gets eaten by meeting creep, reporting overhead, or simply more of the same routine work done in parallel. The productivity dashboard still rises. The job is still the old job. Nothing second-order has happened.

The Copilot Fallacy, at the second order, is the failure to notice that the point of AI-driven efficiency is to rebalance what the role is for, not to produce more of what the role used to do. Second-order transformation requires naming the flip, committing to it, and making it the basis for how the role is hired, measured, and promoted. That is an organisational intervention, not a technological one, and in my experience it is the point at which most AI programmes stall.

3rdThird-Order Effects: Architectural Parity and Encoded Judgment

Third-order effects appear at the market level, once second-order restructuring has diffused across enough institutions.

The decisive observation at this order is what I call architectural parity. Three, four, maybe five foundation models sit at the frontier at any given time, and every serious institution has access to all of them. The marginal capability difference between the top models is real but narrow, and it is shrinking. Model choice, in strategic terms, is commoditising.

If the model is not where advantage lives, then advantage has to live in what is encoded around the model: proprietary valuation frameworks, sector-specific analytical methods, institutional quality standards, accumulated judgment about which questions to ask, which inputs to trust, and which shortcuts are unacceptable. All of this exists in any serious institution. In most of them, it exists tacitly: in the heads of senior people, in unwritten stylistic rules, in templates no one has formalised, in conventions every junior professional absorbs by osmosis and none of them could articulate.

Third-order transformation is the systematic conversion of that tacit institutional knowledge into structured, AI-readable form, hosted on infrastructure the institution controls. I call this encoded judgment. Two things follow from it.

First, encoded judgment is what differentiates institutional AI output from commodity AI output. A vendor's generic model, pointed at a generic prompt, cannot replicate a methodology the institution itself has never written down. Once the methodology is encoded, the output carries the institution's signature in a way no external system can reproduce, and the case for human sign-off becomes defensible rather than ceremonial.

Second, encoded judgment compounds. Every additional piece of institutional expertise that gets structured into the system deepens its reach. Every application of the system surfaces cases where the encoding is incomplete, driving the next cycle of encoding. A procured generic tool produces no such compounding. It depreciates at the pace of the underlying model and reverts to commodity capability when the vendor releases the same integration to everyone else.

Sovereignty is the architectural complement. If the institution's differentiating knowledge is encoded into vendor infrastructure it does not control, the differentiation is rented, not owned, and it is rented from a counterparty with a commercial interest in keeping the tenant dependent. The Copilot Fallacy, at the third order, is the assumption that procurement decisions at the vendor level do not foreclose architectural decisions at the institutional level. They do. Every prompt that embeds proprietary methodology into a third-party API is a small forfeiture of sovereignty, and in aggregate these forfeitures amount to giving the vendor the thing that was supposed to distinguish you.

If foundation model access is commoditised, competitive advantage cannot come from model selection. It has to come from what is encoded around the model, in infrastructure the institution controls.

Sovereign AI Mastery: The Synthesis

The three orders of effect map onto three organisational interventions, in sequence. Taken together, they are what I call the Three-Pillar Model.

The Copilot Fallacy
(Orders of Effect)

1st Order

Productivity Lift

Individual tasks accelerated. Measured, marketed, undifferentiating.

2nd Order

Role Restructuring

The Capacity Flip. Creator becomes curator.

3rd Order

Architectural Parity

Model access commoditised. Advantage lives in encoded judgment.

answered by

→

Three Pillars
(Sequential Interventions)

Pillar 1

Education

Teach the role what AI actually does to it. Creator-to-curator transition.

Pillar 2

Process Redesign

Restructure the workflow to capture the Capacity Flip.

Pillar 3

Encoded Judgment

Convert tacit institutional knowledge into sovereign infrastructure.

maps to

→

Dynamic Capabilities
(Micro-Foundations)

Sensing

Recognise how AI changes the task environment.

Seizing

Reconfigure workflows to capture the shift.

Transforming

Build distinctive, compounding knowledge assets.

enables

→

Sovereign AI Mastery

Outcome

Sovereign AI Mastery

The institutional capacity to autonomously govern, leverage, and evolve AI across all three orders. The organisation commands its own transformation rather than being directed by vendors.

Three orders of effect, three pillars, three Dynamic Capabilities micro-foundations, one outcome. The sequence is load-bearing: each pillar creates preconditions for the next.

The sequence reflects a dependency structure, and the sequencing is not negotiable. Education is the work of teaching a role what AI actually does to it, and it is what makes the second-order shift intelligible to the people who have to live it. Process redesign, the deliberate restructuring of the workflow to capture the Capacity Flip, creates the operational substrate for the third order. Encoded judgment, the architectural layer, is where sovereignty is built.

Invert the order and each intervention misfires. Invest in tools before redesigning the work and the tools optimise the wrong output. Redesign the work before the people understand what they are replacing and the redesign happens around a process no one is equipped to govern. Encode judgment into infrastructure the institution does not control and the sovereignty never materialises. The Three-Pillar Model is not a menu.

Three Diagnostic Questions

Three questions, one per order, let any institution locate itself on this path:

Can professionals critically evaluate and improve AI-generated output, or do they accept it uncritically? (1st order · Education)
Have workflows been redesigned around human-AI complementarity, or is AI bolted onto existing processes? (2nd order · Process Redesign)
Is proprietary knowledge encoded in sovereign infrastructure the institution controls, or does it live inside generic vendor solutions? (3rd order · Encoded Judgment)

Each order, tackled alone, fails in a predictable way.

1st Order

Education

Creator-to-Curator Transition

Can professionals critically evaluate AI output?

2nd Order

Process Redesign

The Capacity Flip

Have workflows been redesigned for human-AI complementarity?

3rd Order

Encoded Judgment

Sovereign Knowledge Architecture

Is proprietary knowledge encoded in sovereign infrastructure?

All three orders

Sovereign AI Mastery

Full sequence

The institutional capacity to autonomously govern, leverage, and evolve AI. The organisation commands its own transformation.

Skip 1st Order Sophisticated systems no one can govern

Skip 2nd Order Informed professionals trapped in legacy workflows

Skip 3rd Order Efficient but undifferentiated automation

Full sequence: Sovereign AI Mastery

Failure modes. Each order alone breaks in a predictable way. Only the full sequence produces Sovereign AI Mastery.

What This Means

The productivity dashboard is real. It is also insufficient. The institutions that will matter in five years are not the ones with the highest Copilot adoption curves today. They are the ones that have already absorbed what the Capacity Flip requires of their roles, and that have already begun the architectural work of encoding what they know into infrastructure they control.

Architectural parity is the condition. Encoded judgment is the response. Sovereignty is the outcome. Anything short of that is first-order optimisation in strategic clothing, which is the Copilot Fallacy in three words.

The mistake I see most often is the belief that the first order is where transformation is happening, because the first order is where it is visible. The metric that is easiest to produce is almost never the metric that matters. Second- and third-order effects do not announce themselves on a quarterly basis. They show up, slowly, in which institutions are still relevant in the next decade and which ones are not.

The transformation challenge is not technological. It is organisational: the capacity to govern, leverage, and evolve AI as a sovereign institutional competence rather than a procured service dependency.

From Framework to Implementation

The Copilot Fallacy names what most AI strategies get wrong. The Three-Pillar Model shows what the correction requires. If your organisation is in the middle of this transition and needs a framework for the architecture layer, we can help.

Schedule a Conversation

Also read: When Agents Become the Trading Desk · When the Middle Disappears