A Premier on AI Value Chain

Executive Summary

Where Is Value Captured Today?

As of mid-2026, the largest concentration of profit sits in two layers:

Silicon and chips: In FY 2026, NVIDIA generated $215.9B with 71-75% margins. TSMC captured 70% of advanced node segment of a global foundry market with 62% margins.
Cloud infrastructure: Microsoft's AI business surpassed a $37B annual revenue run rate. AWS, Azure, and Google Cloud collectively represent a $250+ billion market.

Commoditization Risks

Foundation models: Token prices have fallen ~600-fold since 2020. Economy-tier models have a price half-life of 1.1 years.
Middleware & Tools: Commoditizing quickly, driven by open-source alternatives.
Thin-wrapper applications: Face severe margin compression as underlying models improve.

The AI Value Chain in One Simple Mental Model

AI is not just a model. It is a factory system. Every link in this chain must work.

Silicon & Chips

Cloud & Data Center

Data Infrastructure

Foundation Models

Middleware & Tools

AI Applications

Services & Adoption

The Seven-Layer AI Value Chain Map

The AI value chain has seven primary layers, built from the ground up. Each has different economics and competitive dynamics.

1

Silicon & Chips

Physical Infrastructure

What it does: Designs and manufactures the specialized processors (GPUs, TPUs) required for heavy AI workloads.

Why it matters: It is the foundational hardware constraint; without these chips, no AI processing or model training can happen.

Players: NVIDIA, TSMC, ASML, AMD, Broadcom (Google's TPU)

Margins: Highest (53-78%)

Commoditization Risk: Low. Designing frontier AI chips requires billions in R&D. NVIDIA holds 80%+ of AI training share while TSMC holds ~70% of Advanced Node segment of global foundry market share. These are among the deepest moats in technology.

2

Cloud & Data Centers

Infrastructure + Platform

What it does: Houses the servers, procures the massive energy needed, and delivers computing power over the internet.

Why it matters: Scales AI access globally and absorbs the immense capital expenditure (CapEx) required to build AI factories.

Players: AWS, Azure, Google Cloud, Equinix

Margins: High (38-60%)

Commoditization Risk: Low to Medium. Hyperscalers benefit from enormous scale, multi-year contracts, and data gravity.

3

Data Infrastructure

Software + Data

What it does: Manages, stores, cleans, vectors, and structures the massive datasets needed to train and run models.

Why it matters: High-quality data is the raw fuel for AI. Poor data pipelines lead to hallucinations and failed enterprise deployments.

Players: Snowflake, Databricks, Scale AI, Pinecone

Margins: High (50-68%)

Commoditization Risk: Medium. Basic storage is commoditized, but curated data and vector DB performance remain differentiated.

4

Foundation Models

Intelligence Layer

What it does: Creates the core, general-purpose AI brains (LLMs, vision models) trained on massive internet-scale data.

Why it matters: Determines the baseline intelligence, reasoning capabilities, and processing power of all downstream applications.

Players: OpenAI, Anthropic, Google, Meta (Llama), DeepSeek

Margins: Medium (<50%, Declining)

Commoditization Risk: High & Rising. Token prices have fallen ~600x since 2020. Open-weight models now match proprietary models on many benchmarks.

5

Middleware & Tools

Software Layer

What it does: Provides the software frameworks to connect raw models to databases, user interfaces, and agentic workflows.

Why it matters: Makes it possible for standard developers to actually build functional, stable apps on top of unpredictable raw models.

Players: LangChain, Hugging Face, Weights & Biases

Margins: N/A

Commoditization Risk: High. Open-source dominates. Orchestration frameworks are converging rapidly.

6

AI Applications

User-facing Layer

What it does: Delivers the final, user-facing software products (like AI assistants or specialized enterprise tools).

Why it matters: This is where AI actually interfaces with users to solve real-world human and business problems.

Players: Cursor, Perplexity, Microsoft Copilot, Harvey

Margins: Margins: High (40-60%)

Commoditization Risk: Medium to High. High for generic apps; lower for deeply integrated vertical applications with proprietary data.

7

Services & Adoption

Services Layer

What it does: Provides the human consulting, implementation strategy, and change management expertise to organizations.

Why it matters: Bridges the gap between technology existence and actual enterprise value realization (ensuring ROI).

Players: McKinsey, BCG, Accenture, Deloitte

Margins: Lowest (30-40%)

Commoditization Risk: Medium. Human expertise is hard to replicate, but AI may automate some consulting tasks.

How the Chain Works: An Enterprise Example

To understand how these layers interact in practice, consider a realistic scenario: A Fortune 500 bank deploying an AI customer support agent. Here is how the entire ecosystem participates in bringing that use case to life.

7

The Strategy (Services Layer)

The bank hires consulting firms like Accenture or McKinsey to design the AI transformation roadmap, identify the highest-ROI use cases, and ensure compliance with strict financial regulations.

6

The Interface (App Layer)

The bank's developers build a secure chat interface, perhaps utilizing platforms like Microsoft Copilot Studio or a custom-built React frontend, for their customer service representatives to interact with.

5

The Orchestration (Middleware Layer)

They use frameworks like LangChain to securely connect the chat interface to the bank's internal databases, ensuring the AI grounds its answers in real, proprietary customer data.

4

The Brain (Model Layer)

The application sends the user's prompt (scrubbed of PII) to a foundation model like OpenAI's GPT-4o or Anthropic's Claude 3.5 via API to reason through the financial query and generate a response.

3

The Memory (Data Layer)

The bank's millions of policy documents are stored, chunked, and retrieved rapidly as vectors using a specialized data platform like Databricks or Pinecone.

2

The Compute (Cloud Layer)

All of this software routing, API hosting, and vector database storage runs reliably in a massive Microsoft Azure or AWS data center.

1

The Hardware (Silicon Layer)

Ultimately, the Azure data center processes that specific API request using clusters of NVIDIA H100 GPUs, which were physically manufactured in Taiwan by TSMC.

Money Flow Across the Value Chain

A structural breakdown of the Money Flow. Notice how the economic value flows backward: originating from the enterprise paying for adoption and apps, filtering all the way down to the physical silicon.

1

Consulting

Enterprise Adoption

Professional services paid

2

MS Copilot

App Delivery

SaaS subscription paid

3

OpenAI / Azure

Model Inference

API / per-use compute paid

4

OpenAI, Anthropic

Model Training

Base training compute paid

5

Azure, AWS

Data Center Ops

Cloud infrastructure paid

6

Dell, SMCI

Server Assembly

Hardware sales paid

7

TSMC

Chip Fabrication

Manufacturing fees paid

8

NVIDIA

Chip Design

Silicon margins captured

The key observation: Multiple companies get paid for a single user query. Because the capital originates at the Enterprise Adoption layer and trickles down, the companies closest to the physical infrastructure at the bottom (NVIDIA, TSMC, Hyperscalers) capture the most concentrated economic value from the entire chain above them.

The Inference Cost Battleground

Understanding the difference between training and inference is critical for investors. Training is an enormous, one-time fixed cost (up to $1B+ for frontier models). Inference is the ongoing variable cost. As AI moves into production, inference is becoming the dominant workload and cost center.

Key Economic Dynamics

Token price collapse unlocks usage: Prices have fallen 600x since 2020, but enterprise spend tripled in a year (Jevons paradox).
Agentic AI multiplies consumption: A single task may call the model 50–500 times.
Outcome-based pricing is the future: E.g., Intercom Fin ($0.99/resolved ticket). It decouples revenue from the commodity input (tokens) and ties it to value.

Pricing Models

API / Per-Token Directly scales, but revenue declines as token prices fall.

Subscription / Seat-based Predictable, but heavy users destroy margins.

Outcome-based Insulated from token price deflation; charges for actual value.

Value Capture vs. Value Destruction

Where Profits Accrue

The "Picks and Shovels" Strategy

Selling Picks and Shovels is more reliably profitable than mining gold. NVIDIA, TSMC, and Hyperscalers get paid regardless of whether individual AI apps succeed.

Proprietary Data & Workflow

Companies with real AI value possess proprietary data advantages, deep workflow integration, and measurable customer outcomes (e.g., Hebbia AI for Finance, Harvey AI for Legal).

Where Value is at Risk

Model Price Compression

Token prices falling 600x directly destroys revenue potential for model providers. OpenAI lost an estimated $5B in 2025 despite $12B+ in revenue.

Thin Wrapper Applications

Apps adding an interface over an API are extremely vulnerable. When providers improve models natively, wrappers lose their reason for existence.

Current and Potential Bottlenecks

The constraints governing the speed and scale of the AI revolution.

Power and Energy

The largest physical constraint. Data center grid connection lead times extend 24–72 months. A single NVIDIA rack consumes 120+ kW. Power grid infrastructure was not built for this scale.

Advanced Chip Fabrication

TSMC controls >90% of chips at 7nm and below. All major AI accelerators depend on a single company's fabs, concentrated in a geopolitically sensitive region.

High-Bandwidth Memory (HBM)

A global shortage from the top three suppliers (SK Hynix, Samsung, Micron) constrains overall GPU production and deployment capacity.

The "Data Wall"

The risk that easily accessible, high-quality human-generated data is becoming exhausted, forcing reliance on expensive synthetic data generation or bespoke labeling.

Where The Real Opportunities Lie in 2026-2031

The most attractive opportunities over the next five years span infrastructure optimization, enterprise deployment, and domain-specific applications.

1. AI Inference Infrastructure

What it is: Companies making AI inference faster, cheaper, and more efficient through custom silicon, routing, and caching.

Why it is Valuable: Inference will be the dominant AI workload. Token consumption is exploding, making inference cost the primary battlefield.

Key Players: Groq, Together AI, Fireworks AI, Baseten, NVIDIA (Vera Rubin platform), Google & Amazon custom ASICs.

2. Enterprise Integration & Data Prep

What it is: Tools and services helping enterprises prepare data, integrate AI into workflows, and move from pilot to production.

Why it is Valuable: 95% of enterprises report no meaningful AI ROI yet. Bridging the gap between technology and actual workflow is the largest unmet need.

Key Players: Databricks, Snowflake, Scale AI, Accenture, Deloitte, Palantir.

3. Vertical AI in Regulated Industries

What it is: Purpose-built AI applications for specific industries like healthcare, legal, finance, and manufacturing.

Why it is Valuable: Vertical apps command premium prices by solving specific high-value problems using proprietary data that horizontal tools cannot match.

Key Players: Harvey (legal), Abridge (healthcare documentation), Elicit (scientific research), Palantir AIP (defense).

4. AI Safety & Observability

What it is: Tools helping organizations monitor, govern, and control AI systems (tracing agents, detecting hallucinations).

Why it is Valuable: As autonomous agentic AI scales, oversight becomes mission-critical. Regulatory pressure (e.g., EU AI Act) enforces mandatory demand.

Key Players: LangFuse, Arize Phoenix, Braintrust, Datadog, Splunk.

5. Physical AI and Robotics

What it is: AI systems that operate in the physical world—autonomous vehicles, humanoid robots, industrial automation, and drones.

Why it is Valuable: Physical AI represents the next massive wave after software AI, driven by new foundation model capabilities in spatial and world reasoning.

Key Players: Waymo, Tesla, Boston Dynamics, UBTECH, NVIDIA Isaac platform.

Investor Lens: What Matters for Investment Decisions

Evaluating capital allocation across the AI Value Chain requires focusing on defensibility and economic fundamentals rather than hype.

Focus on Hard Moats

Seek companies constrained by physical realities (foundries, power grids) or deep technical expertise (chip design). High switching costs and capital intensity create more defensible barriers than easily replicable software wrappers.

Monitor the CapEx Gap

Watch the correlation between hyperscaler infrastructure capital expenditures and actual enterprise downstream software revenue. If enterprise value doesn't justify the build-out, a massive market correction is imminent.

Track Pricing Power

Invest in layers with inherent pricing power (currently infrastructure and silicon). Be highly cautious allocating capital to layers facing severe commoditization, open-source alternatives, and rapid price compression (e.g., raw foundation models).

Strategic Scenarios (Next 5 Years)

Bull Case: Demand Accelerates

AI adoption broadens rapidly. Agentic AI proves transformational, driving token consumption 10–100x higher. Global AI infra spend exceeds $1.5T by 2029.

Winners NVIDIA (continued chip demand), TSMC, Cloud hyperscalers, Power/Energy companies, AI application leaders, Enterprise AI services firms.

Sufferers Companies that underinvested in AI, traditional software vendors that failed to adapt, commodity model providers with no differentiation.

Signals to Watch Accelerating hyperscaler capex, enterprise AI budgets growing faster than expected, agentic AI delivering measurable ROI, inference demand outpacing efficiency gains.

✓

Base Case: Monetization is Uneven (Most Likely)

Infra spending grows but moderates (20-30% by 2029). Enterprise adoption accelerates in tech/finance but stalls elsewhere. Model commoditization compresses startup margins.

Winners NVIDIA and TSMC (sustained but decelerating growth), Microsoft and AWS (cloud growth), vertical AI leaders, Enterprise data platforms.

Sufferers AI model companies without profitability paths, overfunded AI startups without product-market fit, enterprises with failed AI pilots that didn't scale.

Signals to Watch Hyperscaler capex growth moderates, pilot-to-production conversion improves slowly, token prices continue falling while consumption grows, high-profile startup failures.

Bear Case: Spending Slows

Enterprise ROI disappointments trigger budget cuts. Echoes of the 2000 telecom fiber overbuild. Regulatory restrictions constrain deployment. Hyperscaler capex writedowns.

Winners Big Tech (can absorb losses via diversified revenue), essential infrastructure (TSMC, ASML), value-oriented investors buying discounted assets.

Sufferers NVIDIA (revenue deceleration), speculative AI infrastructure plays (CoreWeave), VC-dependent startups, companies with heavy capex and negative free cash flow.

Signals to Watch Hyperscaler capex guidance cuts, GPU utilization declining, startup down-rounds increasing, budgets freezing, rising "AI hasn't worked for us" sentiment.

Final Synthesis

The Most Important Bottleneck

Power and energy. Everything else can be solved with money and talent. You cannot will new power plants into existence faster than physics and regulation allow.

The Strongest Moat

TSMC's fabrication. 70% global foundry share, 62%+ margins, and no viable competitor within years at the leading edge. The revolution depends on their factories.

The Biggest Risk

The investment-to-revenue gap. If downstream enterprise revenue does not materialize to justify the $830B upstream capex, significant overcapacity and write-downs will follow.

What Companies Should Do

Invest in data infra and processes. The winners will have clean data, clear workflows, and governance frameworks, not just the most impressive AI demos.

AI is a factory system. The winners are not always those with the best models. The winners are the companies that control scarce resources, reduce real business costs, and capture durable economic value.

Follow the bottlenecks. Follow the margins. Follow the lock-in.

Core Thesis

Executive Summary

Where Is Value Captured Today?

Commoditization Risks

The AI Value Chain in One Simple Mental Model

The Seven-Layer AI Value Chain Map

Silicon & Chips

Cloud & Data Centers

Data Infrastructure

Foundation Models

Middleware & Tools

AI Applications

Services & Adoption

How the Chain Works: An Enterprise Example

The Strategy (Services Layer)

The Interface (App Layer)

The Orchestration (Middleware Layer)

The Brain (Model Layer)

The Memory (Data Layer)

The Compute (Cloud Layer)

The Hardware (Silicon Layer)

Money Flow Across the Value Chain

Enterprise Adoption

App Delivery

Model Inference

Model Training

Data Center Ops

Server Assembly

Chip Fabrication

Chip Design

The Inference Cost Battleground

Key Economic Dynamics

Pricing Models

Value Capture vs. Value Destruction

Where Profits Accrue

The "Picks and Shovels" Strategy

Proprietary Data & Workflow

Where Value is at Risk

Model Price Compression

Thin Wrapper Applications

Current and Potential Bottlenecks

Power and Energy

Advanced Chip Fabrication

High-Bandwidth Memory (HBM)

The "Data Wall"

Where The Real Opportunities Lie in 2026-2031

1. AI Inference Infrastructure

2. Enterprise Integration & Data Prep

3. Vertical AI in Regulated Industries

4. AI Safety & Observability

5. Physical AI and Robotics

Investor Lens: What Matters for Investment Decisions

Focus on Hard Moats

Monitor the CapEx Gap

Track Pricing Power

Strategic Scenarios (Next 5 Years)

Bull Case: Demand Accelerates

Base Case: Monetization is Uneven (Most Likely)

Bear Case: Spending Slows

Final Synthesis

The Most Important Bottleneck

The Strongest Moat

The Biggest Risk

What Companies Should Do