NVIDIA and cloud (AWS, Azure, GCP): profitable. Google and Meta AI-enhanced ads: profitable. Model labs (OpenAI, Anthropic): ~30% gross margin on inference but training costs consume it. Not yet consistently GAAP profitable. HBR (2025): unproven path to operating profit.

Infrastructure Basics12 min read

How Do AI Companies Make Money? The Real Revenue Story in 2026

Q: How does OpenAI make money?

OpenAI: API token pricing (GPT-4.1: $5/M input, $15/M output), ChatGPT Plus ($20/mo), enterprise contracts. Estimated 2025 ARR: ~$4B. Target 2026: $30B. Gross margin on inference: ~30%. Training costs consume most gross profit. Source: TLDL 2026, Epoch AI 2025.

Q: How much does it cost to use an AI API per query?

GPT-4.1: $5/M input + $15/M output. Typical query (1,500 in + 500 out): ~$0.015. 10-page doc analysis: ~$0.04. Mistral Large: ~$2/M input. Open-source self-hosted: near-zero marginal cost. Anthropic Claude, Google Gemini: $3-15/M depending on tier.

Q: Why do so many AI companies lose money?

Frontier model training costs $100M-$1B+, consuming the ~30% gross margin from inference. At $4B ARR and 30% margin, gross profit is ~$1.2B. Training + operating costs can match or exceed that. Epoch AI (2025): model-level margins positive, company-level operating profit near zero.

Q: What is the difference between how OpenAI and Google make money from AI?

OpenAI: sells model access via subscriptions ($20/mo) and API tokens. AI is the product. ~$10/user/year. Google: uses AI to enhance ad targeting for 4B users generating ~$100/user/year. Also sells Gemini Advanced ($19.99/mo) and Cloud APIs. Ad-enhanced revenue dominates. Source: New Yorker 2024.

Q: How much does Microsoft Copilot cost and how does it generate revenue?

Copilot for M365: $30/user/month. GitHub Copilot: $10-19/user/month. Copilot is a 27% premium add-on over Office 365 E3. Microsoft: ~400M Office users. 10% conversion = ~$14.4B annualised Copilot revenue. AI features in Word, Excel, Teams, Outlook.

Q: How does NVIDIA make money from AI?

NVIDIA sells GPUs (H100, H200, B200) for AI training and inference to OpenAI, cloud providers, and enterprises. Data center revenue: >$100B FY2024. Model: hardware sales paid upfront, not per-query. Cloud providers (AWS, Azure, GCP) are largest customers.

By Amara|Updated 4 July 2026

How AI companies make money 2026: four-layer revenue stack from GPU hardware through cloud infrastructure to model APIs and AI-enhanced advertising

Key Numbers

$4B

Estimated OpenAI ARR in 2025, with a stated target of $30B for 2026

TLDL AI Companies Landscape 2026

$100B+

NVIDIA data center revenue in fiscal year 2024, driven by AI chip demand from model labs and cloud providers

NVIDIA Q4 FY2024 earnings

$10

Revenue per ChatGPT user per year, one-tenth what Google earns per user through AI-enhanced advertising

New Yorker, Financial Page 2024

~30%

Estimated gross margin on running frontier AI models like GPT-4, before R&D and training costs

Epoch AI, 2025

$30/user

Microsoft Copilot list price per enterprise Microsoft 365 user per month

Microsoft 2025 pricing

Key Takeaways

1AI companies make money through four main channels: API usage fees charged per token, monthly subscriptions ($20/month consumer, $30/user enterprise), multi-year enterprise contracts, and selling the compute infrastructure AI runs on. The most profitable layer today is hardware and cloud, not model access.
2OpenAI estimated $4B ARR in 2025 at approximately 30% gross margin on inference. That gross profit is largely consumed by training the next model generation, which is why AI labs are revenue-rich but not yet consistently GAAP profitable at the company level.
3ChatGPT generates approximately $10 per user per year. Google generates $100 per user per year from the same population. That 10x gap explains why every major AI company prioritises enterprise contracts over consumer scale: one $1M enterprise deal replaces 100,000 consumer subscribers.

ChatGPT has over 900 million weekly active users. It generates approximately $10 per user per year. Google reaches 4 billion users and generates $100 per user per year, almost entirely through advertising that AI makes more accurate. That gap is the defining fact of AI business models in 2026.

There are four layers in the AI revenue stack, and they do not earn money at the same rate. At the bottom, chip makers and cloud providers selling the compute AI runs on are already generating large, confirmed profits. In the middle, model providers like OpenAI and Anthropic are growing fast but spending their gross profits on training the next generation. At the top, applications built on AI are growing quickly but competing intensely on price.

According to analysis by Epoch AI, running a frontier AI model can generate a gross margin of around 30%, which sounds healthy until you account for the cost of training the next model. A single training run for a frontier model now costs hundreds of millions of dollars. The margin from one year of inference can be consumed by one training campaign. Understanding which businesses are actually profitable, and why, requires looking at each layer of the stack separately.

1The Four Layers of the AI Revenue Stack
2API Token Pricing: How Pay-Per-Query Revenue Works
3Subscriptions and Enterprise Contracts: Where Bulk Revenue Sits
4Hardware and Cloud Infrastructure: The Most Profitable AI Business
5Are AI Companies Profitable? The Unit Economics Explained
6The Per-User Revenue Gap: Why AI Needs Enterprise More Than Consumer
7AI Revenue Projections: What Analysts Forecast Through 2030

The Four Layers of the AI Revenue Stack

AI revenue does not flow from a single source. It flows through a stack of four distinct layers, each with different margins, business models, and profitability profiles.

Layer one is hardware and compute infrastructure. NVIDIA sells the GPUs that train and run AI models. Cloud providers (AWS, Azure, Google Cloud) rent that compute by the hour. These are the most clearly profitable AI businesses in 2026. NVIDIA's data center revenue exceeded $100 billion in fiscal year 2024, driven almost entirely by demand for H100 and H200 GPUs from AI labs and cloud providers (NVIDIA Q4 FY2024 earnings). The cloud providers add their own margins on top of hardware cost, selling managed AI infrastructure to companies that cannot or do not want to run their own GPU clusters.

Layer two is foundation model access. OpenAI, Anthropic, Google DeepMind, xAI, and Mistral sell access to large language models via APIs (per-token pricing), subscriptions ($20-30/month per user), and enterprise contracts. This is the most visible layer publicly but is not yet the most profitable.

Layer three is AI-enhanced software. Microsoft sells Copilot as an add-on to Office 365 at $30 per user per month. Google bundles Gemini into Workspace at $10-30 per user per month depending on tier. These are AI features layered onto already-profitable software businesses.

Layer four is AI-optimised advertising. Google and Meta use AI to improve ad targeting, content ranking, and fraud detection. This is the least visible form of AI monetisation and arguably the most profitable. Google generates approximately $100 per user per year (New Yorker, 2024) largely through advertising made more valuable by AI.

Revenue Layer	Who Earns Here	Business Model	Profitability (2026)
Hardware and compute	NVIDIA, AMD, cloud providers	GPU sales, hourly compute rental	High: NVIDIA >$100B data center revenue
Foundation model access	OpenAI, Anthropic, xAI, Mistral	API tokens, subscriptions, enterprise	Mixed: ~30% gross margin, not yet GAAP profitable
AI-enhanced software	Microsoft, Google Workspace	SaaS add-ons and premium tier pricing	High: layered onto profitable existing products
AI-optimised advertising	Google, Meta	Ad targeting improvements, no separate cost	Very high: pure margin enhancement on existing revenue

The practical implication: if you use ChatGPT, you are in layer two. If you use Google Search, you are in layer four. The company making the most money from AI in both cases is NVIDIA, which sold the hardware that processes your query.

API Token Pricing: How Pay-Per-Query Revenue Works

The API layer is where AI models are sold as a metered utility. Companies pay per token, where one token is roughly three to four characters of text. Pricing varies by model tier and has been falling steadily as competition increases.

As of March-April 2026, the major model providers charge the following for their flagship models:

Model	Input per million tokens	Output per million tokens
GPT-4.1 (OpenAI)	$5.00	$15.00
GPT-4o (OpenAI)	$2.50	$5.00
Claude 4 Sonnet (Anthropic)	~$3.00-5.00	~$15.00
Gemini 1.5 Pro (Google)	~$3.50	~$10.50
Mistral Large (Mistral)	~$2.00	~$6.00

To put those numbers in context: a typical business document processing task, reading a 10-page contract (~8,000 input tokens) and producing a 500-token summary, costs approximately $0.05 at GPT-4.1 pricing. At one million documents per day, that is $50,000 per day or $18 million per year, which explains why legal, insurance, and financial services are among the largest enterprise API customers.

The Subscription Math

A typical ChatGPT conversation uses roughly 1,500 input tokens and 500 output tokens. At GPT-4.1 pricing, that is about $0.015 per conversation in raw compute cost. A ChatGPT Plus subscriber paying $20 per month and having 100 conversations generates approximately $1.50 in compute cost, a 13x revenue-to-compute ratio at the conversation level. The actual gross margin across the full user base is far lower because of infrastructure overhead, the free-tier users who generate cost without revenue, and the expense of running other model types, but it illustrates that the per-conversation economics of subscriptions are not inherently broken.

OpenAI has cut API prices multiple times since 2023. GPT-4-class models that cost $60 per million tokens at launch now cost $5-15 per million. This price compression is driven by hardware efficiency improvements (newer GPU generations run inference cheaper), model optimisation (quantisation and distillation reduce compute per token), and competitive pressure from Anthropic, Google, and open-source alternatives. Cheaper tokens mean more usage but narrower margins on each unit sold.

Subscriptions and Enterprise Contracts: Where Bulk Revenue Sits

For the companies with the most AI revenue, subscriptions and enterprise contracts matter more than public API access. Consumer subscriptions at $20/month generate predictable recurring revenue. Enterprise contracts at six to eight figures annually are where the revenue concentrates.

Consumer Subscriptions

The major AI consumer subscription prices as of Q1-Q2 2026:

Product	Price	Provider	Key features
ChatGPT Plus	$20/month	OpenAI	GPT-4.1, DALL-E 3, Sora access, priority capacity
Claude Pro	$20/month	Anthropic	Claude 4, priority access, extended context
Gemini Advanced	$19.99/month	Google	Gemini Ultra, bundled with 2TB Google One storage
GitHub Copilot	$10-19/month	Microsoft	Code completion, chat in IDE, PR summarisation
Copilot for M365	$30/user/month	Microsoft	AI in Word, Excel, Teams, Outlook
ChatGPT Team	~$25-30/user/month	OpenAI	Admin controls, higher limits, workspace features

The $20/month price point is not accidental. OpenAI set it in early 2023 and Anthropic matched it. At a roughly 30% gross margin on inference, $20/month covers the compute cost of a reasonably active user and leaves margin for overhead.

Enterprise: The Largest Revenue Pool

Enterprise contracts are where AI companies generate their largest individual deals. ChatGPT Enterprise is sold on custom pricing that typically involves per-seat licensing plus usage commitments. OpenAI has confirmed enterprise customers across financial services, legal, and technology sectors but does not publish specific contract values.

For Microsoft, Copilot at $30/user/month represents a 27% price premium over the common Office 365 E3 license ($22/user/month). Microsoft has an estimated 400 million Office 365 users. If 10% convert to Copilot, that is 40 million users at $30/month, or $14.4 billion in annualised Copilot revenue. Analysts treating Microsoft's AI monetisation as the most credible near-term enterprise AI revenue story are pointing at exactly this conversion math (Liontrust AI Revenue Report, 2026).

"We make money from our API, from subscriptions, and from our enterprise products. The challenge is that compute costs are enormous, and we are reinvesting everything into the next generation." (Sam Altman, CEO OpenAI, Lex Fridman Podcast, March 2024)

Hardware and Cloud Infrastructure: The Most Profitable AI Business

The clearest, most confirmed AI revenue story is not in model subscriptions. It is in the hardware and cloud infrastructure that model training and inference require. For a look at what that infrastructure physically looks like and how much power it consumes, see AI data center power consumption.

NVIDIA's data center segment generated over $100 billion in fiscal year 2024. That is more than the estimated combined ARR of every frontier AI model lab in existence.

"We are at the beginning of a new industrial revolution where companies are racing to build AI factories." (Jensen Huang, CEO NVIDIA, GTC Keynote, March 2024)

NVIDIA's role as the primary supplier of AI accelerators means it collects revenue from every layer of the stack: model labs training new models, cloud providers running inference, and enterprises deploying private AI infrastructure.

The Picks-and-Shovels Calculation

TLDL AI Companies Landscape 2026 estimates combined ARR across the major frontier model labs:

OpenAI: approximately $4 billion ARR (2025)
Anthropic: approximately $1 billion ARR (2025)
xAI: approximately $100 million ARR (2025)
Mistral AI: approximately $100 million ARR (2025)
Combined: approximately $5.2 billion

NVIDIA's data center revenue alone was over $100 billion in fiscal 2024. The company selling hardware to train and run AI models earns roughly 20 times more than all frontier AI model companies combined. This is the picks-and-shovels dynamic from gold rush history applied to AI: the companies selling equipment to the miners profit more than the miners.

Cloud Provider AI Revenue

AWS, Microsoft Azure, and Google Cloud all generate substantial revenue from AI infrastructure, though none publishes a standalone AI revenue line:

AWS runs Trainium (training) and Inferentia (inference) chips on its platform, offers Amazon Bedrock for managed model access, and rents NVIDIA GPU time from its GPU-equipped instances.
Microsoft Azure hosts OpenAI models under an exclusive partnership and resells API access; the Intelligent Cloud segment grew 21% year-over-year in FY2025 with AI explicitly cited as a growth driver.
Google Cloud sells Vertex AI and GPU-enabled compute; Google Cloud revenue grew 28% year-over-year in Q4 2024, with AI workloads cited as a primary driver.

NVIDIA and the cloud providers are, as of 2026, the clearest financial beneficiaries of the AI investment cycle. That money flows from AI labs training models, from enterprises deploying inference workloads, and from cloud providers building their own AI infrastructure to serve both groups. Understanding what AI data centers actually are helps clarify why the infrastructure layer captures so much of the revenue.

AI Revenue Comparison by Company: Confirmed and Estimated 2025 (USD Billions)

NVIDIA is confirmed reported revenue (FY2024). All other figures are analyst estimates from TLDL AI Companies Landscape 2026 and have not been independently audited.

AI Revenue Comparison by Company: Confirmed and Estimated 2025 (USD Billions)
Category	Value	Unit
NVIDIA (data center)	100	B USD
OpenAI (est. ARR)	4	B USD
Anthropic (est. ARR)	1	B USD
xAI (est. ARR)	0.1	B USD
Mistral (est. ARR)	0.1	B USD

Are AI Companies Profitable? The Unit Economics Explained

The answer depends entirely on which layer you are asking about.

Hardware and cloud providers (NVIDIA, AWS, Azure, Google Cloud): profitable. These companies report real operating income. NVIDIA's data center gross margins are above 70%. Cloud AI is a positive-margin business for all three major hyperscalers.

Big Tech using AI to enhance advertising (Google, Meta): profitable. AI makes existing ad platforms more effective with no separate cost line on the income statement. This is the purest form of AI monetisation.

Independent model labs (OpenAI, Anthropic, xAI, Mistral): growing revenue, not yet consistently GAAP profitable. Research from Epoch AI explains the mechanism clearly.

The Gross Margin Trap

"Each model generation can be profitable at the gross-margin level, but total company profits may still be negative because R&D for the next model generation is so expensive." (Epoch AI, Can AI Companies Become Profitable?, 2025)

According to Epoch AI's analysis of AI company economics, running a frontier model can achieve a gross margin of approximately 30%. At $4 billion in revenue, that is around $1.2 billion in gross profit from inference. That sounds viable. The problem is the cost of the next model.

Training GPT-4 was estimated to cost between $100 million and $500 million. Training frontier models in 2025-2026 costs more. Anthropic has reportedly spent over $1 billion on model training in a single year. If gross profit from running the current model is $1.2 billion and training the next one costs $1-2 billion, the company is investing all its operational gross profit into the next generation of R&D, leaving operating income near zero.

The Number Most Guides Don't Show

Here is the calculation that makes the economics clear. OpenAI prices GPT-4.1 at $5 per million input tokens and $15 per million output tokens. A typical API query uses roughly 2,000 tokens total. The compute cost per query at those prices, assuming 30% gross margin, means OpenAI earns $0.02 per query in revenue and retains $0.006 in gross profit.

To generate $1 billion in gross profit from API calls at $0.006 per query, OpenAI would need approximately 167 billion queries annually. At 900 million weekly users each making one query per week, that is 46.8 billion queries per year. Reaching $1 billion in gross profit from API economics alone requires approximately 3.5 times the current ChatGPT query volume at current pricing, before accounting for free-tier queries that generate no revenue.

This is a scale problem, not a fundamental economics failure. AI query volume is growing fast. But it explains why OpenAI needs subscription revenue, enterprise contracts, and continued external capital alongside API revenue. Any company relying on consumer API revenue alone is unlikely to reach operating profit at current prices.

The Per-User Revenue Gap: Why AI Needs Enterprise More Than Consumer

The most revealing comparison in AI business models is per-user revenue, and the numbers are stark.

ChatGPT has over 900 million weekly active users. According to analysis cited in The New Yorker's Financial Page (2024), it generates approximately $10 per user per year. Alphabet reaches approximately 4 billion users and generates approximately $100 per user per year, almost entirely through advertising that AI makes more effective. Meta reaches 3.5 billion users and generates approximately $70 per user per year through the same mechanism.

ChatGPT has a similar-sized user base to a large ad platform. It monetises at one-tenth the rate.

This gap is structural. Google and Meta have had decades to build the advertising infrastructure that turns user attention into revenue: ad auction systems, measurement tools, agency relationships, and targeting data developed across billions of interactions. ChatGPT is three years old.

AI chat is also not an advertising surface in the same way. A Google search results page runs four to eight paid ads. A ChatGPT response does not (yet) run ads. Until AI assistants introduce advertising or another high-volume per-user monetisation mechanism, the revenue-per-user gap with ad platforms will persist.

The enterprise math is more direct. A single enterprise contract at $1 million per year replaces 100,000 consumer subscribers generating $10 per year each. A contract at $10 million replaces 1 million subscribers. This is why every major AI company's commercial strategy in 2025-2026 concentrates on enterprise: the per-customer revenue is orders of magnitude higher, and the churn rate is lower.

According to NVIDIA's State of AI 2026 survey, 88% of enterprises report that AI has increased annual revenue, and 87% say it has reduced annual costs. That two-sided value proposition is the sales argument that justifies large enterprise contracts that small per-user consumer fees cannot replicate.

Revenue Per User Per Year: AI Chat vs Ad Platforms (2024)

Google and Meta figures are total advertising revenue divided by monthly active users. ChatGPT figure is an estimate based on reported ARR and user count, not a disclosed per-user revenue figure.

Revenue Per User Per Year: AI Chat vs Ad Platforms (2024)
Category	Value	Unit
Google (Alphabet)	100	$/user/year
Meta	70	$/user/year
ChatGPT (OpenAI)	10	$/user/year

AI Revenue Projections: What Analysts Forecast Through 2030

The market size projections for AI vary widely depending on how "AI revenue" is defined, but the directional trend from every major analyst firm is the same: large and fast-growing.

Grand View Research estimates the global AI market at approximately $196 billion in 2023, projecting growth to around $1.8 trillion by 2030, a compound annual growth rate of approximately 37-38%.

McKinsey estimates that generative AI could add $2.6 to $4.4 trillion annually to the global economy across use cases, though this is economic impact including productivity gains and cost savings, not vendor software revenue.

Goldman Sachs projects that AI could drive a 7% increase in global GDP and lift productivity growth by approximately 1.5 percentage points annually over a decade.

Gartner forecasts AI software markets expanding at over 20% CAGR through 2030, with foundation model platforms and AI application services among the fastest-growing segments.

For individual companies, the near-term revenue targets circulating in 2026 are:

Company	2025 ARR Estimate	2026 Revenue Target	Source
OpenAI	~$4B	$30B	TLDL AI Companies Landscape 2026
Anthropic	~$1B	$12B	TLDL AI Companies Landscape 2026
Google (AI-attributed)	$5B+ explicitly attributed	Not separately disclosed	TLDL and Google filings
xAI	~$100M	Not disclosed	TLDL
Mistral	~$100M	Not disclosed	TLDL

The gap between 2025 ARR and 2026 targets is striking. OpenAI's $30 billion target represents roughly a 7x increase on 2025 ARR in a single year. Most independent analysts treat those targets as aspirational rather than firm forecasts. The more conservative projection that analysts broadly agree on: AI software and cloud AI services will be a multi-hundred-billion-dollar market by the late 2020s, with infrastructure and cloud capturing the largest confirmed revenue, and model access companies still establishing sustainable unit economics.

Frequently Asked Questions

How does OpenAI make money?

OpenAI makes money through three main channels: API access (charging per token for GPT-4.1 at $5/M input and $15/M output), consumer subscriptions (ChatGPT Plus at $20/month and ChatGPT Team at roughly $25-30/user/month), and enterprise contracts (ChatGPT Enterprise on custom pricing for large organisations). Industry estimates place OpenAI's 2025 ARR at approximately $4 billion, with a stated 2026 revenue target of $30 billion (TLDL AI Companies Landscape 2026). Gross margin on inference is estimated at approximately 30%, but model training costs for the next generation consume most of that gross profit, leaving overall operating income near zero.

Is AI profitable?

Profitability depends on which layer you are asking about. Hardware and cloud infrastructure (NVIDIA, AWS, Azure, Google Cloud) are clearly profitable. NVIDIA's data center revenue exceeded $100 billion in fiscal 2024 with gross margins above 70%. Big Tech companies using AI to enhance advertising (Google, Meta) are also profitable, because AI improves their ad platforms with no separate cost line. Independent frontier model labs (OpenAI, Anthropic, xAI) generate $1B to $4B in annual revenue but are not yet consistently GAAP profitable: they run approximately 30% gross margins on inference but can spend $1B or more of that gross profit on training the next model generation, leaving operating income at or near zero. The Harvard Business Review analysis (2025) describes AI model labs as having compelling revenue growth but an unproven path to consistent operating profit.

How much does it cost to use an AI API per query?

The cost depends on the model and the length of the request and response, measured in tokens. At GPT-4.1 pricing ($5/M input, $15/M output), a typical query with 1,500 input tokens and 500 output tokens costs approximately $0.015 (1.5 cents). A 10-page document analysis using 8,000 input tokens costs about $0.04. More affordable models are significantly cheaper: Mistral Large starts at around $2/M input, and open-source models run on self-hosted hardware cost near zero per query in marginal terms. Google Gemini and Anthropic Claude are priced in a similar range to GPT-4-class models, roughly $3-15 per million tokens depending on input vs. output and model version.

Why do so many AI companies lose money?

Most frontier AI companies lose money at the operating level because training the next model costs more than the gross profit from running the current one. At an estimated 30% gross margin on inference (Epoch AI, 2025), a company with $4 billion in revenue generates around $1.2 billion in gross profit. Training a frontier model in 2025-2026 costs hundreds of millions to over $1 billion. Operating expenses including staff, offices, and non-compute costs add further. The result: revenue grows fast, gross margins are positive, but total operating profit is near zero because the business continuously reinvests in the next generation. This pattern is structurally similar to pharmaceutical R&D: high development costs, positive product margins, delayed total profitability until scale is reached.

What is the difference between how OpenAI and Google make money from AI?

OpenAI makes money by selling direct access to its models: subscriptions ($20/month), API tokens ($5-15/million), and enterprise contracts. AI is OpenAI's core product. Google primarily makes money through advertising, and uses AI to make that advertising more valuable: better search ranking, more accurate ad targeting, and more relevant content recommendations increase the value of ads served to 4 billion users. Google generates approximately $100 per user per year (New Yorker, 2024), compared to approximately $10 per user per year for ChatGPT. Google also sells direct AI access through Gemini Advanced ($19.99/month) and Google Cloud AI APIs, but this is a smaller share of total revenue than AI-enhanced advertising.

How much does Microsoft Copilot cost and how does it generate revenue?

Microsoft Copilot for Microsoft 365 is priced at $30 per user per month, added on top of existing Office 365 or Microsoft 365 licenses. GitHub Copilot costs $10 to $19 per user per month depending on the plan. Copilot functions as an AI add-on to Microsoft's existing software stack: Word, Excel, Teams, Outlook, and PowerPoint gain AI-generated draft suggestions, meeting summaries, and data analysis features. The $30/user/month price is a 27% premium over the common Office 365 E3 license price. If Microsoft converts 10% of its estimated 400 million Office 365 users to Copilot, that generates approximately $14.4 billion per year in additional subscription revenue, making it one of the largest discrete AI revenue opportunities in enterprise software.

How does NVIDIA make money from AI?

NVIDIA makes money from AI by selling the GPUs and AI accelerators used to train and run AI models. Its H100, H200, and B200 GPU series are the standard hardware for frontier model training at OpenAI, Anthropic, Google, Meta, and Microsoft. NVIDIA also sells NVLink networking, DGX servers, and its CUDA and TensorRT software stack. Data center revenue exceeded $100 billion in NVIDIA's fiscal year 2024, its largest and fastest-growing segment. Cloud providers (AWS, Azure, Google Cloud) are NVIDIA's largest customers, buying GPUs to rent back to AI companies and enterprises. NVIDIA does not charge per query or per token: it sells hardware upfront, capturing value regardless of whether AI products built on that hardware become profitable.

AI Data Centers

What Are AI Data Centers? The Full 2026 Breakdown

10 min read

AI Energy

AI Data Center Power Consumption: The Real Numbers for 2026

11 min read