TL;DR

A late-June Thorsten Meyer AI report says High Bandwidth Memory has become the main pressure point in the 2026 memory squeeze. The report says HBM demand from AI accelerators is taking fab capacity from DDR5 RAM and GDDR7 graphics memory, with supply sold out through 2026.

A late-June report from Thorsten Meyer AI says High Bandwidth Memory has become the component driving the 2026 memory squeeze, as AI accelerator demand pulls fab capacity away from DDR5 RAM and GDDR7 graphics memory.

The report describes HBM as a vertical stack of DRAM dies, linked by through-silicon vias and mounted near a GPU on an interposer. That layout gives AI accelerators roughly five to ten times the bandwidth of standard graphics memory, helping chips such as Nvidia’s H100, H200 and B200 feed data fast enough to keep compute units busy.

The same design makes HBM costly in fab capacity. Thorsten Meyer AI says one bit of HBM consumes roughly three to four times the wafer area of one bit of DDR5, because stacked dies are larger, yields are harder and a single defect can spoil a full tower. The report estimates per-stack prices at about $200 for HBM3, $300 for HBM3E and around $500 for HBM4.

The report says SK Hynix leads the HBM market, with Samsung attempting a 2026 comeback and Micron sold out for 2026. It also says all three major suppliers had qualified for HBM4 by June 2026, shifting the race from basic qualification to volume, yield and execution.

At a glance
reportWhen: Published in late June 2026; market con…
The developmentA late-June Thorsten Meyer AI report identifies HBM as the component now shaping memory pricing, availability and some GPU supply in 2026.
AI Dispatch · Reality Check · The Memory Squeeze · Part 2 of 10

HBM ate the fab

The thing the factories make instead of your RAM is a tower of stacked memory bolted to every AI chip. In three years it went from niche part to the component that sets the price of nearly all the world’s memory — and now a chunk of its GPUs.

What it is — and why it’s so wafer-hungry
BASE LOGIC DIE
8–16 DRAM dies · TSVs · 1 stack

A tower, not a sheet

HBM stacks DRAM dies vertically, links them with thousands of through-silicon vias, and sits beside the GPU to deliver 5–10× the bandwidth of normal graphics memory. AI is bandwidth-bound — without it, the world’s most expensive silicon sits starved for data. But stacking is inefficient: one HBM bit eats 3–4× the wafer area of DDR5, and one defect can ruin a whole tower.

≈ 8 HBM stacks wrap every AI GPU
The annual arms race — faster, denser, dearer
HBM3
~819 GB/s
per stack · the H100 era
~$200 / stack
HBM3E
~1.18 TB/s
2026 workhorse · H200, B200
~$300 / stack  (+20% for ’26)
HBM4
~2.8 TB/s
new logic base die · Nvidia “Rubin”
~$500 / stack (est.)
The three-horse race for the most coveted chip
SK Hynix
~50–62%
the leader; ~90% of its HBM goes to Nvidia
Samsung
~28–40%
2026 comeback; qualified for Rubin HBM4
Micron
~5–10%
sold out for 2026; HBM4 for inference chips
June 2026: all three qualified for HBM4 — the question shifts from “can you ship?” to “who ships best?”
−30–40%
It didn’t just eat your RAM — it ate your GPU too. With suppliers prioritizing HBM, the GDDR7 memory consumer cards need went short; Nvidia reportedly cut RTX 50-series production by a third or more in H1 2026.
The take

This isn’t artificial scarcity — AI really is bandwidth-bound, HBM really is the fix, and it really does eat 3–4× its weight in fab capacity. The discomfort is structural: one component, coupled to one customer’s demand, now sets the price of nearly all memory and a slice of GPUs. The market is now $35B → ~$100B by 2028, ~41% of all DRAM revenue (was 8% in 2023), and sold out through 2026. The one hope: with all three suppliers finally racing on HBM4, competition can add supply. The matching risk: if AI demand corrects, HBM is where it breaks first. Next: DDR5 now, DDR6 soon.

Sources: Silicon Analysts; Introl; TrendForce; DigiTimes; Unibetter; Astute Group; Reuters. Per-stack pricing is estimated/point-in-time; bandwidth per JEDEC/vendor specs. As of late June 2026, fast-moving.
thorstenmeyerai.com

AI Demand Reprices Memory

The shift matters because HBM competes for the same broad manufacturing base that supplies ordinary RAM and parts of the consumer GPU market. If memory makers can earn far more from AI-bound HBM stacks than from commodity modules, the report says fab allocation naturally moves toward higher-margin HBM.

That pressure is no longer limited to servers. Thorsten Meyer AI says shortages of GDDR7 memory, used in consumer graphics cards, contributed to Nvidia reportedly cutting RTX 50-series production by a third or more in the first half of 2026. The report frames the issue as structural supply pressure, not a simple inventory hiccup.

Amazon

High Bandwidth Memory HBM3 module

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

From Niche Stack to Bottleneck

HBM moved from a specialty part to a central AI component within three years, according to the report. HBM3 powered the H100 era at roughly 819 GB/s per stack, while HBM3E reached about 1.18 TB/s and became the 2026 workhorse for chips such as the H200 and B200.

The next generation, HBM4, is expected to reach roughly 2.8 TB/s per stack and use a new logic base die, with Nvidia’s Rubin platform cited as a major target. Thorsten Meyer AI, citing Silicon Analysts, Introl, TrendForce, DigiTimes, Unibetter, Astute Group and Reuters, says the HBM market could grow from $35 billion to about $100 billion by 2028.

“HBM Ate the Fab”

— Thorsten Meyer AI report

Nvidia Tesla P100 900-2H400-0000-000 GPU Computing Processor - 16 GB - HBM2 - PCIE 3.0 X16 (Certified Refurbished)

Nvidia Tesla P100 900-2H400-0000-000 GPU Computing Processor – 16 GB – HBM2 – PCIE 3.0 X16 (Certified Refurbished)

GPU Computing Processor

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Pricing and Output Remain Fluid

Several figures remain estimates or reported ranges, including HBM4 per-stack pricing, supplier market shares and the scale of any RTX 50-series production cuts. The report says its pricing data is point-in-time and that market conditions were fast-moving in late June 2026.

It is not yet clear how quickly HBM4 qualification will turn into stable volume, how much yield will improve, or whether AI accelerator demand will keep rising at the same pace. A demand pullback would likely hit HBM suppliers before the rest of the memory market.

GIGABYTE GeForce RTX 5080 Gaming OC 16G Graphics Card, WINDFORCE Cooling System, 16GB 256-bit GDDR7, GV-N5080GAMING OC-16GD Video Card

GIGABYTE GeForce RTX 5080 Gaming OC 16G Graphics Card, WINDFORCE Cooling System, 16GB 256-bit GDDR7, GV-N5080GAMING OC-16GD Video Card

Powered by the NVIDIA Blackwell architecture and DLSS 4

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

HBM4 Shipments Set the Pace

The next milestone is the HBM4 ramp for Nvidia Rubin and other AI platforms. Memory buyers will be watching whether SK Hynix, Samsung and Micron can turn qualification into enough supply to ease pressure on DDR5, GDDR7 and next-generation memory pricing.

Patriot Viper Venom DDR5 RAM 16GB (2X8GB) 6000MT/s CL36 1.35v UDIMM Desktop Gaming Memory Kit Compatible with Intel XMP/AMD EXPO - PVV516G600C36K

Patriot Viper Venom DDR5 RAM 16GB (2X8GB) 6000MT/s CL36 1.35v UDIMM Desktop Gaming Memory Kit Compatible with Intel XMP/AMD EXPO – PVV516G600C36K

Capacity: 16GB(2 x 8GB)

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

What is HBM?

High Bandwidth Memory is stacked DRAM placed very close to an accelerator, giving AI chips much higher data bandwidth than standard graphics memory. It uses vertical die stacks and through-silicon vias to move data quickly.

Why does HBM affect DDR5 supply?

The report says HBM uses far more wafer area than DDR5 for the same bit capacity. When manufacturers assign more wafers to AI memory, fewer are available for commodity RAM.

Are gaming GPUs affected?

According to the report, GDDR7 shortages have affected consumer graphics cards as suppliers prioritize HBM production. Nvidia reportedly reduced RTX 50-series output in the first half of 2026, though the exact scale remains reported rather than confirmed by the company in the source material.

Which companies make HBM?

The report names SK Hynix, Samsung and Micron as the three main suppliers. It says SK Hynix leads, Samsung is trying to regain share in 2026, and Micron is sold out for 2026.

What could ease the memory squeeze?

More HBM4 capacity, better yields and stronger supplier competition could ease pressure. The report says the main risk is that AI demand keeps absorbing capacity faster than memory makers can add it.

Source: Thorsten Meyer AI

You May Also Like

industrial kinetic displays shape hybrid exhibition and automotive service space in tbilisi

A Tbilisi automotive service center has been redesigned with industrial materials and kinetic displays, creating a hybrid environment for vehicle care, retail, and exhibitions.

Purchase order exception tracker for small manufacturers

A new purchase order exception tracker for small manufacturers is set to be tested as a workflow solution to manage supplier issues amid supply volatility.

Trade and supply-chain operations signal monitor: US-Iran talks to begin Sunday in Switzerland as Tehran closes the strait over Lebanon fi

US-Iran negotiations are set to start Sunday in Switzerland, with Tehran closing the strait over Lebanon, impacting global trade and supply chains.

Warranty claim packet builder for appliance repair shops

A new tool designed to streamline warranty claims for independent appliance repair shops is entering testing, promising better documentation and reduced rework.