5/24/2026 at 6:32:39 PM
An interesting implication of this is that AI inference and training has a path to a ~3x hardware cost reduction (and maybe ~2x total cost reduction) without any technical innovation whatsoever, we just need to wait for dram supply to meet demand (either by manufacturing scaling or just waiting for the current rate of manufacturing to fill the demand spike).by gpm
5/24/2026 at 7:23:44 PM
The memory makers will not expand demand drastically. It is in the nature of their business to keep the market under-supplied, otherwise the following oversupply will kill them. Instead, supply is just rerouted from less profitable segments such as mobile and personal computing.by radialstub
5/24/2026 at 7:40:26 PM
China is about to flood the market and prove this notion wrong. If there is demand they want to meet it with supply.But to your point, that is exactly how American companies like to play now. No one is stopping them from screwing over the consumer.
I have a Micron near me and they are building another chip facility but we are years away still so I suspect China will beat them to the punch.
by brandensilva
5/24/2026 at 7:56:03 PM
Yeah, more global competition in DRAM would be great.SK Hynix and Samsung are South Korean.
by shdh
5/24/2026 at 7:48:04 PM
I suspect Chinese factories will get built first, but quality may take a few years to really nail down.Basically:
China floods the market with cheaper but less QA'd parts, makes a gazillion dollars, is able to spend said money to fix yields / QA issues and streamline operations, by the time that happens Micron and maybe a few other existing players will have new memory production, and then we'll have a flood of cheap, reliable memory. 4yr, maybe?
by esseph
5/24/2026 at 7:36:28 PM
Supply and demand always balance out. There is no way manufacturers aren’t going to compete away these inflated margins, as long as they feel like this demand is sustainable.by ec109685
5/24/2026 at 7:56:40 PM
Only in the most naive sense.If it costs you $1B and five years to build out new supply and you think demand will not sustain for more than three years, it does not make sense to expand supply.
Instead you will maintain your margins currently and await demand to decrease back to your current supply.
This is pretty common and as others have pointed out is even more common in markets where competition is slow and lead times are long.
Ammunition is a great example over the last decade or so as political turnover caused relatively short lived demand spikes and manufacturers didn't expand supply because they knew once political winds shift, demand would decrease.
by cco
5/24/2026 at 7:47:40 PM
You know there's other strategies? Companies can be more clever than naively undercutting each other...Memory in particular ... https://en.wikipedia.org/wiki/DRAM_price_fixing_scandal
The entry-cost to getting into memory is on the order of $billions and years - you can do just about anything...
by kristopolous
5/24/2026 at 7:38:40 PM
There's very few manufacturers, I believe 3 globally? And there's a large moat. Nobody can compete with them in the next 10 years. It's really not hard to coordinate action between 3 companies.by array_key_first
5/24/2026 at 7:47:59 PM
There are trillions to be made. That moat won't be as insurmountable in hindsight.by KeplerBoy
5/24/2026 at 7:58:05 PM
There really aren't though. The reason there's only three is because memory is a commodity and margins are historically very low. It's not a very good business to be in, generally.In the past when memory supply was short and then rebounded, many companies went out of business because making memory was no longer profitable.
by array_key_first
5/24/2026 at 7:34:53 PM
Apple could always decide to build their own fab or some such thing.by jayd16
5/24/2026 at 7:29:34 PM
It sure looks like Sam Altman's masterful gambit to corner the memory market has had unforeseen consequences.by overfeed
5/24/2026 at 7:46:27 PM
Is any of this actually unforeseen? Buying the vast majority of the world’s supply of something does have mostly predictable consequences.by roxolotl
5/24/2026 at 6:51:32 PM
What’s the lifespan/refurbishability of the capex elements like the “GPU” modules or even the DRAM soldered into them?by Waterluvian
5/24/2026 at 7:28:29 PM
For lifespan, AWS is still running a ton of T4 GPUs from 2018, that power a lot of computer vision models. A ton of these will have a long life, not all ML is about frontier LLMs.by jmalicki
5/24/2026 at 7:15:16 PM
I wonder if we will see an adoption of alternative floating point formats. IEEE floats are notoriously terrible at lower widths (<= 16 bits). Floating point formats such as posits do much better at 16 or 8 bits. If you could train at 16 bits per value instead of 32, and suffer a much smaller inaccuracy penalty than you would from IEEE32 to IEEE16...by andrepd
5/24/2026 at 7:10:21 PM
For some reason I still haven't heard any predictions on when new fabs will come online to meet the current demand. This shouldn't be too hard to find out, since the building time of fabs is very predictable process.The difficult question is more whether foreseeable memory demand will remain at the current level, grow even further, or shrink again.
by cubefox
5/24/2026 at 7:19:28 PM
> a path to a ~3x hardware cost reductionReally?
How long do we have to wait until that ... cost reduction hits us?
by shevy-java
5/24/2026 at 7:34:33 PM
For supply to meet demand. Depends very much on how aggressively producers scale and on how demand grows or shrinks.Safe to say at least a year or two. It'd be shocking if it took a decade.
by gpm
5/24/2026 at 7:10:05 PM
Supply will not meet demand. What incentive do the handful of dram manufacturers have to end the party? This is what happens when legal monopolies finally win control. Dont't worry. The patents will expire in a few decades. Our grandkids will see DDR5 get cheap again. The system functions as intended.by sandworm101
5/24/2026 at 7:35:29 PM
Patents is not the issue here. Not even close.The up-front investment of a memory fab is measured in billions, and takes years to construct and get running. The margin on the chips themselves is terrible, so without scale its not worth even trying. DDR5 is a industry standard that takes some effort to conform to, but the licence fees is a drop in the bucket to the cost of creating a fab.
The fabricators were cautious about increasing production, and slow to start planning. It takes further time to build up capacity, and if the demand drops down, they may end up producing dram at a loss when the market flips over to oversupply. The demand whiplash could kill any company that dared betting on increasing production. See the "bullwhip effect" https://en.wikipedia.org/wiki/Bullwhip_effect which has killed semiconductor fabricators before.
There is a discussion to be had about how to maintain national semiconductor production in Europe and US as a strategic industry, but historic attempts have all failed.
by aDyslecticCrow
5/24/2026 at 7:24:48 PM
I have fairly simplistic view of the economics involved here. Could you explain why the ability to sell more chips wouldn't be sufficient enough incentive to increase supply?by fitblipper
5/24/2026 at 7:32:13 PM
Not the person you’re replying to, but RAM has historically been a boom-or-bust business, and companies that invest to meet demand during a boom cycle usually have that new capacity come online just in time for the bust.If it was just variable costs and new capacity was available today they’d do it. But there are substantial fixed costs and delays to increasing capacity, and that uncertainty makes it risky.
by brookst
5/24/2026 at 7:33:31 PM
Bringing on new fabs takes many years and billions of dollars. You're exposing yourself to a lot of risk if you build now and find that the gold rush is over by the time your new capacity is online.by the_snooze
5/24/2026 at 7:08:02 PM
2-3x is completely dwarfed by the remaining improvements in training which is still in its infancy relativelyby eldenring
5/24/2026 at 7:18:18 PM
Probably, but at some point we're very likely to run out of significant training improvements and it's not clear that we'll see that point coming from a long way out.Likewise it's probably dwarfed by improvements in how we make dram - continuing the roughly exponential (maybe a bit less recently) scaling of chips - but not necessarily.
The 2x from returning to previous costs is interesting because it's practically guaranteed, and it's on top of everything else. We're just currently "overpaying" (relative to the stable market price) for the manufacture of dram because of a sudden increase in demand.
by gpm
5/24/2026 at 7:17:53 PM
Unless there's a new paradigm, scaling up is all they can do to improve performance. They've shrunk down all the way to 1-bit models and all the low-hanging fruit is gone. There's no way for them to get much smaller, so they have to get bigger and faster to meet expectations.by BearOso
5/24/2026 at 7:49:03 PM
this is just not true at all, there are massive leaps from algorithms, data, etc. every year. scale is one axis of many and you need to get them all correct.by eldenring