alt.hn

5/20/2026 at 9:02:11 PM

GPU Memory Math for LLMs: Formula That Tells You What Fits on Your GPU

https://theahmadosman.substack.com/p/gpu-memory-math-for-llms-2026-edition

by XMasterrrr

5/20/2026 at 10:44:08 PM

This isn't very useful.

V of context is not equal across models.

Also, huggingface tells you how big the model is for the exact one you have in your hand, why the weird guesswork? Dynamic quants are not going to magically fit some formula.

by DiabloD3

5/20/2026 at 9:48:38 PM

This is super useful. Most of the time I go to run a model off Hugging Face on my 64GB MBP I run into issues where I drastically overestimated what it could do. :>

by metadat