
No, that sounds right. 24GB isn’t enough to feasibly run a 27B-parameter model. The rule of thumb is approximately 1GB of RAM per billion parameters.

Someone in another comment on this post mentioned using one of the micro models (Qwen 0.6B I think?) and getting decent results. Maybe you could try that and progressively work your way up?

EDIT: “Queen” -> “Qwen”



That rule of thumb only applies to 8-bit quants at low context. The default for ollama is 4-bit, which puts a 27B model at roughly 14GB.

The vast majority of people run between 4 and 6 bit depending on system capability. The extra accuracy above 6 bit tends not to be worth it relative to the performance hit.
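The arithmetic behind these numbers is just parameter count times bits per weight. A rough sketch (weights only; it ignores KV cache and runtime overhead, which grow with context length):

    def model_size_gb(params_billions: float, bits_per_weight: float) -> float:
        """Weight-only memory estimate; ignores KV cache and runtime overhead."""
        # 1e9 params and 1e9 bytes-per-GB cancel out, leaving params * bits / 8
        return params_billions * bits_per_weight / 8

    for bits in (4, 6, 8, 16):
        print(f"27B at {bits}-bit: ~{model_size_gb(27, bits):.1f} GB")

That gives ~27GB at 8-bit (too big for 24GB) and ~13.5GB at 4-bit, which is where the ~14GB figure comes from.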


You also need to leave space for other apps. If you run a 27B model on a 32GB machine you may find you can't productively run much else.

I have 64GB and I can only just fit a bunch of Firefox and VS Code windows at the same time as running a 27B model.
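If you want to sanity-check the headroom before loading a model, something like this works. A sketch assuming psutil is installed; the 1.2x overhead factor is a guess, not a measured number:

    import psutil

    def fits_in_free_ram(model_gb: float, overhead_factor: float = 1.2) -> bool:
        """True if the model (plus a guessed runtime overhead) fits in currently available RAM."""
        available_gb = psutil.virtual_memory().available / 1e9
        return model_gb * overhead_factor < available_gb

    print(fits_in_free_ram(13.5))  # 27B at 4-bit, with your other apps still open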



