5/29/2026 at 12:55:45 PM
The official Q4_K_S gguf is quite good and has very good 35 tps generation on a M1 mac studio. Should be much faster on recent Macs, especially M5.by tarruda
5/29/2026 at 3:47:14 PM
What’s “Q4_K_S gguf” and where do I get it? Is it easy to install and configure on a MacBook?by SilverElfin
5/29/2026 at 3:55:46 PM
https://huggingface.co/stepfun-ai/Step-3.7-Flash-GGUF and you can use ollama: https://docs.ollama.com/import#Importing-a-GGUF-based-model-...by throw1234567891