alt.hn

7/4/2026 at 3:54:56 PM

Show HN: Gemma 3 inference in pure C++ with Metal acceleration

https://github.com/ybubnov/metalchat

by ybubnov

7/4/2026 at 6:02:03 PM

Looks really cool, thank you. I can't find anything about performance. Is it faster? Or is it just a cool demo?

by k1r111

7/4/2026 at 11:36:29 PM

That’s on my short list of things to do next. In recent releases, my primary focus was on keeping the executable compact and providing a modern C++ API.

by ybubnov