alt.hn

5/12/2026 at 5:35:51 PM

FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels

https://arxiv.org/abs/2604.20913

by PaulHoule

5/12/2026 at 11:49:27 PM

Paper looks great. No GitHub link that I can find though. Maybe I'll take a crack at an implementation if I've got some extra free time.

by Reubend