4/4/2026 at 10:27:59 PM
I tried TQ for vector search and my findings is not good, it is not worth it if you cannot use GPU, however I got same quality of search as 32f using 8bit quantI wrote ann ext for sqlite, using tq, I do save a lot on space but 32f is still faster despite everything I have tried
by netdur
4/5/2026 at 12:16:27 AM
you’re right that 32f is faster on raw query time, quantization adds extra step. main benefit on download size since gzip won’t help much, which matters most in browser contextsby teamchong
4/5/2026 at 1:05:12 AM
So i assumed it would get crushed by OPQ (which requires training)by ninja3925