4/18/2026 at 12:44:15 PM
Thank you Michael Rabin for your excellent work. Rest in Peace.Rabin Fingerprinting is one of my favorites of his contributions. It's a "rolling hash" that allows you to quickly compute a 32-bit (or larger) hash at *every* byte offset of a file. It is used most notably to do file block matching/deduplication when those matching blocks can be at any offset. It's tragically underappreciated.
I've been meaning to write up a tutorial as part of my Galois Field series. Someday..
Thank you again!
by xorvoid
4/18/2026 at 1:28:22 PM
I recently found his fingerprint algorithm and wrote a utility that uses it to find duplicate MIPS code for decompilation[0] and build unique identifiers that can be used to find duplicates without sharing any potentially copyrighted data[1].This replaced some O(n²) searches through ASCII text, reducing search time from dozens of seconds to fractions of a second.
0 - https://github.com/ttkb-oss/mipsmatch 1 - https://github.com/ttkb-oss/mipsmatch/wiki/Identifiers
by jonhohle
4/18/2026 at 4:13:24 PM
Important to note that FastCDC is about an order of magnitude for block deduplication and is generally considered the state of art for such an approach (speed of computing the hash is more important than absolutely optimal distribution of hashes).by vlovich123
4/19/2026 at 1:36:33 AM
That's where I knew the name from. Thank you!I wrote a Rabin—Karp implementation in ~2006 as part of the spam and threat scanning stack for the MX Logic mail service. It was incredibly performant, letting us test {n} bytes against an essentially unlimited number of string signatures in O(n) time.
by syncsynchalt
4/18/2026 at 1:54:16 PM
I'm working on a data annotation system based around Rabin fingerprints. They're a really neat idea.I especially like how if you end up with hash characteristics that you don't like, your can just select a different irreducible Galois polynomial and now you've got a whole new hash algorithm. It's like tuning to a different frequency.
For me it means I don't have to worry about cases where there aren't enough nearby fingerprints for the annotation to adhere to, I can just add or remove polynomials until I get a good density.
by __MatrixMan__
4/19/2026 at 5:32:53 PM
Could you send link to Galois Field series please?by jason_s
4/20/2026 at 3:17:57 PM
https://xorvoid.com/galois_fields_for_great_good_00.htmlby xorvoid