Turbo's Thoughts

May 30, 2019

Fuzzy File Deduplication via Partial Content Hashing

Presenting an algorithm for fast, imperfect (fuzzy) file hashing to aid file deduplication.

Unless indicated otherwise, prose is licensed under CC-BY-NC-4.0 and code under Apache-2.0.