Detecting different versions of a document, whatever it’s named

Jun 25, 2026

AI Generated ✨

Documentor

Design and groundwork began on a new Documentor capability that recognizes when several files are really versions of the same document – revisions of one another – even when their names give nothing away, and that can re-number them into one clean, consistent version sequence.

  • Completed the design for automatic version detection – how Documentor will group files that are revisions of one another and assign each a clean version number, working from the documents' own content and history rather than their filenames.
  • Built and tested the detection foundation: a compact content 'fingerprint' for every scanned file that recognizes near-duplicate revisions, not just byte-for-byte identical copies.
  • Optimized the fingerprinting algorithm to run roughly eleven times faster, so it stays quick even across very large folders.
  • Wired it into the existing folder scan so fingerprints are computed automatically in the background and reused on later scans, keeping the app responsive even on large libraries.