[Bug 264950] New port: misc/dedup

From: <bugzilla-noreply_at_freebsd.org>
Date: Mon, 04 Jul 2022 17:20:16 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=264950

--- Comment #5 from Oleg Borodin <borodin@unix7.org> ---

(In reply to Daniel Engberg from comment #4)

sysutils/duff - uses SHA1 to SHA256
sysutils/dupd - uses xxHash
sysutils/fdupes - uses MD5 
sysutils/filedupe - uses MD5
sysutils/jdupes - uses xxHash
sysutils/rmlint - one utility which can use the HighwayHash algorithm, but is
overloaded with functionality and dependencies.

I used one of the fastest HighwayHash which is comparable to xxHash in speed,
but as far as I understand from tests it has less chance of collisions even
with unchanged initial vector (I could be wrong here).

All you need to do to find files with duplicate content is to hash sum the
content and compare against a database in the file tree, and print the result
(of course before doing so check that the file sizes match, because that's easy
to do).

But if you think this port duplicates the functionality of the others and
doesn't make sense to add it, so be it.

Perhaps it makes sense to close this ticket.

-- 
You are receiving this mail because:
You are the assignee for the bug.