I’ve got this folder called vast/todo/t.temp
that’s got a 100 GB of stuff from old computers in it. Typically, I just copy stuff there and tell myself I’ll get back to it. There are 61,287 files, none less than a year old, and (as of now) only 5 of those 60-thousand files are less than 2 years old.
How will I ever “get back to” making sense of all that junk? Enter duff – the CLI duplicate file finder. Just say:
$ duff *.txt
and it tells you something like this:
2 files in cluster 1 (19925 bytes, digest 8b5cc01edd340e91957b54f10c22d6d3283b7962) ccc.txt zzz.txt
Then you decide whether you want to nuke ‘ccc.txt’ or ‘zzz.txt’. Bob’s wife’s your aunt.
Installation is just port install duff