How is an exact duplicate determined
How is "exactnes" determined? Checksum? byte for byte? size? pattern? Date of creation?
Thanks
Thanks
1
person has this question
I have this question, too!
Tell me when someone answers.
The more people who ask this question, the more it gets noticed.
The more people who ask this question, the more it gets noticed.
Create a customer community for your own organization
Plans starting at $19/month
-
Inappropriate?In dupeGuru, when you choose the "Content" scanning method, the checksum of the files are used to determine if they are the same.
The company says
this answers the question
-
Inappropriate?I have a bunch of VIDEO_TS files on my drive. When I scan for exact matches DupeGuru creates a checksum of all these files and it takes forever. It should be able to determine that these files are not identical by comparing a checksum of just the first one or two MBs of each file. Of course if the first chunk matched it would still need to checksum the entire file, but there is the potential for a massive performance increase here!
-
Inappropriate?Yes, it would, but in normal circumstances, it's rare that it makes a difference. Before md5 are computed, filesizes are used to see what file need it. Only files having other files with the same size will have their md5 computed.
-
Since all VIDEO_TS files are the same size (1 GB), they always need to have their checksum calculated. And because they are huge it takes a long time. Well, it would be nice then to be able to exclude large files as a preference, like being able to exclude small files. -
Interesting point (ticket created).
Loading Profile...



EMPLOYEE