Git: Find duplicate blobs (files) in this tree

Posted by Readonly on Stack Overflow
Published on 2008-10-22T06:39:23Z, indexed on 2010/04/30 10:27 UTC


This is sort of a follow-up to this question.

If multiple files have the same contents, Git stores them only once in the repository, as a single blob, because their SHA-1 hashes are identical. How would one go about finding all duplicate files in a given tree?

Would you have to walk the tree and look for duplicate hashes, or does Git provide backlinks from each blob to all the files in a tree that reference it?
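For illustration, here is a minimal sketch of the "walk the tree" approach. Git's object model stores references from trees to blobs rather than the reverse, so grouping paths by blob hash is one straightforward way to do it. The sketch assumes `git` is on the PATH and uses `git ls-tree -r -z`, which lists every entry of a tree recursively as `<mode> <type> <hash>\t<path>`, NUL-terminated. The function names `group_paths_by_blob` and `find_duplicate_blobs` are hypothetical helpers, not part of Git.

```python
import subprocess
from collections import defaultdict


def group_paths_by_blob(ls_tree_output: str) -> dict:
    """Parse `git ls-tree -r -z` output and return {blob_hash: [paths]}
    for every blob that is referenced by more than one path."""
    paths_by_hash = defaultdict(list)
    for entry in ls_tree_output.split("\0"):
        if not entry:
            continue  # skip the empty piece after the final NUL
        # Each entry looks like: "<mode> <type> <hash>\t<path>"
        meta, path = entry.split("\t", 1)
        _mode, obj_type, obj_hash = meta.split(" ")
        if obj_type == "blob":  # ignore submodule entries (type "commit")
            paths_by_hash[obj_hash].append(path)
    return {h: p for h, p in paths_by_hash.items() if len(p) > 1}


def find_duplicate_blobs(treeish: str = "HEAD", repo_dir: str = ".") -> dict:
    """Run `git ls-tree` on the given tree-ish and group duplicate blobs."""
    result = subprocess.run(
        ["git", "ls-tree", "-r", "-z", treeish],
        cwd=repo_dir,
        check=True,
        capture_output=True,
        encoding="utf-8",
        errors="surrogateescape",  # tolerate paths that are not valid UTF-8
    )
    return group_paths_by_blob(result.stdout)


if __name__ == "__main__":
    # Print each duplicated blob hash followed by the paths that share it.
    for blob_hash, paths in find_duplicate_blobs().items():
        print(blob_hash)
        for path in paths:
            print("    " + path)
```

Run from inside a repository, the script prints each duplicated blob hash followed by the paths that point to it. Symlinks are also stored as blobs, so two symlinks with the same target would be reported as duplicates too.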

© Stack Overflow or respective owner
