Given a 1TB data set on disk with around 1KB per data record, how can I find duplicates using 512MB RAM and infinite disk space?

Posted by user288609 on Stack Overflow
Published on 2010-04-04T05:21:46Z Indexed on 2010/04/06 21:33 UTC

There is 1TB of data on a disk, with around 1KB per data record. How can I find duplicate records using only 512MB of RAM and infinite disk space?
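One common way to approach this kind of problem (not something stated in the post itself) is hash partitioning. First, scatter the records into bucket files on disk by hash value, so identical records always land in the same bucket. Then load one bucket at a time into an in-memory set. For 1TB of input, roughly 4096 buckets would give about 256MB per bucket on average, which leaves headroom under 512MB. Below is a minimal sketch of that idea. It assumes newline-delimited records, which the post does not specify, and the names `find_duplicates` and `bucket_path` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdio>
#include <fstream>
#include <functional>
#include <string>
#include <unordered_set>
#include <vector>

// Hypothetical helper: path of the i-th bucket file inside work_dir.
static std::string bucket_path(const std::string& work_dir, std::size_t i) {
    return work_dir + "/bucket_" + std::to_string(i) + ".tmp";
}

// Assumption: records are newline-delimited (the post gives no record format).
// Returns each duplicated record once. At real scale the duplicates would be
// streamed to an output file instead of being collected in memory.
std::vector<std::string> find_duplicates(const std::string& input_path,
                                         const std::string& work_dir,
                                         std::size_t num_buckets) {
    assert(num_buckets > 0);
    std::hash<std::string> hasher;

    // Pass 1: scatter records into bucket files by hash. Equal records hash
    // equally, so all copies of a record end up in the same bucket.
    // Note: thousands of simultaneously open files may exceed OS limits;
    // a real run might partition in several rounds instead.
    {
        std::ifstream in(input_path);
        std::vector<std::ofstream> buckets;
        buckets.reserve(num_buckets);
        for (std::size_t i = 0; i < num_buckets; ++i) {
            buckets.emplace_back(bucket_path(work_dir, i));
        }
        std::string record;
        while (std::getline(in, record)) {
            buckets[hasher(record) % num_buckets] << record << '\n';
        }
    }  // all bucket files are flushed and closed here

    // Pass 2: each bucket is small enough to deduplicate in memory.
    std::vector<std::string> duplicates;
    for (std::size_t i = 0; i < num_buckets; ++i) {
        const std::string path = bucket_path(work_dir, i);
        {
            std::ifstream in(path);
            std::unordered_set<std::string> seen;      // records seen in this bucket
            std::unordered_set<std::string> reported;  // duplicates already emitted
            std::string record;
            while (std::getline(in, record)) {
                if (!seen.insert(record).second && reported.insert(record).second) {
                    duplicates.push_back(record);
                }
            }
        }
        std::remove(path.c_str());  // clean up the temporary bucket file
    }
    return duplicates;
}
```

Calling `find_duplicates("records.txt", "/tmp", 4096)` would be one plausible invocation, with both paths being placeholders. If a single bucket still turns out too large for memory, it could be re-partitioned with a different hash function before the second pass.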

© Stack Overflow or respective owner
