ZFS, dedupe and PST files

Posted by Unreason on Server Fault
Published on 2010-12-06T09:29:33Z


I would like to know the maximum dedupe ratio I can expect for a set of PST files.

I have ~40 GB of PST files from ~15 users, with a high level of duplication among attachments. I am running tests to see whether storing the data on ZFS with dedupe would give significant space savings.

For this purpose I have installed a test setup of Nexenta, but I was wondering whether someone here has already done this and what level of deduplication I might expect (in other words, how sensitive are PST files to block alignment, and which parameters can influence the ratio?).

Initial tests show a very low dedupe ratio. I did find an explanation that block-level dedupe would not be efficient here and that byte-level dedupe would be much better (and that it should be performed by an application that is aware of the files' internal organization), so I am just double-checking here in case someone has more input.
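
The alignment sensitivity mentioned above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not a model of the PST format or of ZFS internals: blocks are fixed-size and aligned to the start of each file, random bytes stand in for a shared attachment, the 4 KiB block size is chosen for the demo, and the names block_dedupe_ratio and demo are hypothetical.

```python
import hashlib
import os


def block_dedupe_ratio(blobs, block_size=128 * 1024):
    """Return total_blocks / unique_blocks for fixed-size, file-aligned blocks.

    Each blob is cut at multiples of block_size from its start, and every
    block is identified by its SHA-256 digest (an assumption for this sketch).
    """
    total = 0
    unique = set()
    for data in blobs:
        for off in range(0, len(data), block_size):
            unique.add(hashlib.sha256(data[off:off + block_size]).digest())
            total += 1
    return total / len(unique) if unique else 1.0


def demo():
    block = 4096
    # Stand-in for one attachment stored in two different mailboxes.
    attachment = os.urandom(64 * block)

    # Case 1: the attachment starts on a block boundary in both files.
    aligned = [os.urandom(block) + attachment,
               os.urandom(block) + attachment]

    # Case 2: in the second file the attachment is shifted by 100 bytes,
    # so none of its blocks line up with the first file's blocks.
    shifted = [os.urandom(block) + attachment,
               os.urandom(block + 100) + attachment]

    return (block_dedupe_ratio(aligned, block),
            block_dedupe_ratio(shifted, block))


if __name__ == "__main__":
    aligned_ratio, shifted_ratio = demo()
    print(f"aligned copies: {aligned_ratio:.2f}x")
    print(f"shifted copies: {shifted_ratio:.2f}x")
```

In the aligned case the two copies of the attachment hash to the same blocks, so the ratio approaches 2. In the shifted case every block differs and the ratio stays at 1, even though the duplicated bytes are identical. If attachments inside PST files land at arbitrary offsets, a low block-level ratio would be consistent with this effect.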

Otherwise I will probably be converting PST files to IMAP.

© Server Fault or respective owner
