ZFS and block deduplication

Richard Pieri richard.pieri-Re5JQEeQqe8AvxtiuMwx3w at public.gmane.org
Wed Apr 27 19:42:31 EDT 2011


On Apr 27, 2011, at 5:00 PM, Edward Ned Harvey wrote:
> 
> It's even more difficult than that ...  Yes, many files span multiple
> blocks, and therefore begin at the beginning of one block and end in the
> middle of another block, but the hashes are calculated on a per-block basis
> up to 128k.  So any files that are smaller than 128k *might* occupy a block
> by themselves, but since they're probably being written a whole bunch of
> files at a time, most likely the write aggregation is consolidating many small
> writes into a single block.

This is not difficult.  I just target the httpd executable instead of apachectl.  The apache2 executable on the Debian system that I am looking at right now is 357K so it will have at least two blocks to itself.  The exim4 executable is 680K, taking at least four blocks to itself.  ntpd is 410K, three blocks.  sshd is 429K, three blocks.  It's not hard to find a system binary to attack this way.
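
The block counts are simple arithmetic. Here is a rough sketch, assuming the default 128K recordsize, that "K" means KiB, and that each file starts on a block boundary; the helper name full_blocks is made up for illustration. It counts only whole blocks, so a trailing partial block is ignored.

```python
RECORDSIZE_K = 128  # assumed ZFS recordsize in KiB (the default)

# Binary sizes in KiB, as quoted in the paragraph above.
binaries = {"apache2": 357, "exim4": 680, "ntpd": 410, "sshd": 429}


def full_blocks(size_k: int, recordsize_k: int = RECORDSIZE_K) -> int:
    """Number of whole records a file of size_k KiB fills completely."""
    return size_k // recordsize_k


counts = {name: full_blocks(size) for name, size in binaries.items()}

for name, n in counts.items():
    print(f"{name}: {binaries[name]}K -> {n} full {RECORDSIZE_K}K blocks")
```

Under these assumptions exim4 comes out at five full blocks, which is consistent with "at least four"; the other three match the figures above.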


> And of course the countermeasure of all the above is trivial.  Enable
> verification.  ;-)

No, the countermeasure is to keep system and user storage separate from each other, like system admins have been doing for about as long as we've had multi-user systems to admin.
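
For anyone following along, this is roughly what verification changes. It is a toy model, not ZFS code: ToyDedupStore, weak_hash and find_collision are made-up names, and the checksum is deliberately cut down to 8 bits so that a collision can be found in a fraction of a second. Real dedup uses a full cryptographic checksum, where finding a collision is the hard part of the attack. In ZFS the corresponding setting is the verify option of the dedup property.

```python
import hashlib


def weak_hash(block: bytes) -> int:
    # Deliberately weak: keep only the first byte of SHA-256 (toy assumption).
    return hashlib.sha256(block).digest()[0]


class ToyDedupStore:
    """Toy dedup table: checksum -> list of distinct stored blocks."""

    def __init__(self, verify: bool):
        self.verify = verify
        self.table = {}

    def write(self, block: bytes) -> bytes:
        """Store a block; return the bytes a later read would get back."""
        bucket = self.table.setdefault(weak_hash(block), [])
        if not bucket:
            bucket.append(block)
            return block
        if not self.verify:
            # Checksum match is trusted blindly: the new data is dropped.
            return bucket[0]
        for stored in bucket:
            if stored == block:  # byte-for-byte comparison
                return stored
        bucket.append(block)  # same checksum, different data: keep both
        return block


def find_collision(target: bytes) -> bytes:
    """Brute-force a different block with the same weak checksum."""
    want = weak_hash(target)
    i = 0
    while True:
        candidate = b"attacker-%d" % i
        if candidate != target and weak_hash(candidate) == want:
            return candidate
        i += 1


victim = b"genuine sshd block"
evil = find_collision(victim)

for verify in (False, True):
    store = ToyDedupStore(verify=verify)
    store.write(evil)  # attacker gets there first
    got = store.write(victim)  # system binary written later
    print(f"verify={verify}: read back genuine data? {got == victim}")
```

Without verification the later write of the genuine block is silently replaced by the attacker's block; with verification the two blocks are compared and stored separately.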

--Rich P.





