[tahoe-dev] How large can a Small Distributed Mutable File be?

zooko zooko at zooko.com
Wed Nov 7 10:42:00 PST 2007


Folks:

Brian's excellent design document mutable.txt [1] describes the  
design of Small Distributed Mutable Files, which are the next big  
feature in Tahoe (as per the Roadmap [2]).

I'm reading that document now, and it suggests that you should use  
the SDMF feature only for files < 1 MB in size.  This raises the  
question in my mind -- what's the practical limit?

There is no hard-coded limit in the SDMF format, but changing  
anything -- even changing one byte or appending one byte -- in an  
SDMF is implemented as re-encrypting, erasure coding, Merkle-tree- 
building, and uploading the entire file.  That's why we put "Small"  
in the name.

So it is a matter of users -- is there somebody out there who will be  
happy to do all that every time they change their file, even though  
their file is 1 GB?  Are there people who will be unsatisified with  
the total performance of SDMFs even if their files are only 100 KB?

I guess a big part of it is how often you change the file, and  
whether you need to wait for the new change to be written before you  
can go on and do something else.

I guess we will learn more about this question as soon as SDMFs are  
complete enough to benchmark, which I think will be today.

Regards,

Zooko

[1] http://allmydata.org/trac/tahoe/browser/docs/mutable.txt
[2] http://allmydata.org/trac/tahoe/roadmap



More information about the tahoe-dev mailing list