[tahoe-dev] Estimating reliability
Shawn Willden
shawn-tahoe at willden.org
Thu Jan 8 11:08:57 PST 2009
Has anyone done any work on code to estimate relability, given assumptions for
k, N, peer network size and peer availability? More to the point, has anyone
written any code to compute recommended values for k and N, to achieve
specified reliability for a getwork of a given size and an assumption about
peer availability?
I'm just musing about what is needed to create a very simple configuration
process for a backup network, and I think it would be much easier for a user
to pick a reliability figure from a list, where each probability of loss
value is paired with an estimate of the expansion factor. Actually, I want
to make it even simpler than an expansion factor. Assuming that in a fair
network each backup client should share as much space to others on the
network as it consumes from them, I want to calculate how much they need to
back up (by looking at their filesystem) and tell them "X% reliability for
your data will require Y GB of your storage and take Z time to back up".
With that, they can make an informed decision about X, and let the client
then compute appropriate k and N.
If no one has done this, I'll add it to my list of things to think about. If
anyone has a paper/article on the calculation, that would be useful.
Otherwise, I'll bust out my old stats texts and work it out myself.
Shawn.
More information about the tahoe-dev
mailing list