[volunteergrid2-l] Recommended settings
Shawn Willden
shawn at willden.org
Mon Jun 27 12:33:25 PDT 2011
On Mon, Jun 27, 2011 at 11:37 AM, Marco Tedaldi <marco.tedaldi at gmail.com>wrote:
>
> On 27.06.2011 08:52, Shawn Willden wrote:
> > Good questions!
> I take this as a compliment :-)
>
It was intended as one :-)
> >> Is it still adviseable to set the "shares total" and the "shares happy"
> >> to the same value?
> >>
> >
> > It depends on what you want to accomplish.
> >
> > If you set shares-happy lower than shares-total, then when there are less
> > than shares-total nodes available (but >= shares-happy), then your upload
> > will succeed. Success is good... but personally for backup usage I would
> > prefer that the upload fail if I'm not getting the full degree of
> redundancy
> > that I want.
> >
> Maybe set the "shares total" to something higher than the desirde degree
> of redundance to get some additional security if available? Ok... might
> be a waste of disk space.
>
A discussion on tahoe-dev has got me rethinking my opinion here.
For best file reliability, you want to get your shares dispersed as widely
as possible. For the VG2 grid right now, that's 10 shares. So I think you
should set shares-total to 10.
On the other hand, that may mean that if one server is down, you can't
upload... ah, but there's shares-happy. So I think you should set
shares-happy to something less than 10. How much less depends on how you
want to trade off write-availability against read-availability. If you set
shares-happy to be relatively small then you'll always be able to write, but
a couple more servers going off-line might make your files unreadable. If
you set shares-happy to be large (perhaps 10), then you sometimes might not
be able to write a file... but you'll maximize your chances of being able to
read it later.
On balance, I think with the current state of the grid I'm going to use
shares-total=10 and shares-happy=9. You might want to reduce shares-happy
to 8.
And "K" is the value for "shares needed" right?
>
Yes, sorry. We often use "N" for shares-total and "K" for shares-needed.
> Ok... I'll add a cron job for that. Or I will end up with my backup gone!
>
Yes, very important! Of course, we've chosen to set the expiration time at
one year (that might be revised downward in the future, but we decided to
start conservatively), so you shouldn't have to worry too much about it.
> And I don't think that hard links are supported, right? I loves me my
> hand backup-hardlink script :-)
>
Sort of. During backup, the idempotency of immutable files means that tahoe
backup won't actually upload the file the second (or later) time it comes
across it. It will spend time reading and hashing every time it finds a
hardlink, though.
During restore, tahoe will just write multiple copies of hardlinked files.
My backup tool explicitly notices and handles hardlinks, avoiding both of
those issues. Or it would if it worked :)
> If I ever get back to my backup tool and finish it, I actually do keep
> > versions. But my tool (which uses Tahoe for the storage) isn't in state
> > where it's really even usable by me, much less anyone else. When I get
> my
> > family settled into our new home, I should have more time...
>
> Isn't there a backup utility that supports tahoe as backend? was ist
> nepomuk or deja dup? Did someone already test it or is using it?
>
Hmm, I'm not sure. Ask on tahoe-dev.
> what interfaces are you using anyway?
The HTTP API.
--
Shawn
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://tahoe-lafs.org/cgi-bin/mailman/private/volunteergrid2-l/attachments/20110627/9df74089/attachment.html>
More information about the volunteergrid2-l
mailing list