[volunteergrid2-l] howdy

Sat Mar 24 17:16:25 UTC 2012

On Fri, 23 Mar 2012 12:22:59 -0700, Brian Warner wrote:
> Hi folks.. I just joined the mailing list so I could hear from y'all
> about how Tahoe is working for you. In particular, I've heard some
> anecdotal reports about serious latency problems. Would folks mind if I
> attached a non-server client to VG2 and uploaded a few tiny
> files/directories to see what's going on? (I kind of suspect a bug in
> which TCP connections are getting silently lost, but the client doesn't
> realize it yet, and the uploader or the mutable-file-publisher is then
> waiting on a very slow TCP timeout).
>
Hi Brian,

Most definitely, I would much appreciate it.

To demonstrate (one of) the problems, I'm attaching a screenshot of
what happened on a deep-check --repair --add-lease run I started a
little while ago. The screenshot was taken about 10 minutes after
starting, when the run was still stuck at the initial queries stage,
supposedly waiting for a reply from the 16th server. This time,
something(tm) seems to have timed out after exactly 16 minutes and 59
seconds; after that, the actual deep-check went through pretty quickly.
Interestingly enough, this problem seems to appear most of the time,
but not all of the time. When it does appear, though, it's always the
connection with ej3fwcecqssij4ljf6esjkmflhook6jk. (Ted, don't be
offended, I'm just stating what I observed :-) ).
Another interesting one is inxoy6uiulkr2uwm6s3rmz6jzyiywkvi, which took
7 seconds to reply (even though my node and that one are relatively
close, both physically and network-wise). By coincidence, that is also
the node which is showing loads of gaps on the stats page
(https://vg2-stats.rosenkeller.org/), even though I assume it's not
being constantly restarted, but is online 24/7.

Again, folks: don't take this personally. I'm only describing the 
situation and trying to find out what is going wrong, and why it seems 
to be mostly related to a few nodes, while others seem rock-stable. This 
*could* of course all be network-related, but from my experience, I 
doubt it. Network issues are usually intermittent and not regularly 
repeating. Maybe Brian can indeed gain some valuable insight and shed 
some light on this, so I'm definitely in favor of his proposal.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: screen.png
Type: image/png
Size: 136413 bytes
Desc: not available
URL: <https://tahoe-lafs.org/cgi-bin/mailman/private/volunteergrid2-l/attachments/20120324/40006c95/attachment-0001.png>