[volunteergrid2-l] [ej3fw] connection timeouts

Shawn Willden shawn at willden.org
Tue Mar 27 02:55:52 UTC 2012


FYI, I've been uploading all day with no problems.  I'm not getting the
high data rates (~300 KBps) that I got last year when the grid was smaller,
but they're still quite reasonable (~100 KBps) and I think they're probably
a result of the greater geographic diversity we've achieved -- which is a
good tradeoff!

On Mon, Mar 26, 2012 at 10:23 PM, Steve Dodson <steve.dodson at gmail.com>wrote:

> Thanks Brian for looking into this!  The timeout config has been added to
> both rhp99 and hiro
>
>
> On Mon, Mar 26, 2012 at 9:00 AM, Iantcho Vassilev <ianchov at gmail.com>wrote:
>
>> timeout.keepalive=120 added
>>
>>
>> Great work :)
>>
>> Iantcho
>>
>>
>> On Mon, Mar 26, 2012 at 17:28, Shawn Willden <shawn at willden.org> wrote:
>>
>>> Great work, Brian!
>>>
>>> I've added the keepalive to my configuration.  I'll watch it for a bit
>>> to see if the behavior looks good.
>>>
>>>
>>> On Sun, Mar 25, 2012 at 11:40 PM, Brian Warner <warner at lothar.com>wrote:
>>>
>>>>
>>>> I did some brief digging this afternoon, uploading small files and
>>>> watching with a packet sniffer. I was connected to 15 out of 17 servers
>>>> (all but ianchov's [aty4r] and slush-backup's [2na4j]).
>>>>
>>>> One thing I observed was that my connection to stercor's server (ej3fw)
>>>> was dropping and reconnecting about once every 16 minutes. I noticed
>>>> this by looking at the "Since" column on the welcome page's server list:
>>>> it shows a timestamp of a few seconds after node reboot for most
>>>> servers, but that one server showed a fairly recent timestamp, changing
>>>> every once in a while.
>>>>
>>>> I think I figured it out, and have a workaround (as well as notes on
>>>> tools to build to help diagnose the issue more easily). I'll write a
>>>> longer letter to the tahoe-dev list with the details, so everyone can
>>>> see them. For VG2's purposes, my advice is to do at least one of:
>>>>
>>>>  1: add [node]timeout.keepalive=120 to all client's tahoe.cfg
>>>>  2: have stercor add timeout.keepalive=120 to the server's tahoe.cfg
>>>>  3: have stercor reconfigure the NAT/router box to increase the timeout
>>>>    for "idle" TCP connections to at least 10 minutes (it currently
>>>>    might be set to more like 5 minutes)
>>>>
>>>> I this this might explain a few problems, but I've certainly seen others
>>>> that this doesn't account for. I'll keep digging.
>>>>
>>>> cheers,
>>>>  -Brian
>>>> _______________________________________________
>>>> volunteergrid2-l mailing list
>>>> volunteergrid2-l at tahoe-lafs.org
>>>> https://tahoe-lafs.org/cgi-bin/mailman/listinfo/volunteergrid2-l
>>>> http://bigpig.org/twiki/bin/view/Main/WebHome
>>>>
>>>
>>>
>>>
>>> --
>>> Shawn
>>>
>>> _______________________________________________
>>> volunteergrid2-l mailing list
>>> volunteergrid2-l at tahoe-lafs.org
>>> https://tahoe-lafs.org/cgi-bin/mailman/listinfo/volunteergrid2-l
>>> http://bigpig.org/twiki/bin/view/Main/WebHome
>>>
>>
>>
>> _______________________________________________
>> volunteergrid2-l mailing list
>> volunteergrid2-l at tahoe-lafs.org
>> https://tahoe-lafs.org/cgi-bin/mailman/listinfo/volunteergrid2-l
>> http://bigpig.org/twiki/bin/view/Main/WebHome
>>
>
>
>
> --
> soli Deo gloria,
>
> Steve Dodson
>
> _______________________________________________
> volunteergrid2-l mailing list
> volunteergrid2-l at tahoe-lafs.org
> https://tahoe-lafs.org/cgi-bin/mailman/listinfo/volunteergrid2-l
> http://bigpig.org/twiki/bin/view/Main/WebHome
>



-- 
Shawn
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://tahoe-lafs.org/cgi-bin/mailman/private/volunteergrid2-l/attachments/20120326/2105c6d1/attachment-0001.html>


More information about the volunteergrid2-l mailing list