[tahoe-lafs-trac-stream] [tahoe-lafs] #1636: Unhandled error in Deferred while uploading (current darcs build)
tahoe-lafs
trac at tahoe-lafs.org
Tue Feb 14 01:14:09 UTC 2012
#1636: Unhandled error in Deferred while uploading (current darcs build)
----------------------------+------------------------
Reporter: killyourtv | Owner: zooko
Type: defect | Status: assigned
Priority: critical | Milestone: 1.9.2
Component: unknown | Version: 1.9.0
Resolution: | Keywords: regression
Launchpad Bug: |
----------------------------+------------------------
Changes (by killyourtv):
* cc: killyourtv@… (added)
Comment:
Replying to [comment:7 zooko]:
[...]
> killyourtv appears to be a reliable reporter of bugs so far -- his
reports are low on speculation
[...]
That was until comment:2 above. :| *Sigh*.
Anyhow...I'm back testing with was the latest git revision as of a few
days ago (the webui shows 1.9.0.post77, rev
2da8339dd762aa96e73b1c1d98e217df43802418).
{{{
allmydata-tahoe: 1.9.0.post77,
foolscap: 0.6.2,
pycryptopp: 0.5.29,
zfec: 1.4.22,
Twisted: 11.1.0,
Nevow: 0.10.0,
zope.interface: unknown,
python: 2.7.2+,
platform: Linux-debian_wheezy/sid-x86_64-64bit_ELF,
pyOpenSSL: 0.13,
simplejson: 2.3.2,
pycrypto: 2.5,
pyasn1: unknown,
mock: 0.7.2,
sqlite3: 2.6.0 [sqlite 3.7.10],
setuptools: 0.6c16dev3
}}}
Once again, patches from #1007, #1010, and #68 have been applied.
Everything, so far, seems to be working fine. Or...I should say that
things are much better than they had been. It's been running for ~36 hours
now.
The only problems I've had (a couple error 500s during upload or repair)
can probably be attributed to our not-always-as-stable-as-we'd-like grid.
When the failures occurred the tasks remained on the status page, such as
{{{
publish 3l6y3gkgdxxwwsn2mwtdmssjse No 0B 0.0%
Started
publish zyt7tuse472ikqnjc7dmw3ai4y No 589B 0.0%
Pushing shares
}}}
These entries are from around two hours ago (it currently just after 1am
UTC). These tasks are no longer running but they're still in the "Active
Operations" section.
{{{
Started: 22:53:57 13-Feb-2012
Storage Index: 3l6y3gkgdxxwwsn2mwtdmssjse
Helper?: No
Current Size: 0
Progress: 0.0%
Status: Started
Retrieve Results
Encoding: 5 of 10
Sharemap:
0 -> Placed on [3t4enhba]
1 -> Placed on [eyp5vu5v], [bgahn3ft]
2 -> Placed on [3ejzcskd]
3 -> Placed on [uxqmbg6c]
5 -> Placed on [jvgf7m73]
6 -> Placed on [3teqr2cf], [n64vwepc]
7 -> Placed on [q5b52rmg]
8 -> Placed on [ql3cupul], [p6463dyg]
9 -> Placed on [poeonyl6]
Timings:
Total: ()
Setup:
Encrypting: 0us ()
Encoding: 0us ()
Packing Shares: ()
RSA Signature:
Pushing: ()
}}}
{{{
Started: 23:18:25 13-Feb-2012
Storage Index: zyt7tuse472ikqnjc7dmw3ai4y
Helper?: No
Current Size: 589
Progress: 0.0%
Status: Pushing shares
Retrieve Results
Encoding: 3 of 10
Sharemap:
0 -> Placed on [p6463dyg], [3t4enhba]
1 -> Placed on [n64vwepc]
2 -> Placed on [basz6g7o]
3 -> Placed on [5mefqfis]
4 -> Placed on [eyp5vu5v]
5 -> Placed on [3ejzcskd]
6 -> Placed on [jvgf7m73]
7 -> Placed on [vvnvgne4]
8 -> Placed on [iqhk5e2t]
9 -> Placed on [uxqmbg6c]
Timings:
Total: ()
Setup: 5.7ms
Encrypting: 283us (2.08MBps)
Encoding: 151us (3.90MBps)
Packing Shares: ()
RSA Signature: 6.5ms
Pushing: ()
}}}
This is the only entry in twistd.log from around that time:
{{{
2012-02-13 23:37:48+0000 [-] Unhandled error in Deferred:
2012-02-13 23:37:48+0000 [-] Unhandled Error
Traceback (most recent call last):
File "/usr/lib/python2.7/dist-packages/nevow/flat/twist.py",
line 42, in cb
_drive(iterable, finished)
File "/usr/lib/python2.7/dist-packages/nevow/flat/twist.py",
line 26, in _drive
finished.callback('')
File "/usr/lib/python2.7/dist-
packages/twisted/internet/defer.py", line 362, in callback
self._startRunCallbacks(result)
File "/usr/lib/python2.7/dist-
packages/twisted/internet/defer.py", line 458, in _startRunCallbacks
self._runCallbacks()
--- <exception caught here> ---
File "/usr/lib/python2.7/dist-
packages/twisted/internet/defer.py", line 545, in _runCallbacks
current.result = callback(current.result, *args, **kw)
File "/usr/lib/python2.7/dist-packages/nevow/appserver.py", line
181, in _cbFinishRender
self.finishRequest( True )
File "/usr/lib/python2.7/dist-packages/nevow/appserver.py", line
176, in finishRequest
server.Request.finish(self)
File "/usr/lib/python2.7/dist-packages/twisted/web/http.py",
line 866, in finish
"Request.finish called on a request after its connection was
lost; "
exceptions.RuntimeError: Request.finish called on a request after
its connection was lost; use Request.notifyFinish to keep track of this
}}}
If I'm around if/when it happens again I'll generate an incident report.
All that said, things look *much better*.
--
Ticket URL: <https://tahoe-lafs.org/trac/tahoe-lafs/ticket/1636#comment:11>
tahoe-lafs <https://tahoe-lafs.org>
secure decentralized storage
More information about the tahoe-lafs-trac-stream
mailing list