[tahoe-dev] testnet anomaly report
Brian Warner
warner-tahoe at allmydata.com
Mon Feb 11 18:04:40 PST 2008
I've been working on some tools to look at a set of storage servers and
identify anomalous shares:
* CHK shares for which more than one set of k-of-N encoding parameters
were used
* SDMF files for which more than one set of k-of-N parameters were used
* SDMF files for which we see multple versions present in the grid
The first and second could occur if the same file were uploaded with
different parameters: for CHK files, this should happen a lot less now
(hopefully never) that our Content Hash Key value includes the encoding
parameters in the hash.
The third happens when some storage servers are offline when a new version of
a directory is uploaded. Our SDMF mutable-file upload algorithm is designed
to handle this case, but ticket #272 points out that we haven't implemented
the necessary recovery code yet. So these files are a problem (at least when
there are fewer than 'k' shares of the newer version available).
The new 'tahoe catalog-shares NODEDIRS..' command walks the given node
directories and emits one line for each share that it discovers. This line
summarizes the file: what storage index is it for, what encoding parameters
were used, etc. The 'misc/find-share-anomalies.py' tool then analyzes a
collection of these "share catalogs" and produces a report with any problems
that it sees.
I built a share catalog about an hour ago and ran the anomaly tool over the
results. The resulting report is attached to this message, with a summary
(produced with 'grep -v " "') of the troublesome storage index values inlined
here:
CHK multiple encodings:
16a9zdg5ccjdawwd81cepti3rw
3a5rjfz15horsuxbynyepowyxa
3b457f9rtx9bbi4cfbaty81xsa
44kai1tui348689nrw8fjegc8c
5u7byy7awoe3jxf4zwshbehedc
68g6c6umnemx67bsjmmt9auf9w
998i58ckhnaxtc8z158j8u84ur
detktbkm8qwzs585j6txaf5ybc
gh3h1p9q9nar7zqkeqiokq4xto
inj1mu6wgfnrrmfnppgxmzwhec
jc96f1kb5wfsjmfsfdeurq7ohy
nobodcjahtjbsfp9hn8w3h1pqw
x4kemmi948sai6nbjsf1kxdzqe
yxusm7susgmwtaok9459dz314e
SDMF multiple versions:
1hze8idrkgsix455werhonqw8r
4mbw74wnrzg9971p7yqezamdxy
6653wrq68yyn1gsg79rntxpx6w
as3ojdp1uo9ue8m16at94pubsh
b4egmzyd59zumayjj7wdnbyuwr
b51ni3x4f54nzor1zuy6zj85dr
c7itaz3h8orc4ueq1zey6wr5cr
cgjcez4bjdm5ayk8onodgpgejc
czejfukk9mmy7fe7mm5fbmkuah
dk5seyn4rueaeq4owiihxnb6jr
fw677bowff36rwmkpnffpf5ewa
g9idg8u9r7frxyuq8tz567hana
heqeiqidwpuemhdnrgkdjqkbzr
odhn3paaycdidpatfmrowjbhhr
oo9npgn5xwfwp5uozwhqh3pzjh
smuqyechtpqe1bmrqi4xp7p18e
t1aj6bghrta157prg5pkrz9kpr
tmoi9qjq5oudpj517ktyizxfpr
uq3r7febbao7k8z1htghr373io
we59fyq31434zumrczccgzabne
xug7zwkw4cdy9q6rko4jaodrjr
y4sa84ux1n69uq6j8ef9b1wimh
If you're experiencing problems modifying a directory, you might want to see
if it's in this list (pass the URI:DIR2: cap to 'tahoe dump-cap' to extract
the right storage index).
cheers,
-Brian
-------------- next part --------------
A non-text attachment was scrubbed...
Name: testnet-anomalies.out
Type: application/octet-stream
Size: 76140 bytes
Desc: not available
Url : http://allmydata.org/pipermail/tahoe-dev/attachments/20080211/9e17d1aa/attachment-0001.obj
More information about the tahoe-dev
mailing list