Changes between Initial Version and Version 2 of Ticket #970


Ignore:
Timestamp:
2010-02-23T10:00:16Z (15 years ago)
Author:
warner
Comment:

wow, it sounds like you were actually able to provoke a real UCWE! Well, a real collision, at least.

So, did really none of the PUT commands result in an error? I would have expected at least one of them to emit a UCWE. Re-running the test and sending the output of each curl instance to a separate logfile would help answer this question. Also double-checking that curl emits errors to stdout when it gets a 500 or whatever HTTP error code UCWE maps to.

If you could, please do a file-check (with --verify) on the directory in question. With the dircap you show, the command would be "tahoe check --verify --raw $FW". I'm expecting to see a small number of shares of each version, for several different versions.

The file-check output will tell us, but what were the encoding parameters in use when you ran this test? I know from another ticket you were experimenting with parameters on the order of 40-of-50.. if the dirnodes (and other mutable files) were created with these same parameters, they'd be much more vulnerable to UCWE than with the normal 3-of-10 encoding. If that was a factor here, we might want to consider separate encoding-parameter configs for dirnodes (or perhaps for all mutable files), so that you can use safer 3-of-10 for them and more efficient 40-of-50 for immutable bulk data. (Note that protection from UCWE comes from small "k", whereas the usual reliability against server problems comes from having a large N-k).

(also, incidentally, when pasting large shell transcripts into a Trac page like this one, you should wrap the block with triple-curlies, so that Trac will not try to interpret the comments as WikiFormatting. And please attach other things as trac attachments instead of e.g. codepad links, because a few months from now when somebody comes back to look at this ticket, the pastebin will have expired and the contents lost)

Legend:

Unmodified
Added
Removed
Modified
  • Ticket #970

    • Property Keywords error usability added
    • Property Priority changed from critical to minor
    • Property Component changed from unknown to code-frontend-web
    • Property Summary changed from Tahoe Put can cause directory corruption to webapi PUT via multiple nodes can cause directory corruption but does not report UncoordinatedWriteError
  • Ticket #970 – Description

    initial v2  
    141461
    1515-------------------
     16{{{
    1617bash-3.2$ cat test.sh
    1718#!/bin/sh
     
    3334        fi 
    3435        done
    35 
     36}}}
    3637-------------------------------------------
    3738
    3839Error returned from curl
    39 
     40{{{
    4041UnrecoverableFileError: the directory (or mutable file) could not be retrieved, because there were insufficient good shares. This might indicate that no servers were connected, insufficient servers were connected, the URI was corrupt, or that shares have been lost due to server departure, hard drive failure, or disk corruption. You should perform a filecheck on this object to learn more.
    41 
     42}}}
    4243
    4344------------------------