#1710 closed enhancement (fixed)
Magic Folder: implement "Water Dragons" section of design doc
Reported by: | davidsarah | Owned by: | daira |
---|---|---|---|
Priority: | normal | Milestone: | 1.12.0 |
Component: | code-frontend-magic-folder | Version: | 1.9.1 |
Keywords: | unlink space-efficiency gc otf-magic-folder-objective4 | Cc: | |
Launchpad Bug: |
Description (last modified by warner)
The prototype implementation of drop-upload in #1429 adds new or changed files in the local directory to the upload directory, but it does not unlink files in the upload directory when they are deleted locally.
Note that:
- some users might not want files to be immediately unlinked from the upload directory, in case the local deletion was accidental.
- the space for unlinked files would not be reclaimed unless and until garbage collection is performed. (This applies already to old versions of changed files.)
Change History (51)
comment:1 Changed at 2012-04-06T04:17:13Z by davidsarah
comment:2 Changed at 2014-12-02T19:49:06Z by warner
- Component changed from code-frontend to code-frontend-drop-upload
- Description modified (diff)
comment:3 Changed at 2015-06-01T16:11:09Z by daira
- Keywords magic-folder added
Add magic-folder keyword to all drop-upload tickets.
comment:4 Changed at 2015-06-01T16:21:02Z by daira
- Keywords otf-magic-folder-objective-4 added; drop-upload magic-folder removed
- Milestone changed from undecided to 1.11.0
- Owner set to dawuud
- Summary changed from drop-upload: optionally unlink locally deleted files from upload directory to Magic Folder: implement local deletion of files as described in "Air Dragons" section of design doc
The intended behaviour for Magic Folder is described at docs/proposed/magic-folder/remote-to-local-sync.rst#deletion-of-a-file.
comment:5 Changed at 2015-06-01T16:29:39Z by daira
- Summary changed from Magic Folder: implement local deletion of files as described in "Air Dragons" section of design doc to Magic Folder: implement local deletion of files/directories as described in "Air Dragons" section of design doc
comment:6 Changed at 2015-06-01T16:30:12Z by daira
- Summary changed from Magic Folder: implement local deletion of files/directories as described in "Air Dragons" section of design doc to Magic Folder: implement local deletion of files/directories as described in "Water Dragons" section of design doc
comment:7 Changed at 2015-06-01T16:39:32Z by daira
- Keywords otf-magic-folder-objective4 added; otf-magic-folder-objective-4 removed
comment:8 Changed at 2015-06-10T02:04:15Z by dawuud
I wrote some rough draft code that handles the local file deletion case:
this temporary dev branch: https://github.com/david415/tahoe-lafs/tree/scan-parent
this commit: https://github.com/david415/tahoe-lafs/commit/a256ddb50d384aa18ac241dcdb9d5757725ad9d2
but we should finish our rebase and cherry-pick this commit onto the new dev branch when it is available...
comment:9 Changed at 2015-06-10T21:49:32Z by dawuud
i put those changes here: https://github.com/david415/tahoe-lafs/tree/2438.otf-objective-4.2-local-file-deletion.1 but we first need to get the tests to pass before working more on this...
comment:10 Changed at 2015-06-11T22:03:00Z by dawuud
my latest using an added api call in backupdb.py specifically for checking if a given filename has an entry in the db:
https://github.com/david415/tahoe-lafs/commit/594de028e81ce8afdc536fca4844b4c14fe9f6d0
comment:11 Changed at 2015-06-24T23:53:07Z by dawuud
in preparation for actually completing this task i've merged in the magic-folder CLI tools dev branch... and thus have been able to prepare tahoe magic-folder test grids using the CLI. Here we've written rough draft versions of test_persistence and test_move_tree that use the magic-folder CLI.
https://github.com/david415/tahoe-lafs/tree/1710.local-deletion-water-dragon.1 https://github.com/david415/tahoe-lafs/commit/0054798c9095466eea8446b66146156c9f2f4dc0 https://github.com/david415/tahoe-lafs/commit/ae9c492b79472fe71a30ee15dece87e29d18b7c1
we've introduced code duplication... next we should make sure all the magic-folder tests share this code instead of duplicating it.
we could benefit from a review but we are not currently blocked
comment:12 Changed at 2015-06-25T21:35:59Z by dawuud
cleaned up duplicate code here... https://github.com/david415/tahoe-lafs/commit/6775c38f3eb40c494d7a06e5fa9dfee97f286507
comment:13 Changed at 2015-06-27T03:27:08Z by dawuud
i'm building up Alice-Bob tests here so we can have multiple writers to a magic-folder in a test:
https://github.com/david415/tahoe-lafs/tree/1710.local-deletion-water-dragon.1.test-alice-and-bob.1
comment:14 Changed at 2015-06-29T21:03:49Z by dawuud
I had to change the magic-folder cli unit test api when i added the Alice + Bob test. in this commit i fixed the non-Alice+Bob magic-folder CLI tests: https://github.com/david415/tahoe-lafs/commit/e2ce372cbf02dc2ad56a87b505e157d576b7101b
comment:15 Changed at 2015-07-02T04:24:47Z by dawuud
I've added a naive downloader and remote scan mechanism... The basic "sharing" between Alice and Bob is working in so far as the unit test passes now.
this dev branch: https://github.com/david415/tahoe-lafs/tree/1710.local-deletion-water-dragon.1.test-alice-and-bob.1
this commit: https://github.com/david415/tahoe-lafs/commit/d214c7c2143e75a67a23b47d80220ff4f9837fbd
this test: ./bin/tahoe debug trial allmydata.test.test_magic_folder.RealTest.test_alice_bob
comment:16 Changed at 2015-07-02T18:55:35Z by dawuud
just now added "version" to the magic folder db schema... creating a version 3 of the tahoe backupdb schema... which is not used by the existing tahoe backup command and is so far used only by magic folders.
here we also roughly sketch out the addition and deletion behavior with the local version number being incremented by file modifications and deletions...
https://github.com/david415/tahoe-lafs/commit/209c0285be35aaa587cd71aee6bcc1bac4d3e452
comment:17 Changed at 2015-07-03T01:23:13Z by dawuud
added deletion/version propagation... and a basic test for that however i am having trouble getting the test_persistence test to pass... here's my latest:
https://github.com/david415/tahoe-lafs/tree/1710.local-deletion-water-dragon.1.test-alice-and-bob.1 https://github.com/david415/tahoe-lafs/commit/ccdba44e8a51787ca39a9db41b975e9290727831
comment:18 Changed at 2015-07-06T20:50:21Z by dawuud
Alice and Bob mock unit test fixed in my dev branch: https://github.com/david415/tahoe-lafs/tree/1710.local-deletion-water-dragon.1.test-alice-and-bob.1
comment:19 Changed at 2015-07-20T23:54:44Z by daira
comment:20 Changed at 2015-07-31T23:23:39Z by dawuud
slight progress with breaking work in progress commits on my dev branch here: https://github.com/david415/tahoe-lafs/tree/2477.refactor-all-the-things.0
here the alice + bob test checks for proper version numbers... and we can identify that the current difficulty happens right after alice deletes the file... her magic-folder db entry for that file should be updated with the new version number but it is not... and so the test fails because of a version number mismatch.
comment:21 Changed at 2015-08-20T11:11:46Z by dawuud
more progress in my dev branch... i got rid of the live-lock... however the test exits with a dirty reactor.
comment:22 Changed at 2015-08-20T16:10:25Z by daira
Merged David's changes and mine (to use deferredutil.HookMixin) into https://github.com/tahoe-lafs/tahoe-lafs/tree/2477.refactor-all-the-things.1.
comment:23 Changed at 2015-08-24T11:29:54Z by dawuud
found and fixed two more dropped deferreds but the alice bob unit test still errs: https://github.com/david415/tahoe-lafs/tree/2477.refactor-all-the-things.1
comment:24 Changed at 2015-08-25T20:16:01Z by daira
After our Tuesday pairing, tests now pass on that branch.
comment:25 Changed at 2015-08-28T12:47:03Z by dawuud
I do not believe we've implemented this Water Dragon feature: """When a Magic Folder client restarts, we can detect files that had been downloaded but were deleted while it was not running, because their paths will have last-downloaded records in the magic folder db without any corresponding local file."""
comment:26 Changed at 2015-08-31T12:19:33Z by daira
Yes, I believe that's correct (that it is not implemented). As well as scanning for existing files/directories at startup, we should simulate change events for paths in the magic folder db at which no file/directory exists.
We can do that by obtaining the set of all entries in the local_files table before doing the scan, removing paths from that set as we process them, and then processing all those remaining in the set.
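A minimal sketch of the set-difference approach described above, assuming the db entries are available as an iterable of relative paths. The function name find_deleted_while_offline and its arguments are hypothetical and not part of the Tahoe-LAFS code; this version only considers files, not directories.

```python
import os

def find_deleted_while_offline(db_paths, local_dir):
    """Return db-recorded relative paths that have no local file any more.

    db_paths is a hypothetical stand-in for the paths stored in the
    local_files table; local_dir is the magic folder's local directory.
    """
    # Start with every path the db knows about.
    remaining = set(db_paths)
    # Scan the local directory; each file found counts as "processed"
    # and is removed from the set.
    for dirpath, _dirnames, filenames in os.walk(local_dir):
        for name in filenames:
            relpath = os.path.relpath(os.path.join(dirpath, name), local_dir)
            remaining.discard(relpath.replace(os.sep, "/"))
    # Whatever is left was recorded in the db but is missing locally,
    # so a change event should be simulated for each of these paths.
    return sorted(remaining)
```

The caller would then queue a simulated change event for each returned path, so that the normal deletion handling runs for it.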
comment:27 Changed at 2015-08-31T21:41:01Z by dawuud
i added a get_all_files as a magic-folder db util function... but this test_persistence unit test seems broken as the startup code path does not run twice like it should... https://github.com/david415/tahoe-lafs/tree/2489.write_downloaded_file.1
comment:28 Changed at 2015-09-02T18:15:43Z by dawuud
I'm trying to fix the test_alice_bob test... and I've made some progress here... with this simple commit that teaches the uploader to ignore temp suffix files we create: ".tmp" and ".backup" :
https://github.com/david415/tahoe-lafs/tree/1710.ignore-temp-files.0
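As an illustration only, the suffix check could look roughly like the following. The names IGNORE_SUFFIXES and should_ignore are hypothetical; the actual commit on that branch may be structured differently.

```python
# Suffixes of the temporary files the downloader creates, per the comment above.
IGNORE_SUFFIXES = (".tmp", ".backup")

def should_ignore(path):
    """Return True if the uploader should skip events for this path."""
    # str.endswith accepts a tuple and matches any of its entries.
    return path.endswith(IGNORE_SUFFIXES)
```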
comment:29 Changed at 2015-09-16T14:18:24Z by daira
- Summary changed from Magic Folder: implement local deletion of files/directories as described in "Water Dragons" section of design doc to Magic Folder: implement "Water Dragons" section of design doc
comment:30 Changed at 2015-09-21T08:52:46Z by dawuud
Daira pointed out yet another bug due to our Uploader / Downloader code refactor; i made a fix commit here: https://github.com/david415/tahoe-lafs/tree/2438.magic-folder-stable.1
""" Add RemoteScanMixin to share our _get_collective_latest_file method
Both the Uploader and Downloader need to know the latest collective remote file... """
https://github.com/david415/tahoe-lafs/commit/6e8728ca0fd6a0186648bb64e187e999255f7bf3
comment:31 Changed at 2015-10-02T21:15:06Z by daira
Replying to dawuud:
Daira pointed out yet another bug due to our Uploader / Downloader code refactor; i made a fix commit here: https://github.com/david415/tahoe-lafs/commit/6e8728ca0fd6a0186648bb64e187e999255f7bf3
The https://github.com/tahoe-lafs/tahoe-lafs/commits/2438.magic-folder-stable.3 branch fixes this in a different way; the 2438.magic-folder-stable.1 branches are now obsolete.
comment:32 Changed at 2015-10-05T12:02:57Z by dawuud
- Owner changed from dawuud to daira
I think we can close this ticket now that all of the deletion propagation features have been implemented.
Can we close it?
comment:33 Changed at 2015-10-06T00:11:07Z by daira
We're close, but I need to review whether all of the Water Dragons features have been implemented.
comment:34 Changed at 2015-10-15T10:58:19Z by dawuud
I think we did not finish implementing deletion; I did part of it, and meejah mentioned he added that feature. Please post the dev branch location so we can rebase it into our new dev/stable branches.
comment:35 Changed at 2015-10-15T10:58:51Z by dawuud
- Owner changed from daira to meejah
comment:36 Changed at 2015-10-15T13:08:33Z by daira
https://github.com/tahoe-lafs/tahoe-lafs/commits/2438.magic-folder-stable.4 is the current stable branch. https://github.com/tahoe-lafs/tahoe-lafs/commits/2521.magic-db-schema.1 is the current, broken, development branch.
comment:37 Changed at 2015-10-16T18:10:26Z by meejah
There's also https://github.com/meejah/tahoe-lafs/commits/1710.actually-unlink-files.0 which I will do a version of rebased onto magic-db-schema.1 (I guess?)
comment:38 Changed at 2015-10-19T13:39:10Z by daira
- Owner changed from meejah to daira
- Status changed from new to assigned
I'm reviewing that branch now.
comment:39 Changed at 2015-10-19T17:31:34Z by daira
My current work-in-progress is at 2521.magic-db-schema.1.wip .
comment:40 Changed at 2015-10-20T15:18:54Z by meejah
I have a https://github.com/meejah/tahoe-lafs/commits/1710.actually-unlink-files.1 branch now also, and another unit-test and further smoke test logic.
There's still a case which doesn't work:
If alice creates a file, then bob deletes it, and then alice re-creates it, bob never sees it. (It *does*, however, work if bob is the one to re-create it.) I'm still investigating this.
comment:41 Changed at 2015-10-20T15:19:26Z by meejah
I mean: I have more things to push to the .1 branch, including some as-yet-unwritten code to fix the issue I mention above :)
comment:42 Changed at 2015-10-24T22:21:53Z by daira
daira: dawuud meejah: the deletion bug is an omission in the design doc. in the case of existent files, the event when the file is replaced by the downloader will be ignored because the entry for that path in the magic folder db has the same size/mtime/ctime as the new file
daira: the same *should* be the case for a deleted file. so the (size, mtime, ctime) in the db needs to be the same as returned by fileutil.get_pathinfo. the latter is (None, now, now) in the current implementation, which I think is probably wrong...
daira: because the current time at the point of the get_pathinfo call won't/can't match the time in the database
daira: the right fix is probably to change get_pathinfo to return PathInfo(isdir=False, isfile=False, islink=False, exists=False, size=None, mtime=None, ctime=None) for the ENOENT case
daira: and make sure that's also what is written to the db when the downloader detects a deleted file
comment:43 Changed at 2015-10-24T22:22:21Z by daira
(None is converted to an SQL NULL.)
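A rough sketch of the proposed get_pathinfo behaviour, together with how the None values end up as SQL NULLs. This is not the real fileutil.get_pathinfo: the PathInfo field names are taken from the comment above, while the local_files column layout and the open_db and record_pathinfo helpers are assumptions made for illustration.

```python
import errno
import os
import sqlite3
import stat
from collections import namedtuple

PathInfo = namedtuple("PathInfo", "isdir isfile islink exists size mtime ctime")

def get_pathinfo(path):
    """Stat a path; a missing path yields exists=False with all-None metadata."""
    try:
        st = os.lstat(path)
    except OSError as e:
        if e.errno == errno.ENOENT:
            # Proposed ENOENT result: no timestamps, so it can match the db row.
            return PathInfo(False, False, False, False, None, None, None)
        raise
    mode = st.st_mode
    return PathInfo(stat.S_ISDIR(mode), stat.S_ISREG(mode), stat.S_ISLNK(mode),
                    True, st.st_size, st.st_mtime, st.st_ctime)

def open_db():
    """Create an in-memory db with a hypothetical, simplified local_files table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE local_files "
                 "(path TEXT PRIMARY KEY, size INTEGER, mtime REAL, ctime REAL)")
    return conn

def record_pathinfo(conn, relpath, info):
    """Store the metadata; sqlite3 writes Python None as SQL NULL."""
    conn.execute("INSERT OR REPLACE INTO local_files (path, size, mtime, ctime) "
                 "VALUES (?, ?, ?, ?)",
                 (relpath, info.size, info.mtime, info.ctime))
```

With this shape, the row written when the downloader processes a deletion and the PathInfo later observed for the missing file agree on (size, mtime, ctime), which is what lets the subsequent event be ignored.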
comment:44 Changed at 2015-10-24T22:25:21Z by daira
We also need to think about what happens when a file is replaced by a directory or vice versa. Those are treated as different paths (the directory path ends in /), so it's a bit different from the deleted file case.
comment:45 follow-up: ↓ 46 Changed at 2015-10-24T22:37:14Z by meejah
the case I have a fix for is: on say "bob's" side if "alice" deletes: the downloader downloads a "delete", so moves the file to "whatever.tmp" but immediately bob's uploader gets an inotify for the moved file, and when that gets processed it uploads another version (with delete=True) -- but so then if "alice" now replaces the file, bob doesn't download it because the versions match.
comment:46 in reply to: ↑ 45 Changed at 2015-10-24T23:04:46Z by daira
Replying to meejah:
the case I have a fix for is: on say "bob's" side if "alice" deletes: the downloader downloads a "delete", so moves the file to "whatever.tmp"
Do you mean "whatever.backup"? Nothing should ever be moved to "whatever.tmp".
In any case, the IN_MOVED_FROM event for "whatever" should be ignored as described in comment:42. (The IN_MOVED_TO event should also be ignored because it's for an ignorable filename pattern.)
but immediately bob's uploader gets an inotify for the moved file, and when that gets processed it uploads another version (with delete=True) -- but so then if "alice" now replaces the file, bob doesn't download it because the versions match.
Yes, this sounds like the same case that I was talking about. It isn't a problem that Bob's uploader gets an inotify event, but that event should be ignored because Bob's db should already have been updated with metadata showing that the file has been deleted.
comment:47 follow-up: ↓ 48 Changed at 2015-10-24T23:23:34Z by daira
I will update the design doc tomorrow to clarify what should happen.
comment:48 in reply to: ↑ 47 Changed at 2015-10-25T13:23:05Z by daira
Replying to daira:
I will update the design doc tomorrow to clarify what should happen.
See https://github.com/tahoe-lafs/tahoe-lafs/commit/5d2365f6c4cc388c2d74e43b2c1b7e8fc4c4ab86 (the previous three commits to the design doc are also worth reading).
comment:49 follow-up: ↓ 50 Changed at 2015-10-25T13:33:33Z by daira
So, I believe the fix is just to add this before the last line of MagicFolderDB.is_new_file:
if not pathinfo.exists and row[0] is None: return False
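To make the effect of that guard concrete, here is a self-contained sketch. The real MagicFolderDB.is_new_file is not shown in this ticket, so the surrounding logic (a missing row means "new", otherwise compare size/mtime/ctime) is an assumption; only the guard line itself comes from the comment above.

```python
from collections import namedtuple

PathInfo = namedtuple("PathInfo", "isdir isfile islink exists size mtime ctime")

def is_new_file(row, pathinfo):
    """Decide whether a path needs uploading.

    row is the (size, mtime, ctime) tuple stored in the db for this path,
    or None if there is no entry (hypothetical simplification).
    """
    if row is None:
        # No db entry at all: treat the path as new.
        return True
    # The guard from the comment above: the file is gone locally and the db
    # already records it as deleted (size is NULL), so there is nothing new.
    if not pathinfo.exists and row[0] is None:
        return False
    # Assumed final comparison: any metadata difference counts as a change.
    return (pathinfo.size, pathinfo.mtime, pathinfo.ctime) != tuple(row)
```

Note that the guard matters mainly when get_pathinfo reports (None, now, now) for a missing file: the timestamps then never match the db row, and without the guard every deletion event would look like a new change.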
comment:50 in reply to: ↑ 49 Changed at 2015-12-08T15:13:56Z by daira
- Resolution set to fixed
- Status changed from assigned to closed
Replying to daira:
So, I believe the fix is just to add this before the last line of MagicFolderDB.is_new_file:
if not pathinfo.exists and row[0] is None: return False
This fix is on the 2438.magic-folder-stable.5 branch, and there are tests for deletion. So I believe this is fixed, at least until we have a new design that handles the >2-client case.
comment:51 Changed at 2016-03-22T05:02:52Z by warner
- Milestone changed from 1.11.0 to 1.12.0
Milestone renamed
On 06/04/12 03:20, erpo4 wrote on the vg2 list:
It would indeed default to off, since we don't want the behaviour to change surprisingly from the current behaviour on a Tahoe upgrade.