[tahoe-dev] [tahoe-lafs] #1003: add-lease may fail to mark a node if the path by which it is reachable changes during marking

tahoe-lafs trac at allmydata.org
Sat Mar 20 16:13:23 PDT 2010


#1003: add-lease may fail to mark a node if the path by which it is reachable
changes during marking
---------------------------------------------------+------------------------
 Reporter:  davidsarah                             |           Owner:  somebody 
     Type:  defect                                 |          Status:  new      
 Priority:  major                                  |       Milestone:  undecided
Component:  code                                   |         Version:  1.6.0    
 Keywords:  leases gc preservation usability docs  |   Launchpad_bug:           
---------------------------------------------------+------------------------
 Tahoe uses a relatively simplistic mark/sweep-style GC algorithm: {{{tahoe
 deep-check --add-lease}}} is used to perform a deep traversal from a given
 root node, and extend the leases of all nodes visited by the traversal.

 This is subject to race conditions if the path by which a node is
 reachable changes during marking. For example, suppose we have two always-
 reachable mutable directories, A and B, and we also have a subtree that is
 referenced only from C. When the marking phase begins, there is a link
 from A to C, but during marking, a {{{tahoe mv}}} or equivalent is used to
 move that link to be from B to C. If the traversal visits B before the
 move, and A after it, then it will fail to mark C even though it was
 reachable throughout.

 This is mitigated somewhat by the recommended marking schedule;
 source:src/docs/garbage-collection.txt#L26 says:
 {{{
 The current recommended values for a small Tahoe grid are to renew the
 leases once a week, and to give each lease a duration of 31 days.
 }}}
 This gives a node 4 chances to be marked. It could be argued that it is
 unlikely for the above race condition to happen 4 times for a given node
 -- but there is no guarantee of that, or even a way to bound the
 probability of it happening, other than by avoiding mutations during
 marking.

 There are distributed GC algorithms that would handle this correctly, but
 they are complicated. For the time being, we should document it as a known
 issue.

-- 
Ticket URL: <http://allmydata.org/trac/tahoe/ticket/1003>
tahoe-lafs <http://allmydata.org>
secure decentralized file storage grid


More information about the tahoe-dev mailing list