Changes between Version 23 and Version 24 of FAQ


Ignore:
Timestamp:
2010-07-01T14:48:33Z (15 years ago)
Author:
terrell
Comment:

LAFSify

Legend:

Unmodified
Added
Removed
Modified
  • FAQ

    v23 v24  
    1313A: There isn't currently a way to disable or skip the encryption phase, but if you watch the status page on your local tahoe-lafs node for uploads, you'll see that the encryption time is orders (yes, plural) of magnitude smaller than the upload time, so there isn't much performance to be gained by skipping the encryption. We prefer 'secure by default', so without a compelling reason to allow insecure operation, our plan is to leave encryption turned on all the time.
    1414
    15 '''Q: Where should I look for current documentation about Tahoe's protocols?'''
     15'''Q: Where should I look for current documentation about the Tahoe-LAFS protocols?'''
    1616
    1717A: http://tahoe-lafs.org/source/tahoe/trunk/docs/architecture.txt
     
    4040'''Q: Would it make sense to just use RAID-0 and let Tahoe-LAFS deal with the redundancy?'''
    4141
    42 A: The Allmydata grid didn't bother with RAID at all: each Tahoe storage server node used a single spindle.
    43 The "RAID and/or Tahoe" question depends upon how much you trust RAID vs how much you trust Tahoe, and how expensive the different forms of
    44 repair would be. Tahoe can correctly be thought of as a form of "application-level RAID", with more flexibility than the usual RAID0/4/5
     42A: The Allmydata grid didn't bother with RAID at all: each Tahoe-LAFS storage server node used a single spindle.
     43The "RAID and/or Tahoe-LAFS" question depends upon how much you trust RAID vs how much you trust Tahoe-LAFS, and how expensive the different forms of
     44repair would be. Tahoe-LAFS can correctly be thought of as a form of "application-level RAID", with more flexibility than the usual RAID0/4/5
    4545styles (I think RAID-0 is equivalent to 1-of-2 encoding, and RAID-5 is like 2-of-3).
    4646
    4747Using RAID to achieve your redundancy gets you fairly fast repair, because it's all being handled by a controller that sits right on top of
    48 the raw drive. Tahoe's repair is a lot slower, because it is driven by a client that's examining one file at a time, and since there are a lot of
     48the raw drive. Tahoe-LAFS's repair is a lot slower, because it is driven by a client that's examining one file at a time, and since there are a lot of
    4949network roundtrips for each file. Doing a repair of a 1TB RAID-5 drive can easily be finished in a day. If that 1TB drive is filled with a
    50 million Tahoe files, the repair could take a month. On the other hand, many RAID configurations degrade significantly when a drive is lost, and
    51 Tahoe's read performance is nearly unaffected. So repair events may be infrequent enough to just let them happen quietly in the background and
     50million Tahoe-LAFS files, the repair could take a month. On the other hand, many RAID configurations degrade significantly when a drive is lost, and
     51Tahoe-LAFS's read performance is nearly unaffected. So repair events may be infrequent enough to just let them happen quietly in the background and
    5252not care much about how long they take.
    5353
     
    5959* drive/hardware costs[[BR]]
    6060
    61 it becomes a tradeoff between money (number of tahoe storage nodes, what sort of RAID [if any] you use for them, how many disks that means, how
     61it becomes a tradeoff between money (number of Tahoe-LAFS storage nodes, what sort of RAID [if any] you use for them, how many disks that means, how
    6262much those disks cost, how many computers you need to host them, how much bandwidth you spend doing upload/download/repair), bandwidth costs,
    6363read/write performance, and probability of file loss due to failures happening faster than repair.
    6464
    65 In addition, Tahoe's current repair code is not particularly clever: it doesn't put the new shares in exactly the right places, so you can
     65In addition, Tahoe-LAFS's current repair code is not particularly clever: it doesn't put the new shares in exactly the right places, so you can
    6666easily get shares doubled up and not distributed as evenly as if you'd done a single upload. This is being tracked in ticket #610.
    6767