[tahoe-dev] [tahoe-lafs] #616: bug in repairer causes sporadic hangs in unit tests

tahoe-lafs trac at allmydata.org
Thu Feb 12 00:00:32 UTC 2009


#616: bug in repairer causes sporadic hangs in unit tests
---------------------------+------------------------------------------------
 Reporter:  zooko          |           Owner:       
     Type:  defect         |          Status:  new  
 Priority:  major          |       Milestone:  1.3.1
Component:  code-encoding  |         Version:  1.2.0
 Keywords:                 |   Launchpad_bug:       
---------------------------+------------------------------------------------
Changes (by warner):

  * milestone:  1.3.0 => 1.3.1


Comment:

 As mentioned in #611, we disabled the repair-from-corruption tests, and
 have only rarely seen lost-progress in the remaining repair-from-deletion
 test.

 Zooko fixed one bug in the repairer which would have caused lost-progress,
 but didn't see any other obvious ones.

 I've seen lost-progress in repair-from-deletion twice now (after Zooko's
 fix), but it's pretty rare (and therefore hard to analyze). Since
 repair-from-deletion is supposed to be deterministic, the only entropy
 source remaining is the order in which download reads and upload writes
 are interleaved, which means it's going to be a long, hard struggle to
 capture enough information for analysis.
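
 To illustrate the point about interleaving being the only remaining
 entropy source, here is a minimal sketch (not Tahoe code) of how a test
 harness might make that ordering reproducible: the order in which reads
 and writes are merged is driven by a seeded random generator, so a
 failing order can be replayed from its seed. The function name
 `interleave` and the event labels are hypothetical, chosen only for this
 example.

```python
import random


def interleave(reads, writes, seed):
    """Merge two event sequences into one schedule.

    Hypothetical helper: the relative order within each sequence is
    preserved; only the interleaving between the two depends on `seed`.
    """
    rng = random.Random(seed)  # private generator, so the seed fully fixes the order
    reads, writes = list(reads), list(writes)
    i = j = 0
    schedule = []
    while i < len(reads) or j < len(writes):
        # Take a read if no writes remain, or on a coin flip while both remain.
        take_read = j >= len(writes) or (i < len(reads) and rng.random() < 0.5)
        if take_read:
            schedule.append(reads[i])
            i += 1
        else:
            schedule.append(writes[j])
            j += 1
    return schedule


if __name__ == "__main__":
    # Print a few schedules; rerunning with the same seed repeats the same order.
    for seed in range(3):
        print(seed, interleave(["r0", "r1", "r2"], ["w0", "w1"], seed))
```

 Recording the seed alongside each test run would let a rare hang be
 replayed with the same ordering, assuming the interleaving really is the
 only nondeterministic input.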

 So we're going to push this one out to 1.3.1. We'd like to have a perfect
 repairer in 1.3.0, but we also want to have a 1.3.0 soon, and a repairer
 which hangs once out of every thousand uses might be good enough for that.

-- 
Ticket URL: <http://allmydata.org/trac/tahoe/ticket/616#comment:1>
tahoe-lafs <http://allmydata.org>
secure decentralized file storage grid

