Opened at 2024-11-15T15:14:31Z
Last modified at 2024-12-07T10:16:33Z
#4126 assigned defect
CI test_system fails inconsistently — at Version 6
Reported by: | hacklschorsch | Owned by: | hacklschorsch |
---|---|---|---|
Priority: | normal | Milestone: | undecided |
Component: | dev-infrastructure | Version: | n/a |
Keywords: | ci | Cc: | benoit@… |
Launchpad Bug: |
Description (last modified by btlogy)
CI reactors under test.test_system on CircleCI fail inconsistently ONLY in the Tahoe-lafs Circle CI org. Cannot reproduce locally on Nixos nor on GitHub? CI (inside similar docker images).
Possible root cause discussed in https://github.com/tahoe-lafs/tahoe-lafs/pull/1381#issuecomment-2476885548 meejah writes:
The unclean-reactor errors may be simply a downstream symptom of the real errors that also happen in that run (e.g. several tests time out).
My own tests suggest that indeed, raising the SystemTests? timeout make a couple of flaky tests much more stable:
Failure count | Test name |
1 | allmydata.test.test_system.HTTPSystemTest.test_mutable_mdmf |
3 | allmydata.test.test_system.HTTPSystemTest.test_mutable_sdmf |
30 | allmydata.test.test_system.HTTPSystemTest.test_upload_and_download_convergent |
11 | allmydata.test.test_system.HTTPSystemTest.test_upload_and_download_random_key |
This ticket is similar but not equal to ticket:4085, ticket:4022, ticket:2994 .
Change History (6)
comment:1 Changed at 2024-11-15T15:22:05Z by hacklschorsch
- Description modified (diff)
- Owner set to hacklschorsch
comment:2 Changed at 2024-11-15T15:22:44Z by hacklschorsch
- Summary changed from Tests time out on CircleCI to Tests time out on CircleCI, subsequent 'unclean reactor' errors
comment:3 Changed at 2024-11-15T15:23:06Z by hacklschorsch
- Status changed from new to assigned
comment:4 Changed at 2024-11-15T15:40:38Z by hacklschorsch
comment:5 Changed at 2024-11-15T15:41:19Z by hacklschorsch
Raising timeout in https://github.com/tahoe-lafs/tahoe-lafs/pull/1387
comment:6 Changed at 2024-12-04T13:45:03Z by btlogy
- Component changed from unknown to dev-infrastructure
- Description modified (diff)
- Keywords ci added
- Summary changed from Tests time out on CircleCI, subsequent 'unclean reactor' errors to CI test_system fails inconsistently
If I'm not mistaken, those reactor
A better fix to the dirty reactors might be to clean them up - and indeed, SystemTests does some reactor-cleanup dance in tearDown(), which *should* also be called if a test fails - but it seems that does not always happen or always help.
Also see the discussion in https://stackoverflow.com/questions/39883058/teardown-not-called-after-timeout-in-twisted-trial