Summary: 0000026: hangs w/ one job that never starts
Description: python -j 12 will finish 168 of the 169 tests, while the last one will never get started. Sometimes two tests will hang without ever getting started. It's likely a race condition in the integration test Queue class.

It's never the same test that fails to start, so this is unlikely to be a problem with Rosetta itself.
Steps To Reproduce: Happens more often when -j is high.
Command Line Used: python -j24
Notes
AndrewLeaverFay (Administrator)
2012-06-26 16:41

The subprocess module in python 2.x deadlocks on calls to Popen. This is fixed in python3.

I'm adding a new integration test launch script,, for use in python3.
smlewis (Administrator)
2012-08-02 12:58

Sergey (?) stated at the dev meeting that this can be fixed in Python 2.? by using fork instead of subprocess, without the inconvenient need for Python 3.
Sergey (Administrator)
2012-09-07 19:49

Committed a patch at r51043. Please use the new command line option '--fork' when running and see if it helps. This commit should also improve Ctrl-C handling.
smlewis (Administrator)
2012-09-10 09:12

I tried testing post 51043, but couldn't get the 'bad behavior' to reproduce reliably (not that it ever did) - I was going to try to do a ten-runs-of-each comparison between, --fork, and, but the former only failed the first time (out of ten tries) and so I didn't bother with the latter. Maybe it was non-normal load conditions (Sunday afternoon, nobody on the itest machine...)
Sergey (Administrator)
2012-09-28 13:50

I rewrote the script to use the Unix 'fork' call to spawn subprocesses. This seems to fix the issue.
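
For anyone reading this later, here is a rough sketch of the general idea: launching jobs with fork/exec and reaping them with wait, capped at a -j style limit. This is not the committed code from r51043. The function name run_jobs_with_fork and its arguments are made up for illustration, and it assumes a Unix-like system where os.fork is available.

```python
import os
import sys


def run_jobs_with_fork(commands, max_jobs):
    """Run each command (an argv list) in a forked child process.

    At most max_jobs children run at once. Returns a dict mapping the
    command's index to its exit code (negative signal number if killed).
    Hypothetical helper for illustration only.
    """
    if max_jobs < 1:
        raise ValueError("max_jobs must be at least 1")

    pending = list(enumerate(commands))
    running = {}   # child pid -> index of the command it is running
    results = {}   # command index -> exit code

    while pending or running:
        # Start new children until the job limit is reached.
        while pending and len(running) < max_jobs:
            index, argv = pending.pop(0)
            pid = os.fork()
            if pid == 0:
                # Child: replace this process with the command.
                try:
                    os.execvp(argv[0], argv)
                except OSError:
                    os._exit(127)  # exec failed; never fall back into parent code
            running[pid] = index

        # Parent: block until any child exits, then record its status.
        pid, status = os.wait()
        index = running.pop(pid, None)
        if index is None:
            continue  # a child we did not start here; ignore it
        if os.WIFEXITED(status):
            results[index] = os.WEXITSTATUS(status)
        else:
            results[index] = -os.WTERMSIG(status)

    return results


if __name__ == "__main__":
    # Three trivial jobs that exit with codes 0, 1 and 2, run two at a time.
    jobs = [[sys.executable, "-c", "import sys; sys.exit(%d)" % n] for n in range(3)]
    print(run_jobs_with_fork(jobs, max_jobs=2))
```

Because the parent is the only process that starts and reaps children, and it does both from a single loop, every queued job is either running or still in the pending list, so none can be silently dropped.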

