Bug #1626

Timeout script dies with "Signal 1 (HUP) caught by ps"

Added by Pavel Shved about 13 years ago. Updated about 11 years ago.

Status: Open
Priority: Normal
Assignee:
Category: Infrastructure
Start date: 08/09/2011
Due date:
% Done: 0%
Estimated time:
Detected in build: svn
Platform:
Published in build:

Description

On Alexey's Git machine we observed strange behavior of the ps command invoked by the timeout script:

Signal 1 (HUP) caught by ps (procps version 3.2.8).
Please send bug reports to <feedback@lists.sf.net> or <albert@users.sf.net>
Fail: Bad close ps:  at /home/tester/ldv/opt/bin/../shared/sh/timeout line 224.

This happened on the staging/comedi/drivers/ni_pcidio.c driver at commit 02f8c6aee8df3cdc935e9bdd4f2d020306035dbe, rule 32_7. The timeout script detached blast and then failed, leaving blast running forever.

Given that ps was executed approximately 25 million times during the run, even the most obscure corner case has a good chance of being hit.

Alexey proposed accounting for such potential failures and continuing to run if the failure occurred just once.
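A rough sketch of that proposal, in Python rather than the language of the timeout script itself. Everything here is an assumption made for illustration: the names TransientFailureBudget and sample_processes are hypothetical, the ps arguments are not the ones the real script uses, and "just once" is interpreted as "one failure in a row", with the counter reset after any successful ps call.

```python
import subprocess


class TransientFailureBudget:
    """Hypothetical helper: tolerate isolated ps failures, give up on repeated ones."""

    def __init__(self, max_consecutive=1):
        # Assumption: "just once" means at most one failure in a row.
        self.max_consecutive = max_consecutive
        self.consecutive = 0

    def record(self, ok):
        """Record one sampling attempt; return True if monitoring should go on."""
        if ok:
            self.consecutive = 0
            return True
        self.consecutive += 1
        return self.consecutive <= self.max_consecutive


def sample_processes(run=subprocess.run):
    """Run ps once; return its output, or None if ps failed for any reason."""
    try:
        # Illustrative ps arguments, not those of the real timeout script.
        result = run(["ps", "-eo", "pid,ppid,time"],
                     capture_output=True, text=True)
    except OSError:
        return None
    if result.returncode != 0:
        # Covers ps being killed by a signal (negative return code) as well.
        return None
    return result.stdout
```

A monitoring loop would call sample_processes() on each tick, pass `output is not None` to budget.record(), and terminate the watched process itself, instead of dying and leaving it detached, once record() returns False.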

Actions #1

Updated by Evgeny Novikov about 11 years ago

  • Status changed from New to Open
  • Assignee changed from Pavel Shved to Vitaly Mordan
  • Priority changed from Low to Normal

The new Resource Manager should avoid such situations.
