[H-GEN] a process that won't die

Martin Pool mbp at linuxcare.com.au
Fri Oct 6 02:43:31 EDT 2000


On Fri, Oct 06, 2000 at 04:02:59PM +1000, Daniel Quinlan wrote:
> On Fri, Oct 06, 2000 at 11:09:22AM +1000, Martin Pool wrote:
> > If the machine is still up, you can track this down by working out
> > which semaphore is held.  Please run this command to show the wait
> > channel for all the processes:
> > 
> >   ps awwx -eo pid,tt,user,fname,tmout,f,wchan
> 
> 23021 ?        root     useradd      - 100 do_down

do_down means it was trying to lower a semaphore.  Unfortunately
I think there's no way to tell which one without running gdb on kcore.

> someone else suggested running lsof which yielded
> 
> host:~# lsof -p 23021 -lR
> COMMAND   PID PPID     USER   FD   TYPE DEVICE  SIZE   NODE NAME
> useradd 23021 6082        0  cwd    DIR    8,1  2048  34830 [0801]
> useradd 23021 6082        0  rtd    DIR    8,1  1024      2 [0801]
> useradd 23021 6082        0  txt    REG    8,1 50364 120947 [0801]
> useradd 23021 6082        0    0u   CHR    3,3        49840 [0801]
> useradd 23021 6082        0    1u   CHR    3,3        49840 [0801]
> useradd 23021 6082        0    2u   CHR    3,3        49840 [0801]
> useradd 23021 6082        0    3u   REG    3,2     0  57572 [0302]
> useradd 23021 6082        0    4w   REG    3,2  4608  61743 [0302]
> useradd 23021 6082        0    5r   REG    3,2   498  57571 [0302]
> useradd 23021 6082        0    6wW  REG    8,1     0  28728 [0801]
> useradd 23021 6082        0    7u   REG    8,1  1742  28958 [0801]
> useradd 23021 6082        0    8u   REG    8,1  1141  28940 [0801]
> useradd 23021 6082        0    9r   REG    8,1  1742  28958 [0801]
> useradd 23021 6082        0   10w   CHR    4,0        49902 [0801]

If you care enough, you can use

  find / -inum 28728 

to find out which files they are.  Probably 28728 is the .pwd.lock
file, and you've run into a bug in the kernel locking code.

  # ls -li /etc/.pwd.lock 

will help.

> and this might be of interest. (this being why I'm upgrading the kernel)
> host:/etc# uname -a
> Linux host 2.0.34 #2 Thu Jul 9 10:57:48 EST 1998 i586 unknown

So if there was a bug it was probably fixed three years ago.

-- 
Martin Pool, Linuxcare, Inc.
+61 2 6262 8990
mbp at linuxcare.com, http://www.linuxcare.com/
Linuxcare. Support for the revolution.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 232 bytes
Desc: not available
URL: <http://lists.humbug.org.au/pipermail/general/attachments/20001006/955b625b/attachment.sig>


More information about the General mailing list