Markfw
Moderator Emeritus, Elite Member
- May 16, 2002
- 25,555
- 14,511
- 136
Reboot the computer and it may fix it. I think I remember having this and that was the fix.Bringing this thread back to the top again because I have a new problem. An X99 2676v3 Mint box in particular that has two 1070's in it has a FAH problem where the WUs appear to get stuck at 99.99%. Trying to stop and restart the client does not work, and "kill -9" does not work either. This is what I get after trying to kill the client(s):
What I have learned is that the client has become a zombie process, possibly because of my overzealous use of kill -9, or some other reason. using "ps j" to determine the parent process of the zombies returns value "1." Since 1 is "init," and the processes stay this way indefinitely, it means I have to reboot at this point , since init can't be killed, and FAHClient can't be restarted while it's zombie-self "lives" on. A curious thing is why there are two entries, I will have to keep an eye on that to see if there are two when it's running normally.Code:c7x99@c7x99-C7X99-OCE-F:~$ ps -A | grep FAH 1820 ? 00:00:18 FAHClient <defunct> 1822 ? 00:02:26 FAHClient <defunct> 3574 ? 00:10:37 FAHControl
Edit:
When running "normally," there are two FAHClients. The parent of the first is init, the parent of the second is the first FAHClient:
Code:c7x99@c7x99-C7X99-OCE-F:~$ ps -A | grep FAH 1805 ? 00:00:00 FAHClient 1808 ? 00:00:01 FAHClient 1841 ? 00:00:00 FAHCoreWrapper 1865 ? 00:00:00 FAHCoreWrapper 2704 ? 00:00:03 FAHControl c7x99@c7x99-C7X99-OCE-F:~$ ps j 1805 PPID PID PGID SID TTY TPGID STAT UID TIME COMMAND 1 1805 1805 1805 ? -1 Ssl 124 0:00 /usr/bin/FAHClient /etc c7x99@c7x99-C7X99-OCE-F:~$ ps j 1808 PPID PID PGID SID TTY TPGID STAT UID TIME COMMAND 1805 1808 1805 1805 ? -1 Sl 124 0:01 /usr/bin/FAHClient --ch