Recovering from prtdiag hanging on CMT boxes.
October 4, 2011
Occasionally I run into a situation on my RAC clusters (and systems with Sun Connect) where I see hundreds of prtdiag processes hanging in the process listing. This causes the system load to explode.
I have seen this occur on T2000s and T5120s.
You could run pkill to clean them up, but that won’t fix the problem.
The underlying problem is that the PICL daemon (picld) is hung. Restarting its service fixes it:
svcadm restart picl
After restarting picl, all the prtdiag processes should complete.
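If this happens often enough to be worth automating, the check and restart above could be wrapped in a small script. What follows is only a rough sketch: the threshold of 20, the DO_RESTART switch, and the function names are my own assumptions and not part of any Solaris tool, so adjust them for your environment.

```shell
#!/bin/sh
# Sketch only: THRESHOLD, DO_RESTART and the function names are
# illustrative assumptions, not part of any Solaris utility.
THRESHOLD=${THRESHOLD:-20}

# Count running processes whose name is exactly "prtdiag".
count_prtdiag() {
    pgrep -x prtdiag | wc -l | tr -d ' '
}

# Succeed (exit 0) when the count in $1 is greater than the threshold in $2.
needs_restart() {
    [ "$1" -gt "$2" ]
}

# Only touch the service when DO_RESTART=1, so the script is safe to dry-run.
if [ "${DO_RESTART:-0}" = "1" ]; then
    n=`count_prtdiag`
    if needs_restart "$n" "$THRESHOLD"; then
        echo "found $n prtdiag processes, restarting picl"
        svcadm restart picl
    fi
fi
```

Run it as root with DO_RESTART=1 set in the environment (from cron, for example). Without that variable it only defines the functions and does nothing.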