Question Runtime Fault server HP Proliant dl380 G10

aliibrahim231

Junior Member
Dec 11, 2019
2
0
6
I have an HP Server dl380 G10, it was working fine for several months, but suddenly it started to shut down by itself at random times, with all the leds flashing for three times which means memory power fault as by the documentation, when starting over the server and running all diagnostics for the memory and HDD they all are passed, stress test is passed for 48 hours, but the server still crashes at random time, from the iLO 5 the reported error is under BIOS/HARDWARE HEALTH which is always green and OK unless when the server is crashed it gives the System Error:
Server Critical Fault (Service Information: Runtime Fault, Memory, Processor 2 Memory Channels 1-3 (04h))
I have only one Memory DDR4 size 16GB located on Processor 1 DIMM 8
I have only one processor (Processor 1)
The fault message says processor 2 channels 1-3 and those are all empty.
Searched a lot but no help,
I would appreciate any help or idea about this issue.
Thanks in advance.
 

aliibrahim231

Junior Member
Dec 11, 2019
2
0
6
Start with ensuring your memory is populated correctly, slot 8 seems correct, but I'm having trouble verifying that with HP's documentation

Next, firmware: https://support.hpe.com/hpesc/public/km/product/1010026819/hpe-proliant-dl380-gen10-server-models?ismnp=0&l5oid=1010026818#t=DriversandSoftware&sort=relevancy&layout=table&f:mad:kmswsoftwaretypekey=[swt8000194,swt8000193]&f:mad:kmswtargetproductbaseenvironmentlatest=[1010026819_A OS Independent]&f:mad:kmswtargetproductenvironmentlatest=[1010026819_A OS Independent]

The BIOS and iLO probably need updating. You need to have warranty or maintenance to qualify for download per their "entitlement required" message.
thanks for the reply, slot 8 is the correct as in the HP documentation for the DL380 Gen10 which states that if 1 DIMM to be populated then it should be the 8 slot,
for the BIOS and iLO i have updated them to the most current version,
i opened a case in HP but the closed it due to the server being in Syria so the won't support us
 

ch33zw1z

Lifer
Nov 4, 2004
39,033
19,718
146
Ok, then youre on your own, which means you'll have to have some spare compatible hardware to test with.

I would start with fully inspecting the internal areas of the server and verify there's no screws or metal out of place causing some kind of short on the board. Make sure all cables are in place and not damaged.

Next, ensure power to the server is clean and in spec. Flaky power can cause nuisance issues that come and go.

If all that checks out, move on to replacing the memory dimm

if the problem persists, try a different cpu.

If the problem still persists, it's likely a motherboard problem, and replacing the mother board may fix it. But exhaust all other previous steps first, while you look up replacement parts.