• We’re currently investigating an issue related to the forum theme and styling that is impacting page layout and visual formatting. The problem has been identified, and we are actively working on a resolution. There is no impact to user data or functionality, this is strictly a front-end display issue. We’ll post an update once the fix has been deployed. Thanks for your patience while we get this sorted.

Why do CPU's suddenly stop working?

Page 2 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.
Had this happen to a friend once:

He was mounting the heatsink on his new Duron. He was trying to attach those annoying heatsink clamps and when he finally got it on, he heard a crack. He took off the heatsink and found his brand new Duron with a crack down the middle. Too Bad... 🙁
 
Having it working fine and then suddenly stop working is the typical failure mechanism for a CPU. All clocked semiconductor products (memory, CPU's, basically any chip whose sequence is directly controlled by a clock-like signal) should commonly fail like this when they fail - which is pretty rare. A process known as burn-in - which the 'real' burn-in performed by the manufacturer under carefully controlledd conditions (not to be confused with "home-brew" burn-in) - attempts to weed out parts that are likely to fail quickly, but under all processes a certain very small amount are likely to slip through this process. When they die, they tend not to die gradually (in fact, I can't really think of a way that it could die gradually) like a mechanical part. They either work, or they don't and when they die it should be suddenly and unexpectedly.

pkomma, you are wise to check your motherboard for damage too since CPU's that fail often take the power regulators with them with they go (less often the reverse is also true; the regulator dies taking the CPU with it). CPU's in the field tend to last a long time and generally it tends to be user error that is the culprit (not that I'm in any way saying this happened here... just that in my experience that was the case). Intel strives to keep it's failure rate extremely low, and I'm sure that AMD does likewise. I personally just think that this is probably just a case of bad luck.

Patrick Mahoney
Microprocessor Design
Intel Corp.
 
I had a STEP P3-550e, that bit the dust on me. I came home one morning, no boot, nothing. They sold the CPU's as 550@733. I was barely OCing the thing past 550, because of my video card (remember the old "BX days"). I had been using the P3 for about three months, when it kicked the bucket.
STEP's CPU's came with a lifetime warranty. I called them, they gave me an RMA and that was that. It took a little longer, since they had to "build" a replacement.

Sometimes CPU's just go on to that big RMA box in the sky 🙁

 
People with more experience will correct me if I'm wrong, but I've heard bad things about the cooling efficiency of the heat-sink fans that ship with the retail AMD processors. I recall that my Pop's 1100 mhz Athlon was having problems about six months ago, and the solution was to install a larger 400 watt power supply and a bigger hsf.

If you decide to reinstall a replacement processor to run this rig, consider buying a better heat sink.
 
...you are wise to check your motherboard for damage too since CPU's that fail often take the power regulators with them with they go (less often the reverse is also true; the regulator dies taking the CPU with it).


Could it be that a previous CPU failure, or damage to the voltage regulation circuits of the motherboard is now causing it (the board) to kill CPU's? As someone who overclocks a lot, I usually have a program (of some kind) running in Windows to monitor the CPU temp. With Athlons, the first thing I would suspect is a temperature problem but without knowing the temps this failure may never be resolved...
 


<< Having it working fine and then suddenly stop working is the typical failure mechanism for a CPU. >>



I agree with this 100%.

However, (you knew it was coming! 😉 ) I had one odd egg in the basket. I built a budget system for someone back in 1997 (Cyrix 166+, AMPTRON 8900, 16 MB EDO, etc.) and it worked perfectly for nearly 3 years until one day it started getting GPF's fatal exceptions, etc. I thought it was typical OS rot (Windows 95B) so I looked at the system and tested the ram which tested fine. The cpu was running ok temp wise, but I decided to put it in another system I had sitting on the bench (Shuttle HOT 565 with 233 MMX Intel) and sure enough that (Cyrix) CPU was causing that system to crash left and right! I dropped a P 200 MMX in the other system (messing with all those berg jumpers sucks nowadays!) and the system was running flawlessly. I took out the two 8MB SIMMS and put a 64MB DIMM in there and the person was very happy.

Anyways, that's the first CPU that I've seen that failed but not 100%. (as in either it works fine or it is dead) I thought that was rather strange.

Cheers!
 
Sounds like Pkomma covered all his bases. Like PM said this could be indeed bad luck. Maybe try a different frequency bin. Your chip is retail and therefore covered by warranty.
 
Back
Top