MACHINE_CHECK_EXCEPTION: How to know which is the faulty component?

brutrak

Junior Member
Jun 2, 2004
7
0
0
Hi guys, I'm new here. Hope I can help later on, but now I have a problem.

Since some weeks ago I've been experiencing blue screens in my windows XP system, showing a MACHINE_CHECK_EXCEPTION error. This only happens when I'm playing my F1 favorite simulator.

I know this is probably a hardware problem, but how can I know which is the component I have to replace?

I've used memtest and the memory is ok. I suspect it can be a motherboard problem, because the computer didn't boot at the beginning of the problem, until I cleared CMOS and pulled out and in some cards and memory.

Is there any way I could know what is wrong, or at least know what's the most probable?

Many thanks in advance!

bru
 

jcwagers

Golden Member
Dec 25, 2000
1,150
14
81
Well, I'm going to mention this because you'll probably hear this a lot on this forum....especially when you ask for help. :D We actually need to know what the specs are on your system to help you troubleshoot the problem. Try to be as specific as possible. There are a TON of very knowledgeable people on here...but they can't help you if they don't know what you're running. :) Once you get your system specs up, I'm sure you'll get more hits! Good luck and keep us informed about what you try! :)

jc
 

ja83

Member
May 31, 2004
86
0
0
"A machine check exception occurs when Windows XP and your hardware platform cannot recover from some type of hardware error so that the system can continue to run successfully and reliably. More specific diagnosis of machine check exceptions is difficult and there is no general solution. Contact your hardware manufacturer or a computer hardware technician for help with troubleshooting this issue.

Machine check exceptions are frequently caused by one of the following conditions:

-Running the processor or mainboard beyond its specifications. For example, overclocking the processor or bus. Microsoft recommends that you run your hardware at the manufacturer-rated speeds.

-Noisy power, overstressed power strips, outmatched power supplies and failing power supplies can destabilize your computer. Make sure that you have a stable, reliable power supply to your computer.

-Extreme thermal conditions caused by the failure of cooling devices such as fans. Make sure your cooling devices are all working.

-Damaged memory, or memory that is not the correct type for your computer. If you recently changed the memory configuration, revert to the previous configuration to determine what is wrong. Make sure that you are using the correct memory for your computer."
 

brutrak

Junior Member
Jun 2, 2004
7
0
0
Excuse the lack of info. Here are my specs:

Motherboard Soltek SL-85ERV2
Pentium 4 2.4B 533FSB
Memory Kingston 512MBs DDR333
HD Seagate Barracuda 40GBs
DVD-Rewriter Nec 2500A
MSI Geforce 4200ti
Generic Power Supply 300W
Microsoft Sidewinder Wheel USB

Not overclocked at the moment, but overclocked it for some hours, a week before the problems started.

Tonight it's rebooting even worse, just listening to mp3. No need to play to experience the problems described.

Sometimes is a "MACHINE_CHECK:EXCEPTION", sometimes (less) is "IRQL_LESS_NOT_EQUAL", or something similar.

I've installed Motherboard Monitor, and temps go from 44C-59C.

Hope it helps, many thanks!!

bru
 

EeyoreX

Platinum Member
Oct 27, 2002
2,864
0
0
If you are getting random BSODs at various times, and throwing that IRQ error in, seems to point to memory problems. If possible I'd either try removing a stick of RAM (if you have two) and seeing if the error goes away. If not, try the other. If that still doesn't work see if you can get some other RAM to try (borrow or buy, I know, easier said than done). Memtest is a good place to start, but it is not perfect and can sometimes "pass" memory that in fact needed replacing.

That's where I'd start if I were you. Good luck!! Keep us updated if this works or not.

\Dan
 

Linux23

Lifer
Apr 9, 2000
11,370
741
126
Originally posted by: brutrak
Excuse the lack of info. Here are my specs:

Motherboard Soltek SL-85ERV2
Pentium 4 2.4B 533FSB
Memory Kingston 512MBs DDR333
HD Seagate Barracuda 40GBs
DVD-Rewriter Nec 2500A
MSI Geforce 4200ti
Generic Power Supply 300W
Microsoft Sidewinder Wheel USB

Not overclocked at the moment, but overclocked it for some hours, a week before the problems started.

Tonight it's rebooting even worse, just listening to mp3. No need to play to experience the problems described.

Sometimes is a "MACHINE_CHECK:EXCEPTION", sometimes (less) is "IRQL_LESS_NOT_EQUAL", or something similar.

I've installed Motherboard Monitor, and temps go from 44C-59C.

Hope it helps, many thanks!!

bru


all video/audio/system drivers up to date?
 

brutrak

Junior Member
Jun 2, 2004
7
0
0
All drivers are up to date:

- Nvidia 56.72 forceware (from nvidia site)
- Via Hyperion 4-in-1 4.51.
- XP SP1

I've removed one of the memory sticks, and it keeps failing. 90% "MACHINE_CHECK_EXCEPTION", 10% "IRQL_LESS_...".

Something that happened to me that could be useful:

The computer has been solid as a rock during a year and a half. Then, suddenly, started to crash, reboot or show BSOD when playing racing games. After a crash, it was nearly impossible to boot, didn't even show the BIOS startup message. No beeps, no memory check, only a black screen. Some memory errors detected with memtest, in both sticks.

To solve this state I had to remove the memory sticks and graphics card, or reset the CMOS. Then it started to boot again, and it ran well until the next crash. I reinstalled everything, without success.

Then, after removing memory sticks and cards and puting them on again, the system still crashes often, but then reboots without problems. No memory errors at all detected with memtest.

My first impression is a motherboard problem (specially because of the uncapacity to boot after an error), but now I don't know how to start.

- Power supply? It's a generic 300W, I probably need something better, but can this be the problem?
- Lack of ventilation? Max CPU temp is 57-59 celsius. Is that too much?
- Motherboard? Memory? CPU? Graphics card?
- Others?

It's a good chance to upgrade the whole system, but I wouldn't like to spend that much. How to start?

Thanks!!!
 

brutrak

Junior Member
Jun 2, 2004
7
0
0
Thanks for the info, I had already seen it.

For each one of the causes that Microsoft's URL describes:

1) System bus errors.
2) Memory errors that may include parity or Error Correction Code (ECC) problems.
3) Cache errors in the processor or hardware.
4) Translation Lookaside Buffers (TLB) errors in the processor.
5) Other CPU-vendor specific detected hardware problems.
6) Vendor-specific detected hardware problems.

Which component could be the cause?

1) Motherboard?
2) Memory?
3) Processor?
4) Processor?
5) Anything
6) Anything

I'm probably gonna change the motherboard+power supply+cpu ventilation first. If then errors persist but memtest is ok I will change the CPU. And finally memory.

And then probably I will be poor ;-)

Do you think is a logical path?

Thanks!
 

Delorian

Senior member
Mar 10, 2004
590
0
0
59C is pretty high, plus the temp given can be off by a few degrees. I'd double-check your HSF setup to make sure it is correctly installed, clear any dust out of the fans/case with air-duster and check your temps again. If they are still in the upper 50s then I would underclock your CPU a little or get a better HSF setup. I ordered the TT Volcano 12+ off of newegg for $35 and ordered Arctic Silver 5 from bestbyte for $5 (both shipped at that price) and they keep my 2500+ cool (low 40s to low 50s) at 200FSB instead of 166-stock. This may not be were your problem comes from, but I didn't like using my AMD fan setup as the temps it gave me at stock voltages/frequency were more than I liked. Now even at 200FSB my setup is a whole lot more stable. Let us know how your problem develops!
 

biostud

Lifer
Feb 27, 2003
19,705
6,780
136
do you use onboard audio? if not try moving your soundcard, or disable the onboard sound
 

VirtualLarry

No Lifer
Aug 25, 2001
56,570
10,205
126
This thread is interesting. I have an AMD Athlon TbredA XP1800 CPU, running at 166Mhz x 10.0 (XP2000+ speed), 1.525v, in an MSI KT4V-L mobo. CPU temps were approaching 52-53C under load, but after cleaning out the fan/heatsink and re-installing thermal grease, max CPU temps are now around 43C.

I have a multi-boot system with Win98se, W2K SP2, and XP SP1. When booting into XP, I often get a small on-screen dialog when booting, about a "Machine check exception: REGS", or something like that.

The system is otherwise completely stable, and indeed, this dialog is not accompanied by any sort of instability whatsoever. Prime95 stable, MemTest86 stable, etc., for 24hr+ periods. Haven't had any sort of BSOD in W2K/XP for months now, and that was only because I hotplugged some USB printers, and W2K crashed when trying to install/load them. (USB support in W2K, especially printing, and especially with only SP2, is kind of known to be flaky.)

It just sort of puzzles me that I get this error dialog. From reading the docs, it appears that my CPU only supports part of the MCA (machine-check architecture), and perhaps the error message is because of that incomplete support, and not because of any real error. I don't know, but I only boot XP for some games, so I basically have just been ignoring the message. Nothing else that I can see, is having any sort of issue whatsoever.

Has anyone else seen this behavior?