- Jun 30, 2004
- 16,133
- 1,742
- 126
I've had a profile on the forums lately, as if the Duck-Meister had become Chicken Little. No -- not "The Sky is falling!" "Something's wrong with my stellar starship-flagship computer, and I can't figure out what it is!" Everyone has seen my recent threads. They're on PSUs, Motherboards, Memory and STorage. And as I say in all of those threads, "It all began with an accident of stupidity, charging a vaping-pen in my computer's USB port."
Now it turns out that the accident and necessary hardware replacement -- the USB controller on the motherboard had died from the accident -- there may have been a confusion of chronological coincidence. I think there was a feature upgrade for Windows 10 last year, and some may correct me. It just seemed to me once the initial necessary hardware swap was completed two or three months after the accident, I recall there may have been a feature update.
The random shut-downs and restarts were noticeable as a first occurrence around June, maybe a few weeks after the motherboard swap. Over the last year, there have been some 16 of them, with an average interval of 14.5 days. Hard to diagnose something like that! I swapped a spare UPS, thinking that it had been failing, but the old unit was still good. I swapped in a new PSU, but that didn't resolve the problem. I had known-good RAM of the same spec -- again -- NO CIGAR! Finally, I replaced the (new) open-box motherboard with the ASUS RMA replacement. No change; intervals of four to five days. So I have two, good spare motherboards, because last year, I'd purchased another ASUS board with the same chipset -- a more feature-rich workstation model. Finally, we swapped in a twin, known-good graphics card, and replaced a twisted, tortured PSU-to-PCIE cable that connected to the graphics card. No prize for that, either.
What was left? Two simple NVME PCIE interfaces with drives that hadn't missed a lick. One auxiliary SATA-III Marvell controller in an X2 slot -- to which my backup disk for Macrium is connected. Drivers replaced. Everything that could be done to save what is now considered an "old" computer, was done.
So I stumbled across quite a collection of forum posts -- at sites other than Anandtech's -- all dealing with "Windows 10: Random shutdowns and restarts". My system would either shut down, requiring removal of the AC power cable from the UPS so that I could boot it, or it would restart and return to my Windows log-in screen.
Take a look at this, from Microsoft Community.
There are many other forums and threads turning up the same problem around the internet, several with parallel prescriptions for solving the same problem.
There is not one, single piece of hardware in my system excluding the CPU which hasn't been replaced to locate a hardware problem -- Oh, the two NVME cards and the Marvell controller -- both working just fine. The CPU will go day and night through iterative cycles of Intel CPU Diagnostic Tool. "Pass, Pass, Pass . . . Pass Pass Pass, (etc.) [next iteration] Pass Pass Pass Pass . . . . " The CPU is a year old. The way Silicon Lottery took pains to de-lid and re-lid -- no -- nothing wrong with it. Otherwise? A CPU that throws a random error every two weeks? That's hard to believe.
MORAL OF THE STORY.
Some symptoms seem catastrophic. Maybe they seem that way after an accident that actually had a catastrophic effect on a motherboard quickly gone South. So we naturally assume "Hardware! Hardware! Hardware!"
I've been fiddling around with PCs since 1983. I haven't bought an OEM computer since 1994. As much extra money, time, trouble, sweat and tears as it takes to execute a deliberate plan to build a perfect system, it seems worth it to me.
So again -- the MORAL OF THE STORY. IT may SEEM like hardware failure, but do two things right away: Run an elevated command prompt to execute "sfc /scannow". OK! I had some minor HDD corruption, which Windows fixed. I ran this procedure two more times, with nothing to report in the CBS.log file. A-number one. Stellar.
In the 1990s, the Sleep and Hibernate features were already reliable if you took the time to make them work. What we have now is a set of power or sleep states that are moving toward a full dozen. Should I have problems with a seven-year-old processor and a system built five years ago with Windows 10? I think not. But -- likely -- I do. I'm about ready to prepare for an "Installation Upgrade", with good backups of the configuration causing the trouble, stellar Macrium and Windows server backups waiting in the wings in case it flounders. With a creation of my latest Macrium rescue disk (USB flash), I'm going to do this with great deliberation and care. But I've been exercising deliberation and care all along, and I treated this system with deliberation and care, but for the vaping pen thing. But that problem was solved in May, 2021. This? Likely something else, and something else that seems to be too common.
Windows . . .
The hits just keep on comin'.
Now it turns out that the accident and necessary hardware replacement -- the USB controller on the motherboard had died from the accident -- there may have been a confusion of chronological coincidence. I think there was a feature upgrade for Windows 10 last year, and some may correct me. It just seemed to me once the initial necessary hardware swap was completed two or three months after the accident, I recall there may have been a feature update.
The random shut-downs and restarts were noticeable as a first occurrence around June, maybe a few weeks after the motherboard swap. Over the last year, there have been some 16 of them, with an average interval of 14.5 days. Hard to diagnose something like that! I swapped a spare UPS, thinking that it had been failing, but the old unit was still good. I swapped in a new PSU, but that didn't resolve the problem. I had known-good RAM of the same spec -- again -- NO CIGAR! Finally, I replaced the (new) open-box motherboard with the ASUS RMA replacement. No change; intervals of four to five days. So I have two, good spare motherboards, because last year, I'd purchased another ASUS board with the same chipset -- a more feature-rich workstation model. Finally, we swapped in a twin, known-good graphics card, and replaced a twisted, tortured PSU-to-PCIE cable that connected to the graphics card. No prize for that, either.
What was left? Two simple NVME PCIE interfaces with drives that hadn't missed a lick. One auxiliary SATA-III Marvell controller in an X2 slot -- to which my backup disk for Macrium is connected. Drivers replaced. Everything that could be done to save what is now considered an "old" computer, was done.
So I stumbled across quite a collection of forum posts -- at sites other than Anandtech's -- all dealing with "Windows 10: Random shutdowns and restarts". My system would either shut down, requiring removal of the AC power cable from the UPS so that I could boot it, or it would restart and return to my Windows log-in screen.
Take a look at this, from Microsoft Community.
There are many other forums and threads turning up the same problem around the internet, several with parallel prescriptions for solving the same problem.
There is not one, single piece of hardware in my system excluding the CPU which hasn't been replaced to locate a hardware problem -- Oh, the two NVME cards and the Marvell controller -- both working just fine. The CPU will go day and night through iterative cycles of Intel CPU Diagnostic Tool. "Pass, Pass, Pass . . . Pass Pass Pass, (etc.) [next iteration] Pass Pass Pass Pass . . . . " The CPU is a year old. The way Silicon Lottery took pains to de-lid and re-lid -- no -- nothing wrong with it. Otherwise? A CPU that throws a random error every two weeks? That's hard to believe.
MORAL OF THE STORY.
Some symptoms seem catastrophic. Maybe they seem that way after an accident that actually had a catastrophic effect on a motherboard quickly gone South. So we naturally assume "Hardware! Hardware! Hardware!"
I've been fiddling around with PCs since 1983. I haven't bought an OEM computer since 1994. As much extra money, time, trouble, sweat and tears as it takes to execute a deliberate plan to build a perfect system, it seems worth it to me.
So again -- the MORAL OF THE STORY. IT may SEEM like hardware failure, but do two things right away: Run an elevated command prompt to execute "sfc /scannow". OK! I had some minor HDD corruption, which Windows fixed. I ran this procedure two more times, with nothing to report in the CBS.log file. A-number one. Stellar.
In the 1990s, the Sleep and Hibernate features were already reliable if you took the time to make them work. What we have now is a set of power or sleep states that are moving toward a full dozen. Should I have problems with a seven-year-old processor and a system built five years ago with Windows 10? I think not. But -- likely -- I do. I'm about ready to prepare for an "Installation Upgrade", with good backups of the configuration causing the trouble, stellar Macrium and Windows server backups waiting in the wings in case it flounders. With a creation of my latest Macrium rescue disk (USB flash), I'm going to do this with great deliberation and care. But I've been exercising deliberation and care all along, and I treated this system with deliberation and care, but for the vaping pen thing. But that problem was solved in May, 2021. This? Likely something else, and something else that seems to be too common.
Windows . . .
The hits just keep on comin'.