Question Repeated restarts (8x in ~5 hours)

Steelbom

Senior member
Sep 1, 2009
455
22
81
Hi,

I'm having issues with my pc restarting abruptly. There is no BSoD or error otherwise. It just restarts.

I'm trying to diagnose what the issue is. I checked the event logs but did not see anything under system that stood out to me.

My first suspicion is memory. (I can't run my 2x32GB 5600MHz kit at 5600MHz -- only at 4800MHz.)

Could anyone point me in the right direction on where to start?

Specs:
Windows 11 Pro 10.0.22621 Build 22621
AMD 7950X
Gigabyte X670E Aorus Extreme
2x32gb 5600MHz
Gigabyte 3090 Ti
ASUS 3050 Aero
2x Ark Odyssey (one into the 3090 Ti via HDMI, one into the 3050 via HDMI)
1x Samsung 49" G9 Neo (DP into the 3090 Ti)

I've got some Windows updates to do but have been unable to run long enough to do a full backup of my OS drive. Working on that...

Cheers,
SB
 

mikeymikec

Lifer
May 19, 2011
20,996
16,241
136
Have you confirmed that there is no BSOD by disabling auto restart on system failure? Sometimes it gives so little time on that screen that some monitors don't react in time.

I pulled some instructions quickly off another site:

  • Launch Settings.
  • Go to System > About.
  • From Related links, select Advanced system settings.
  • In the System Properties window, open the Advanced tab and click Settings below Startup and recovery.
  • Below System failure, uncheck Automatically restart.

Another method is to download NirSoft's BlueScreenView (free utility), which will list any BSODs that Windows has a record of (which isn't necessarily a definitive list, but it's better than nothing).

Control Panel > Security & Maintenance > Maintenance > View maintenance history can be another friendlier way of analysing the logs for weird stuff going on.

If you want to test memory then memtest86 is a good test.
 

Steelbom

Senior member
Sep 1, 2009
455
22
81
Have you confirmed that there is no BSOD by disabling auto restart on system failure? Sometimes it gives so little time on that screen that some monitors don't react in time.

I pulled some instructions quickly off another site:

  • Launch Settings.
  • Go to System > About.
  • From Related links, select Advanced system settings.
  • In the System Properties window, open the Advanced tab and click Settings below Startup and recovery.
  • Below System failure, uncheck Automatically restart.

Another method is to download NirSoft's BlueScreenView (free utility), which will list any BSODs that Windows has a record of (which isn't necessarily a definitive list, but it's better than nothing).

Control Panel > Security & Maintenance > Maintenance > View maintenance history can be another friendlier way of analysing the logs for weird stuff going on.

If you want to test memory then memtest86 is a good test.
Thanks for the help!

I have disabled it now and I will test it again. I'm pretty sure it isn't showing a bsod but I guess I'll find out soon.

I will give memtest86 a try soon.

EDIT: No luck with the auto restart turned off. Screen still goes black and reboots almost immediately.

I'm using WSL (Ubuntu 22.04). I also use Windows Terminal to connect to a lot of remote servers. I think the issue is related to one or the other or both. Going to try some updates!
 
Last edited:

mikeymikec

Lifer
May 19, 2011
20,996
16,241
136
I don't know much about WSL I'm afraid, I've heard about it and that's about it.

Does the system spontaneously reboot if you simply start Windows and leave it to idle?
 

Steelbom

Senior member
Sep 1, 2009
455
22
81
I don't know much about WSL I'm afraid, I've heard about it and that's about it.

Does the system spontaneously reboot if you simply start Windows and leave it to idle?
It does not. In fact, I was about to back up the system over 2 hours as long as I didn't open the Windows Terminal (may or may not be related) and definitely not the WSL instance(s).
Which memory kit is it? If it's Corsair, bad luck! Gigabyte that I know prefers Kingston or maybe G.Skill or Crucial.
Corsair indeed!
It's just a VM or maybe a cross between a VM and a container. OP can check if it's the issue by uninstalling WSL for a while and see if the spontaneous reboot still happens.
It definitely only happens when I'm in one of my WSL instances.


---
I have done the following:
- Updated graphics drivers to latest version through Geforce Experience (2 versions behind I was)
- Ran Windows updates (which included something for WSL)
- Updated BIOS from F6c to F6h which included a microcode update to AMD AGESA 10.0.04 and "improved memory compatibility".

Still testing at the moment. I'm hoping it's fixed. I have installed memtest pro to a USB and will do a test and see how the memory holds up.
 

Steelbom

Senior member
Sep 1, 2009
455
22
81
Could be some sort of memory violation? Try turning on Core Memory Isolation in Windows Settings.
I thought this might've been it as I had quite a long run today without a reboot. Unfortunately, it happened again.

I see a "Hardware error" entry in the Summary. Going to try a memtest. The most recent event is the bottom one at 4:19 PM.

In Event Viewer, I do see a warning just before the reboot happened:
Log Name: System
Source: Microsoft-Windows-DistributedCOM
Date: 2/02/2023 4:16:14 PM
Event ID: 10016
Task Category: None
Level: Warning
Keywords: Classic
User: Redacted
Computer: Redacted
Description:
The application-specific permission settings do not grant Local Activation permission for the COM Server application with CLSID
{2593F8B9-4EAF-457C-B68A-50F6B8EA6B54}
and APPID
{15C20B67-12E7-4BB6-92BB-7AFF07997402}
to the user Redacted SID (S-1-5-21-1899788623-3753003367-2967663886-1001) from address LocalHost (Using LRPC) running in the application container Unavailable SID (Unavailable). This security permission can be modified using the Component Services administrative tool.
Event Xml:
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
<System>
<Provider Name="Microsoft-Windows-DistributedCOM" Guid="{1B562E86-B7AA-4131-BADC-B6F3A001407E}" EventSourceName="DCOM" />
<EventID Qualifiers="0">10016</EventID>
<Version>0</Version>
<Level>3</Level>
<Task>0</Task>
<Opcode>0</Opcode>
<Keywords>0x8080000000000000</Keywords>
<TimeCreated SystemTime="2023-02-02T06:16:14.1806533Z" />
<EventRecordID>16768</EventRecordID>
<Correlation ActivityID="{4ddaace3-3688-000f-7d1c-364e8836d901}" />
<Execution ProcessID="2184" ThreadID="32968" />
<Channel>System</Channel>
<Computer>Redacted</Computer>
<Security UserID="S-1-5-21-1899788623-3753003367-2967663886-1001" />
</System>
<EventData>
<Data Name="param1">application-specific</Data>
<Data Name="param2">Local</Data>
<Data Name="param3">Activation</Data>
<Data Name="param4">{2593F8B9-4EAF-457C-B68A-50F6B8EA6B54}</Data>
<Data Name="param5">{15C20B67-12E7-4BB6-92BB-7AFF07997402}</Data>
<Data Name="param6">Redacted</Data>
<Data Name="param7">Redacted</Data>
<Data Name="param8">S-1-5-21-1899788623-3753003367-2967663886-1001</Data>
<Data Name="param9">LocalHost (Using LRPC)</Data>
<Data Name="param10">Unavailable</Data>
<Data Name="param11">Unavailable</Data>
</EventData>
</Event>
 

Attachments

  • win error.jpg
    win error.jpg
    317.9 KB · Views: 10
Jul 27, 2020
28,173
19,198
146
Damn, that's frustrating. I see that you have two GPUs. Could you remove the 3090 Ti temporarily and just work with the 3050 for a while to see if that has any effect? If the system still reboots and memtest also passes, I would try getting a cheap possibly 2x8GB DDR5 kit of a different brand to completely rule out the memory.

I'm very biased against Corsair coz their DDR3 Vengeance 4x8GB kit gave me days of headaches on my Gigabyte Z97 mobo and only way I could solve it was to replace the kit with a Kingston one. By the way, the Corsair kit worked flawlessly in a Dell Optiplex office PC. Just didn't want to play with my Gigabyte mobo.
 

mikeymikec

Lifer
May 19, 2011
20,996
16,241
136
Ignore the DCOM messages, you get those on a completely normal system.

'Hardware error' I think (from maintenance history) comes with a code that one can look up on the Internet for a few helpful pointers.
 

Steelbom

Senior member
Sep 1, 2009
455
22
81
Thanks to both!

Damn, that's frustrating. I see that you have two GPUs. Could you remove the 3090 Ti temporarily and just work with the 3050 for a while to see if that has any effect? If the system still reboots and memtest also passes, I would try getting a cheap possibly 2x8GB DDR5 kit of a different brand to completely rule out the memory.

I'm very biased against Corsair coz their DDR3 Vengeance 4x8GB kit gave me days of headaches on my Gigabyte Z97 mobo and only way I could solve it was to replace the kit with a Kingston one. By the way, the Corsair kit worked flawlessly in a Dell Optiplex office PC. Just didn't want to play with my Gigabyte mobo.
Yeah, I might have to. I'm going to try to avoid doing that until last. It'll be such a headache. I'm tempted to just buy a brand new memory kit and see how it goes. Is Kingston the most suitable for Gigabyte?
Also, what kind of USB devices do you have plugged into the system that are never removed?
Err, quite a few. About 10 or so USBs plugged in. USB Dac, mic, keyboards, etc. They haven't caused issues in the past but I'll keep this in mind if all else fails.
Ignore the DCOM messages, you get those on a completely normal system.

'Hardware error' I think (from maintenance history) comes with a code that one can look up on the Internet for a few helpful pointers.
Ohh, you're right. Interesting -- it said error code 193 can be an outdated Intel driver, memory, gpu or disk issues. I'm running a hardware diagnostic (msdt).

The results are it says there are 2 errors:
- PCI Device doesn't have a driver (not sure what this would be)
- Unknown USB Device (Device Descriptior Request Failed) has error

I have removed the Intel driver and disabled the device (I don't use WiFi), although it did say it was the latest version.

It was also recommended to run the following commands to check for disk health:

dism /online /cleanup-image /restorehealth
sfc /scannow
chkdsk /f

Thoughts?

EDIT: I just went to Startup and Recovery to set the dump to "Full dump" instead of "Small". I get an error saying that the paging file is disabled or less than 65GB. Then I set it to "Mini" and got an error saying the paging file is disabled or less than 800MB. Then I set it to "Small" again and got an error that it is disabled or less than 1MB.

My paging file is 40GB though.
 
Last edited:

mikeymikec

Lifer
May 19, 2011
20,996
16,241
136
If you get the hardware ID of the now-disabled device (e.g. VEN_8086&DEV_8008, found in the device's properties in Device Manager), googling that will probably get a result that'll tell you what the device is.
 
  • Like
Reactions: igor_kavinski

Steelbom

Senior member
Sep 1, 2009
455
22
81
If you get the hardware ID of the now-disabled device (e.g. VEN_8086&DEV_8008, found in the device's properties in Device Manager), googling that will probably get a result that'll tell you what the device is.
This is what I found: Device PCI\VEN_1022&DEV_14DE&SUBSYS_D0001458&REV_C1\4&16012499&0&0041 requires further installation.

It's not disabled though. I just disabled the wifi earlier. The PCI is something else.


VEN_1022 seems to be AMD. I can't seem to find "DEV_14DE" or "DEV_14DD".
 
Jul 27, 2020
28,173
19,198
146

I had the same issue. First install the latest chipset driver from AMD. Then go to Device Manager, right click on the device, choose Update Driver on the device, Browse my computer for driver software, choose this folder: c:\Program Files (x86)\AMD\Chipset_IODrivers\PSP Driver\

It fixed my missing driver for me, now it says: AMD PSP 11.0 Device, under Security devices.

Hopefully that works.
 

Steelbom

Senior member
Sep 1, 2009
455
22
81



Hopefully that works.
I don't think I installed any raid drivers. I do have an external raid bay which connects over USB... although, it still crashes when it is turned off.

EDIT: DISM /RestoreHealth completed. Didn't report any issues. sfc /scannow did identify corrupted files and successfully repaired them. Messing around with usb devices now.

EDIT2: So, I did a single pass on the memtest86 pro. (I know it's not a lot.) No problems detected. After booting up, my 3050 wasn't showing up so I decided to remove it in case it is the culprit. I still see an unidentified PCIE device in the device manager... thought that would've fixed it.

I've also enabled ECO mode on my 7950X for now. Just going to test some more and use it... see if I still run into any issues.
 
Last edited:

Steltek

Diamond Member
Mar 29, 2001
3,342
1,084
136
You might try pulling one memory module to see if it makes a difference.

However, just shutting off like that repeatedly without bluescreening screams PSU/power issue to me.
 
Last edited:
  • Like
Reactions: igor_kavinski

Steelbom

Senior member
Sep 1, 2009
455
22
81
You might try pulling one memory module to see if it makes a difference.

However, just shutting off like that repeatedly without bluescreening screams PSU/power issue to me.
Thanks! Going good so far. No issue doing my usual thing. Maybe the TX-1000 can't handle a 3090 Ti + 3050 + Ryzen 7950X well.
 

mikeymikec

Lifer
May 19, 2011
20,996
16,241
136
I assume the PCI Encryption/decryption controller is a representation in the OS of the CPU's AES capabilities. I've seen the device showing up plenty of times on Ryzen-era AMD systems.
 

Steltek

Diamond Member
Mar 29, 2001
3,342
1,084
136
Thanks! Going good so far. No issue doing my usual thing. Maybe the TX-1000 can't handle a 3090 Ti + 3050 + Ryzen 7950X well.

If pulling a memory module stabilizes things, then it is probably a memory kit issue.

That being said, at full draw a 1000 watt PSU might not provide enough juice for that setup. But for simple housekeeping stuff, it ought not have any problems unless your PSU is developing a fault.

Push comes to shove, if the problem returns try to pull the 3050 and test to see if the system otherwise works (a 1000 watt PSU should have no problem powering a 3090Ti + 7950x without the 3050 present).
 

Tech Junky

Diamond Member
Jan 27, 2022
3,825
1,343
106
I had an issue like this and it took awhile to pin point but, it turned out to be an Intel driver causing the issue.

On my laptop it would do the random reboots. The logs were misleading and pointed to a potential power issue. The best diagnostic was booting into a live image of Linux and using it for a couple of days without any reboots happening.

Stripped out the driver and it was stable again.
 
  • Like
Reactions: igor_kavinski

Steelbom

Senior member
Sep 1, 2009
455
22
81
I assume the PCI Encryption/decryption controller is a representation in the OS of the CPU's AES capabilities. I've seen the device showing up plenty of times on Ryzen-era AMD systems.
Interesting! So do you think it's something I do need to update? Where do I even check if it is installed?
If pulling a memory module stabilizes things, then it is probably a memory kit issue.

That being said, at full draw a 1000 watt PSU might not provide enough juice for that setup. But for simple housekeeping stuff, it ought not have any problems unless your PSU is developing a fault.

Push comes to shove, if the problem returns try to pull the 3050 and test to see if the system otherwise works (a 1000 watt PSU should have no problem powering a 3090Ti + 7950x without the 3050 present).
Haven't pulled the memory stick or put the 3050 back in yet. Just leaving it as-is for now, as it's working. If I run into the problem I'll start investigating and experimenting again.
I had an issue like this and it took awhile to pin point but, it turned out to be an Intel driver causing the issue.

On my laptop it would do the random reboots. The logs were misleading and pointed to a potential power issue. The best diagnostic was booting into a live image of Linux and using it for a couple of days without any reboots happening.

Stripped out the driver and it was stable again.
Interesting! That's a great idea.

>>>>

Interestingly, I also had an issue with the same computer where putting it to sleep would cause an instant reboot. I started another thread and couldn't get to the bottom of it -- the issue existed even before the 3050 was added to my system -- after my recent attempts/changes/fixes it now goes to sleep just fine which is great!
 
  • Like
Reactions: igor_kavinski