Now my build is over a year old, but I think this is the appropriate forum for this kind of problem...
tldr: Computer reboots under intensive GPU loads - despite how long the load has been running, and very predictably upon certain interactions from user.
Examples:
Example 1: Doing something graphics intensive like Furmark. After a few minutes it just reboots.
Example 2: Playing a game level for 10 minutes, level ends so the level is just about to unload - instant reboot.
Example 3: Playing a turn based strategy game. Making many decisions. Click turn end - instant reboot. (doesn't happen every time)
Example 4: Using Furmark for several minutes, press alt-tab - instant reboot. (doesn't happen every time)
(Examples 2-4 can be reproduced pretty easily, but only account for ~40% of the total reboots I've seen)
When it occurs:
During intensive games: i.e. most games that can at least potentially raise the GPU temperature to 70 degrees C or above - even if the computer turning off doesn't necessarily happen when the GPU is that hot. I've seen it happen when Furmark had got the GPU to only 60 degrees C.
Data:
My CPU temps are cool under almost all conditions except Prime95 - 40-50 degrees C
My GPU temps are a bit high - 70 to 83 degrees C under sustained 100% load.
When it DOESN'T occur:
During web browsing, nor light games, nor when idle, nor during intense CPU tasks like Prime95.
When the room is cool.
Timeline:
It seems to have only started to happen often within the last few weeks, the computer is over a year old - although it happened a single time a bit over a month ago.
I made no recent hardware changes, although about 6 months ago I got a 144Hz Gsync display.
I update my Nvidia drivers frequently, which I've updated a few times since the problem first occurred. I've tried downgrading my drivers too.
What have I tried:
Disabled automatic reboot in windows (so I could see a BSOD if there was one there was none).
Checked Event Logs - only sudden power loss is noted.
Disabled BIOS overheat protection - system still rebooted under load.
I've tried running my graphics card fans at 100% (which kept GPU temp under 73 C) - no difference
I've tried running my CPU fans and case fans at higher speed - no difference
Since the problem started I updated my BIOS - no difference
Clamped down voltages in the BIOS (so they wouldn't fluctuate up under load) - no difference
I updated and downgraded graphics drivers - no difference
I unplugged peripheral devices - no difference
I've surveyed my motherboard somewhat and there doesn't appear to be any bulging or cracked capacitors.
Verified PSU is not in eco mode.
Verified all fans are actually running (including PSU).
Other:
Definite very slight buzzing I can hear when my case is open when GPU is under load (not when CPU under load like Prime95). Not rhythmic, I guess it could be a leaky capacitor but I can't tell where the sound is coming from. I don't think it's coil whine but I haven't heard every type of coil whine...
I've tried downclocking my GPU quite a bit - this seemed to help, but if I ran Prime95 simultaneously and alt-tabbed a few times I was able to get it to reboot.
I've tried cooling down the room until it's chilly - after a lot of testing this seems to resolve all crashing behavior. Note: my GPU still runs up to 79 degrees C in this case. This definitely seems to be the largest controlling factor I can implement - making me think some component, possibly not the GPU nor CPU, has become more heat sensitive?
My plan:
I've ordered a replacement PSU, I'll try replacing that first. Since it is the component that would be most affected by cooling the room and least affected by fan speeds, AND since we're talking about power dropping out, I figure PSU is the most likely candidate.
If the PSU doesn't help, I'm not sure what I'll go after next - I have other GPUs but it's significantly lower end - but I figure if it fixes the problem it could still be any other component (PSU or mobo having trouble with high amounts of power), and if it doesn't fix the problem it would again still be any other component. So swapping out the GPU won't actually tell me anything (would be different if it was a more power-hungry GPU). Still, I'll give it a shot after the PSU.
Finally I'd go for a new mobo - avoiding this because of the pain of complete disassembly and reassembly.
Specs:
i9-9900k (never overclocked)
32GB DDR4-3000 RAM (tried lowering RAM voltage and frequency - no help)
ASRock Taichi Z390 mobo
Evga 2080 XC Ultra (not overclocked (tried underclocking, see notes)) - in uppermost slot
EVGA 850W P2 PSU (NOT in eco mode, never used eco mode)
Fractal Design R6 case
Anyone disagree with any of my conclusions or have any other insights or suggestions? I'm all ears. It'll be a while before the PSU is delivered.
tldr: Computer reboots under intensive GPU loads - despite how long the load has been running, and very predictably upon certain interactions from user.
Examples:
Example 1: Doing something graphics intensive like Furmark. After a few minutes it just reboots.
Example 2: Playing a game level for 10 minutes, level ends so the level is just about to unload - instant reboot.
Example 3: Playing a turn based strategy game. Making many decisions. Click turn end - instant reboot. (doesn't happen every time)
Example 4: Using Furmark for several minutes, press alt-tab - instant reboot. (doesn't happen every time)
(Examples 2-4 can be reproduced pretty easily, but only account for ~40% of the total reboots I've seen)
When it occurs:
During intensive games: i.e. most games that can at least potentially raise the GPU temperature to 70 degrees C or above - even if the computer turning off doesn't necessarily happen when the GPU is that hot. I've seen it happen when Furmark had got the GPU to only 60 degrees C.
Data:
My CPU temps are cool under almost all conditions except Prime95 - 40-50 degrees C
My GPU temps are a bit high - 70 to 83 degrees C under sustained 100% load.
When it DOESN'T occur:
During web browsing, nor light games, nor when idle, nor during intense CPU tasks like Prime95.
When the room is cool.
Timeline:
It seems to have only started to happen often within the last few weeks, the computer is over a year old - although it happened a single time a bit over a month ago.
I made no recent hardware changes, although about 6 months ago I got a 144Hz Gsync display.
I update my Nvidia drivers frequently, which I've updated a few times since the problem first occurred. I've tried downgrading my drivers too.
What have I tried:
Disabled automatic reboot in windows (so I could see a BSOD if there was one there was none).
Checked Event Logs - only sudden power loss is noted.
Disabled BIOS overheat protection - system still rebooted under load.
I've tried running my graphics card fans at 100% (which kept GPU temp under 73 C) - no difference
I've tried running my CPU fans and case fans at higher speed - no difference
Since the problem started I updated my BIOS - no difference
Clamped down voltages in the BIOS (so they wouldn't fluctuate up under load) - no difference
I updated and downgraded graphics drivers - no difference
I unplugged peripheral devices - no difference
I've surveyed my motherboard somewhat and there doesn't appear to be any bulging or cracked capacitors.
Verified PSU is not in eco mode.
Verified all fans are actually running (including PSU).
Other:
Definite very slight buzzing I can hear when my case is open when GPU is under load (not when CPU under load like Prime95). Not rhythmic, I guess it could be a leaky capacitor but I can't tell where the sound is coming from. I don't think it's coil whine but I haven't heard every type of coil whine...
I've tried downclocking my GPU quite a bit - this seemed to help, but if I ran Prime95 simultaneously and alt-tabbed a few times I was able to get it to reboot.
I've tried cooling down the room until it's chilly - after a lot of testing this seems to resolve all crashing behavior. Note: my GPU still runs up to 79 degrees C in this case. This definitely seems to be the largest controlling factor I can implement - making me think some component, possibly not the GPU nor CPU, has become more heat sensitive?
My plan:
I've ordered a replacement PSU, I'll try replacing that first. Since it is the component that would be most affected by cooling the room and least affected by fan speeds, AND since we're talking about power dropping out, I figure PSU is the most likely candidate.
If the PSU doesn't help, I'm not sure what I'll go after next - I have other GPUs but it's significantly lower end - but I figure if it fixes the problem it could still be any other component (PSU or mobo having trouble with high amounts of power), and if it doesn't fix the problem it would again still be any other component. So swapping out the GPU won't actually tell me anything (would be different if it was a more power-hungry GPU). Still, I'll give it a shot after the PSU.
Finally I'd go for a new mobo - avoiding this because of the pain of complete disassembly and reassembly.
Specs:
i9-9900k (never overclocked)
32GB DDR4-3000 RAM (tried lowering RAM voltage and frequency - no help)
ASRock Taichi Z390 mobo
Evga 2080 XC Ultra (not overclocked (tried underclocking, see notes)) - in uppermost slot
EVGA 850W P2 PSU (NOT in eco mode, never used eco mode)
Fractal Design R6 case
Anyone disagree with any of my conclusions or have any other insights or suggestions? I'm all ears. It'll be a while before the PSU is delivered.