Why can GPU's get so much hotter than CPU's?

phaxmohdem

Golden Member
Aug 18, 2004
1,839
0
0
www.avxmedia.com
Just curious, I was noticing that the max temp my 7800GTX can do before "core slow down" kicks in is 125 degrees C, how come CPU's only do 60-70 before going heat crazy?
 

aka1nas

Diamond Member
Aug 30, 2001
4,335
1
0
GPUs are usually built on an older process than current CPUs of that time.
 

Matt2

Diamond Member
Jul 28, 2001
4,762
0
0
Without pretending like I know more than I do, each has its own set of thermal characteristics.
 

phaxmohdem

Golden Member
Aug 18, 2004
1,839
0
0
www.avxmedia.com
But GPU's and CPU's are all built in the same silicon processes are they not? Even if GPU's are still built on last gen .13 .11nm processes, those CPu's still would not get to 125C.

The only thing I can possibly think of is that typically when CPU's get hot they start getting less accurate with the calculations? (Ex. SuperPi miscalculations) Do GPU's just not need to be 100% precise, thus allowing for higher thermal tolerances? Are the "pipelines" of a GPU (I know that is an incorrect term) simply shallow enough to not be condusive to heat related errors?

I'm just talking out of my arse as this point, but I'm still very curious to get to the bottom of this.
 

PingSpike

Lifer
Feb 25, 2004
21,741
569
126
I have no idea. I have a feeling that errors in a video card aren't catastrophic to the system like cpu ones are. In short...it is fvcking up, just not enough to reboot your machine. Thats just sort of a guess and I would only say it partially explains it though.
 

mooncancook

Platinum Member
May 28, 2003
2,874
50
91
back in the days we used to have powerful video cards that don't need a cooler (3Dfx Voodoo 2) while all CPUs requires big coolers. I don't know howthe performance-to-heat ratio of video card changes over the years as compared to CPUs. Maybe video card has performance increase at a much faster rate than CPU and therefore the heat also?
 

xtknight

Elite Member
Oct 15, 2004
12,974
0
71
Disclaimer: I'm talking out of my arse just as far as the rest of you.

I doubt that the GPU operates in any reduced/forced state at a high temp. Wouldn't you see lots of artifacts? Just one wrong bit and something could be completely the wrong color. And, certainly, they have to match the reference tests enough to be passable as DirectX compatible. These reference tests don't vary, they are executed on the CPU.

The chips are probably made out of materials with a higher tolerance (don't ask).

As far as I know, max temp for a Core 2 Duo CPU is specified at either 85C or 100C. Only then does it shut off. So, I don't think there is that much difference in sheer tolerance. A graphics card running at 85C is really the tip of the iceberg too, no? As to why it is hotter, probably just because there's more transistors on a GPU (and it's also done on a bigger manufacturing process).
 

BladeVenom

Lifer
Jun 2, 2005
13,365
16
0
Maybe because graphic card makers are more willing to let you burn out your GPU. Graphic cards seem to go bad at a much higher rate than processors.
 

nullpointerus

Golden Member
Apr 17, 2003
1,326
0
0
Originally posted by: phaxmohdem
Just curious, I was noticing that the max temp my 7800GTX can do before "core slow down" kicks in is 125 degrees C, how come CPU's only do 60-70 before going heat crazy?

Maybe it has something to do with clock speed:

7900 GTO: 650 MHz
Core 2 Duo E4300: 1800 MHz
 

Maximilian

Lifer
Feb 8, 2004
12,604
15
81
Thats a really good question... why the hell can they run so much hotter?

I think i asked the same question about the pentium M a while back (it can go to 100*C apprently) and why it could go so much hotter than the P4 without breaking. Cant remember the anwer i got though, but its likely the same reason why GPU's can go hotter.
 

Schadenfroh

Elite Member
Mar 8, 2003
38,416
4
0
I always thought it was because they do so many things in parallel while CPUs were just single / dual threaded. But, I have no facts or proof to back that up.
 

JBT

Lifer
Nov 28, 2001
12,094
1
81
I'm certain a video card running at 125C is going to have some problems. People here don't really like running over 80C or so and from what I've seen much over 90C and cards like the X1900XT's will start getting artifacts or locking up. Just because the software says 125C before it starts throttling doesn't mean its right.
Also it seems CPU manufacturers are more concerned about not producing so much heat. While GPU's manufacturers seems like they couldn't care less and design them to work with higher tolerances and are able to work correctly with a little more heat added to the equation.
 

Modular

Diamond Member
Jul 1, 2005
5,027
67
91
Originally posted by: nullpointerus
Maybe it has something to do with clock speed:

7900 GTO: 650 MHz
Core 2 Duo E4300: 1800 MHz


I believe he is on to something here. The lower the clockspeed, the higher the tolerance for error. As heat and clockspeed increase, the potential for errors increases as well.

Perhaps the lower clockspeed on the GPU allows it to operate at higher temps without errors.

Basically I'm positing that errors are directly related to some relationship between heat and clockspeed. As either heat or clockspeed decrease, the potential for errors decreases as well.

The basic mechanical build that the GPU is made from allows for lower clock speeds with more transistors, hence higher heat. Processors on the other hand, operate at a higher clockspeed but are restricted to lower temperatures before they break the threshold and errors occur.

I would like to point out that not only did I just come up with the garbage (potentially) that I just wrote, it's also early, I don't have enough coffee in me, and I was up pretty late last night...Spring Break 2k7 wo0t



 

xtknight

Elite Member
Oct 15, 2004
12,974
0
71
Hrm..Pentium Ms and Core Duos also operate at lower clock speeds. And Prescott operated at insane clock speeds.
 

happy medium

Lifer
Jun 8, 2003
14,387
480
126
Originally posted by: nullpointerus
Originally posted by: phaxmohdem
Just curious, I was noticing that the max temp my 7800GTX can do before "core slow down" kicks in is 125 degrees C, how come CPU's only do 60-70 before going heat crazy?

Maybe it has something to do with clock speed:

7900 GTO: 650 MHz
Core 2 Duo E4300: 1800 MHz

This is my guess!:thumbsup:
 

Gstanfor

Banned
Oct 19, 1999
3,307
0
0
For one thing, GPU's generally use many more transistors than CPU's do. More transistors mean more energy usage and more leakage current (that manifests itself partially as heat).

Also GPU's tend to be fabricated without much in the way of custom design libraries (though this is slowly changing with use of tech such as Arithmatica's CellMath libraries and other custom logic libraries such as those employed in G80's shader ALU's), they tend to be made on relatively low cost merchant processes, whereas CPU's are predominately custom logic and utilize cutting edge fabrication techniques.
 

Gatt

Member
Mar 30, 2005
81
0
0
Originally posted by: Modular
Originally posted by: nullpointerus
Maybe it has something to do with clock speed:

7900 GTO: 650 MHz
Core 2 Duo E4300: 1800 MHz


I believe he is on to something here. The lower the clockspeed, the higher the tolerance for error. As heat and clockspeed increase, the potential for errors increases as well.

Perhaps the lower clockspeed on the GPU allows it to operate at higher temps without errors.

Basically I'm positing that errors are directly related to some relationship between heat and clockspeed. As either heat or clockspeed decrease, the potential for errors decreases as well.

The basic mechanical build that the GPU is made from allows for lower clock speeds with more transistors, hence higher heat. Processors on the other hand, operate at a higher clockspeed but are restricted to lower temperatures before they break the threshold and errors occur.

I would like to point out that not only did I just come up with the garbage (potentially) that I just wrote, it's also early, I don't have enough coffee in me, and I was up pretty late last night...Spring Break 2k7 wo0t

Actually, the higher the temperature the more Silicon begins to conduct electricity. Silicon's semi-conductor properties decrease with heat.

GPU's heat tolerance is probably more due to it's lack of onboard cache. Since it's not storing large amounts of data, the leakage isn't as big an issue. A CPU starts leaking and it starts losing it's data in cache, a GPU starts leaking and it really doesn't lose anything that's being stored for long periods of time. Cache data loss will cause CPU errors. All the GPU will lose is a frame. The lower clock speeds are likely a function of heat as well, the more leakage you have, the stronger the signal you need to overcome it. The stronger your signal, the slower your signal(IIRC).
 

firewolfsm

Golden Member
Oct 16, 2005
1,848
29
91
I think it's a combination of all the posts so far, which make valid points. The clockspeed, the transistor count, lack of cache, they come together to make a more tolerant chip.
 

Munky

Diamond Member
Feb 5, 2005
9,372
0
76
125C is really too high for a GPU to handle. My x1900xt locks up whenever the temps go into the mid 90's, and my older x800gto locked up in the mid 80's, so the card makers might as well set the limit temp to 200C, it wouldn't mean that 199C is a safe operating temp.
 

Goi

Diamond Member
Oct 10, 1999
6,766
7
91
1) GPU product cycles are much shorter than CPU product cycles, so the physical design engineers don't have much time to hand tune/tweak transistor placement/layout/characteristics. They usually let software do it for them, hence physically they're not as optimal, and this leads to higher power consumption
2) Because of the shorter product cycle, GPU architects also have less time to make a more power/thermal aware architecture and use techniques such as dynamic voltage scaling, clock gating and other power/thermal aware features that modern CPUs have.
3) Modern GPUs have more transistors than CPUs
4) GPUs still lag behind CPUs in process technology by 1-2 generations since ATI/nvidia are fabless and rely on fabs like TMSC/UMC to produce the wafers for them. Intel and AMD however have their own fabs, and intel is especially good at process technology. Now that ATI has been acquired by AMD this may change, at least for DAAMIT cards.
5) GPU coolers are anemic compared to CPU coolers
 

nZone

Senior member
Jan 29, 2007
277
0
0
Because CPU has massive heatsink and 120mm fan while GPU has a measly small heatsink with small fan?

It's not apple to apple. If one was to mount that Thermalright Ultra 120 on a 8800GTX GPU; the temp would probably less than 20C. It would probably break that GTX in half with the massive weight.
 

Modular

Diamond Member
Jul 1, 2005
5,027
67
91
Originally posted by: nZone
Because CPU has massive heatsink and 120mm fan while GPU has a measly small heatsink with small fan?

It's not apple to apple. If one was to mount that Thermalright Ultra 120 on a 8800GTX GPU; the temp would probably less than 20C. It would probably break that GTX in half with the massive weight.

That doesn't explain the physical limitations of the silicone chip itself though. The question regards why a processor will lock up at much lower temps than a GPU, not why GPU's inherently run hotter.

 

Nanobaud

Member
Dec 9, 2004
144
0
0
I also can't speak directly to the GPU / CPU comparison, but drawing on analogies I can offer an additional point to consider. The geographical workload in a GPU is probably fairly evenly distributed across the chip. If the chip is at 100°, then the hottest on-chip temp is also probably near 100°. On a CPU, the hardest-working parts of the chip occupy a substantially smaller fraction of the total chip area. Predicting heat-flows and such based on speculation is not much of a road to understanding, but I don't think it unreasonable that a CPU with a chip temperature of 60° above ambient while dumping 100W could conceivably have spot temperatures that are 10°, 20°, or more above that.

Edit:
You could always turn off the thermal safety routines as these folks did:
3.8? GHz Duron

nBd