NVIDIA GTX 280 High Failure Rates/RMA

Page 2 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

thilanliyan

Lifer
Jun 21, 2005
12,048
2,262
126
Originally posted by: Ocguy31
You dont quote the part where I say it is a problem, we just dont know how widespread...you prove my point oh so well.

That wasn't my issue with your post. You claiming the OP is an ATI employee and that it's an ATI dominated forum was my issue (Lol you should have been here during the G80 days :)). You implied with your post that the thread is about sensationalism (and THIS thread certainly wasn't if you actually read through the posted links) and shouldn't have been posted...but I think it should so I responded.
 

Sylvanas

Diamond Member
Jan 20, 2004
3,752
0
0
Originally posted by: thilan29
Originally posted by: Ocguy31
You dont quote the part where I say it is a problem, we just dont know how widespread...you prove my point oh so well.

That wasn't my issue with your post. You claiming the OP is an ATI employee and that it's an ATI dominated forum was my issue (Lol you should have been here during the G80 days :)). You implied with your post that the thread is about sensationalism (and THIS thread certainly wasn't if you actually read through the posted links) and shouldn't have been posted...but I think it should so I responded.

Don't waste your time. There are certain members of this forum that will spout Nvidia to the last breath, no amount of reason will appeal to them.
 

OCGuy

Lifer
Jul 12, 2000
27,224
37
91
Originally posted by: thilan29
Originally posted by: Ocguy31
You dont quote the part where I say it is a problem, we just dont know how widespread...you prove my point oh so well.

That wasn't my issue with your post. You claiming the OP is an ATI employee and that it's an ATI dominated forum was my issue (Lol you should have been here during the G80 days :)). You implied with your post that the thread is about sensationalism (and THIS thread certainly wasn't if you actually read through the posted links) and shouldn't have been posted...but I think it should so I responded.


Ah, fair enough. But I posted the link where a mod had to lock his thread the other day for sensationalism, so that was where I was coming from. (BTW, before my 8800GTS 640, I was X800, X1300, X1900GT)


Thinking about this critically:


If the problem is on X% of cards, and is so easily solved by a couple screws, wouldnt NV know this? Why wouldnt the partners just halt shipping and fix them?

From some of the complaints, they are only getting to the 90s. Most GPU's will still operate in the 90C range. Are they counting on the fact that not everyone is constantly checking the tempurature of thier GPUs like enthusiasts do?

Someone said that people are getting multiple bad cards...if you are BFG, and someone RMA's a card for this issue, why wouldnt you test the card that it is being replaced with? Something isnt adding up.

So that leaves two options basically:

A. The percentage of bad cards is low, so they are rolling the dice as far as not checking them all and seeing how many they get back.

B. They dont give a crap
 

Cookie Monster

Diamond Member
May 7, 2005
5,161
32
86
The thing is that most of the "overheating" issues are related to the GPU itself, not because of issues such as the HSF being loose (this is one of those rare freak accidents each happens to almost all products). People with overheating cards tried re-sitting the HSF, using a better thermal adhesive and other various methods to stop their cards from overheating. Obviously these attempts were futile at best and they had to RMA their cards anyway.

My theory is that its a manufacturing problem (TSMC) and maybe a design problem (nVIDIA). However given the fact that working cards do work wonderfully well, the latter seems unlikely. Since nVIDIA is a fabless company afterall, i cant see how they can be blamed for poor packaging/manufacturing of their products although its rather funny how TSMC/UMC have been awfully quiet in recent times.

It would be nice if some of investigation took place, such as which partners are affected, what the issue really is etc etc. It seems like BFG/EVGA are the major partners to be affected by this. (and other partners seems unaffected by this so it could have been a bad batch from the initial wave of G200 cores for example).

edit - Seems to affect the GTX280 the most compared to the GTX260.
 

Keysplayr

Elite Member
Jan 16, 2003
21,211
50
91
Originally posted by: magreen
I've just learned to not trust posters who RandOmly capitalize the First Letter in words for no reason.

Why wouldn't you trust the OP? The only other people I have seen capitalize words in that fashion are attorneys. They like to emphasize/sensationalize most of their words for importance because they feel most of what they say are important enough to capitalize. Mr Fox fits this bill pretty closely, which is why this very subject has come up in the past. I myself in the past have thought Mr Fox was trying to pump up public forums to try to ignite class actions suits. Lots of money there for attorneys. But Mr Fox already expressed that he is not an attorney in past threads. So until he gives a reason not to trust him, why not?
 

Mr Fox

Senior member
Sep 24, 2006
876
0
76
Originally posted by: Cookie Monster
The thing is that most of the "overheating" issues are related to the GPU itself, not because of issues such as the HSF being loose (this is one of those rare freak accidents each happens to almost all products). People with overheating cards tried re-sitting the HSF, using a better thermal adhesive and other various methods to stop their cards from overheating. Obviously these attempts were futile at best and they had to RMA their cards anyway.

My theory is that its a manufacturing problem (TSMC) and maybe a design problem (nVIDIA). However given the fact that working cards do work wonderfully well, the latter seems unlikely. Since nVIDIA is a fabless company afterall, i cant see how they can be blamed for poor packaging/manufacturing of their products although its rather funny how TSMC/UMC have been awfully quiet in recent times.

It would be nice if some of investigation took place, such as which partners are affected, what the issue really is etc etc. It seems like BFG/EVGA are the major partners to be affected by this. (and other partners seems unaffected by this so it could have been a bad batch from the initial wave of G200 cores for example).

edit - Seems to affect the GTX280 the most compared to the GTX260.




All nV Cards are built on Foxconn Assembly lines in Taiwan, and PRC (Shenzhen). The cards are built to nV BOM, and then packaged, and shipped to the partners.

EVGA, and BFG are the more dominant brands, the more exposure in the field (Sales) the more you will see the issue.

I'll make an attempt to find someone that has pulled one apart...



 

ViRGE

Elite Member, Moderator Emeritus
Oct 9, 1999
31,516
167
106
Let's take it down a notch guys. This thread started out well enough, if you're going to throw around accusations of fanboyism that aren't constructively adding to the thread, I'm going to throw you out instead.

-ViRGE
 

Keysplayr

Elite Member
Jan 16, 2003
21,211
50
91
Originally posted by: Cookie Monster
The thing is that most of the "overheating" issues are related to the GPU itself, not because of issues such as the HSF being loose (this is one of those rare freak accidents each happens to almost all products). People with overheating cards tried re-sitting the HSF, using a better thermal adhesive and other various methods to stop their cards from overheating. Obviously these attempts were futile at best and they had to RMA their cards anyway.

My theory is that its a manufacturing problem (TSMC) and maybe a design problem (nVIDIA). However given the fact that working cards do work wonderfully well, the latter seems unlikely. Since nVIDIA is a fabless company afterall, i cant see how they can be blamed for poor packaging/manufacturing of their products although its rather funny how TSMC/UMC have been awfully quiet in recent times.

It would be nice if some of investigation took place, such as which partners are affected, what the issue really is etc etc. It seems like BFG/EVGA are the major partners to be affected by this. (and other partners seems unaffected by this so it could have been a bad batch from the initial wave of G200 cores for example).

edit - Seems to affect the GTX280 the most compared to the GTX260.

The thing is, I have read most of the threads regarding this issue. Most report the coolers enclosure getting extremely hot, which leads me to believe that the cooler is most likely making good contact with the core. And this leads me to think that there might be some sort of power regulation problem which doesn't necessarily mean the GPU core itself. There are several components on the card that control power flow. Without really knowing the answer, I would guess there is more voltage being sent to the core under load than intended. Could be as simple as a defective regulator/mosfet, which would point at a few board partners and the companies they purchased those regulators/mosfets from. Just my guess at this point.
 

Ryl3x

Banned
Nov 28, 2007
34
0
0
ok guys....i just threw away my gtx280 and ordered a 4870 in light of this news.

THANK GOD FOR ATI!


fing morons
 

Keysplayr

Elite Member
Jan 16, 2003
21,211
50
91
Originally posted by: Mr Fox
Originally posted by: Cookie Monster
The thing is that most of the "overheating" issues are related to the GPU itself, not because of issues such as the HSF being loose (this is one of those rare freak accidents each happens to almost all products). People with overheating cards tried re-sitting the HSF, using a better thermal adhesive and other various methods to stop their cards from overheating. Obviously these attempts were futile at best and they had to RMA their cards anyway.

My theory is that its a manufacturing problem (TSMC) and maybe a design problem (nVIDIA). However given the fact that working cards do work wonderfully well, the latter seems unlikely. Since nVIDIA is a fabless company afterall, i cant see how they can be blamed for poor packaging/manufacturing of their products although its rather funny how TSMC/UMC have been awfully quiet in recent times.

It would be nice if some of investigation took place, such as which partners are affected, what the issue really is etc etc. It seems like BFG/EVGA are the major partners to be affected by this. (and other partners seems unaffected by this so it could have been a bad batch from the initial wave of G200 cores for example).

edit - Seems to affect the GTX280 the most compared to the GTX260.




All nV Cards are built on Foxconn Assembly lines in Taiwan, and PRC (Shenzhen). The cards are built to nV BOM, and then packaged, and shipped to the partners.

EVGA, and BFG are the more dominant brands, the more exposure in the field (Sales) the more you will see the issue.

I'll make an attempt to find someone that has pulled one apart...

Does Foxconn build to exact BOM products? Or Values?
Example: Foxconn needs to purchase 50000 mosfets/capacitors/resistors whatever
that have certain values in their function. ohms resistance value, charge value, etc.
Does Nvidia explicity say which vendor to purchase from? Or is that left up to Foxconn to find parts with these exact values, no matter who manufactures them? Or are they restricted to purchasing mosfet X from company Y? And even then, what kind of QC does the mosfet company have? There are hundreds of parts that go into making these cards, so it could very well take some time to track down any problems with various parts, or even combinations of these parts. We'll eventually know what is going on.
 

Cookie Monster

Diamond Member
May 7, 2005
5,161
32
86
When i mean "packaging/manufacturing" i meant the GPU specifically, not the PCB where this is up to foxconn (initially that is).
 

faxon

Platinum Member
May 23, 2008
2,109
1
81
ooh god i remember the issue i had with my 7900GT. i RMAed it, and it actually came back WORSE than before i RMAed it. ended up giving it to a neighbor, whose mom just so happened to work for nvidia. she went in to work and swapped it for a working 7900GT (not OC). worst $300 i have ever spent. im still pissed off just thinking about it to this day. as for the issue with the 280s, im glad i decided to just hold onto my 2900PRO now. 2 gamer buddies of mine (brothers) both bought dual 280s and all 4 cards ran into this issue within about 2 days of use. im probably going to hold out for Larrabee at this point lol
 

Mr Fox

Senior member
Sep 24, 2006
876
0
76
Originally posted by: keysplayr2003
Originally posted by: Mr Fox
Originally posted by: Cookie Monster
The thing is that most of the "overheating" issues are related to the GPU itself, not because of issues such as the HSF being loose (this is one of those rare freak accidents each happens to almost all products). People with overheating cards tried re-sitting the HSF, using a better thermal adhesive and other various methods to stop their cards from overheating. Obviously these attempts were futile at best and they had to RMA their cards anyway.

My theory is that its a manufacturing problem (TSMC) and maybe a design problem (nVIDIA). However given the fact that working cards do work wonderfully well, the latter seems unlikely. Since nVIDIA is a fabless company afterall, i cant see how they can be blamed for poor packaging/manufacturing of their products although its rather funny how TSMC/UMC have been awfully quiet in recent times.

It would be nice if some of investigation took place, such as which partners are affected, what the issue really is etc etc. It seems like BFG/EVGA are the major partners to be affected by this. (and other partners seems unaffected by this so it could have been a bad batch from the initial wave of G200 cores for example).

edit - Seems to affect the GTX280 the most compared to the GTX260.




All nV Cards are built on Foxconn Assembly lines in Taiwan, and PRC (Shenzhen). The cards are built to nV BOM, and then packaged, and shipped to the partners.

EVGA, and BFG are the more dominant brands, the more exposure in the field (Sales) the more you will see the issue.

I'll make an attempt to find someone that has pulled one apart...

Does Foxconn build to exact BOM products? Or Values?
Example: Foxconn needs to purchase 50000 mosfets/capacitors/resistors whatever
that have certain values in their function. ohms resistance value, charge value, etc.
Does Nvidia explicity say which vendor to purchase from? Or is that left up to Foxconn to find parts with these exact values, no matter who manufactures them? Or are they restricted to purchasing mosfet X from company Y? And even then, what kind of QC does the mosfet company have? There are hundreds of parts that go into making these cards, so it could very well take some time to track down any problems with various parts, or even combinations of these parts. We'll eventually know what is going on.


Foxconn builds per reference BOM, that is supplied from nV Product Engineering...


I have seen Two Posts that this seems to be the case, yes...

where the guys power consumption went from 350 to like 550 watts...

Something like that can be drivers, or hardware also,

there were a few posts in the NZone, and nVNews trying to help trouble shoot.

The ones that I was talking about were the ones over at Xtreme, and the one guy found an airgap in the Thermal Material between the Copper Contact Pad, and The GPU, and two or three that had loose Screws that when tightened fixed the issue.. Unfortunately no pictures..





 

JPB

Diamond Member
Jul 4, 2005
4,064
89
91
Originally posted by: WaTaGuMp
I have a MUCH better thread idea.

Poster #1 ATI makes the shittiest drivers, ill never buy one of those cards.

Poster#2 Nvidia drivers suck, ATI makes much better drivers.

:laugh:
 

ajaidevsingh

Senior member
Mar 7, 2008
563
0
0
I tested a card "GTX 280" for more than 4 hours on ati tool mainly and i fond artifacts that were caused by the defective ram..

The card had to RMA'ed and still waiting for the return!!
 

Keysplayr

Elite Member
Jan 16, 2003
21,211
50
91
Originally posted by: Mr Fox
Originally posted by: keysplayr2003
Originally posted by: Mr Fox
Originally posted by: Cookie Monster
The thing is that most of the "overheating" issues are related to the GPU itself, not because of issues such as the HSF being loose (this is one of those rare freak accidents each happens to almost all products). People with overheating cards tried re-sitting the HSF, using a better thermal adhesive and other various methods to stop their cards from overheating. Obviously these attempts were futile at best and they had to RMA their cards anyway.

My theory is that its a manufacturing problem (TSMC) and maybe a design problem (nVIDIA). However given the fact that working cards do work wonderfully well, the latter seems unlikely. Since nVIDIA is a fabless company afterall, i cant see how they can be blamed for poor packaging/manufacturing of their products although its rather funny how TSMC/UMC have been awfully quiet in recent times.

It would be nice if some of investigation took place, such as which partners are affected, what the issue really is etc etc. It seems like BFG/EVGA are the major partners to be affected by this. (and other partners seems unaffected by this so it could have been a bad batch from the initial wave of G200 cores for example).

edit - Seems to affect the GTX280 the most compared to the GTX260.




All nV Cards are built on Foxconn Assembly lines in Taiwan, and PRC (Shenzhen). The cards are built to nV BOM, and then packaged, and shipped to the partners.

EVGA, and BFG are the more dominant brands, the more exposure in the field (Sales) the more you will see the issue.

I'll make an attempt to find someone that has pulled one apart...

Does Foxconn build to exact BOM products? Or Values?
Example: Foxconn needs to purchase 50000 mosfets/capacitors/resistors whatever
that have certain values in their function. ohms resistance value, charge value, etc.
Does Nvidia explicity say which vendor to purchase from? Or is that left up to Foxconn to find parts with these exact values, no matter who manufactures them? Or are they restricted to purchasing mosfet X from company Y? And even then, what kind of QC does the mosfet company have? There are hundreds of parts that go into making these cards, so it could very well take some time to track down any problems with various parts, or even combinations of these parts. We'll eventually know what is going on.


Foxconn builds per reference BOM, that is supplied from nV Product Engineering...

Still doesn't tell us if Foxconn buys from specified parts manufacturers. Foxconn could very well be buying equal "value" (not monetary value, but specification value) parts from a broad spectrum of manufacturers. Foxconn may be following the BOM values, but might not be purchasing from the same manufacturer every time. They could even be buying the same part (e.g. capacitors) from multiple manufacturers even though they have the same "technical value". Different brand, same value or purpose.

I have seen Two Posts that this seems to be the case, yes...

From whom?

where the guys power consumption went from 350 to like 550 watts...

This again points to power regulation, to me anyway.

Something like that can be drivers, or hardware also,

there were a few posts in the NZone, and nVNews trying to help trouble shoot.

As you can probably tell by now, this can get pretty complicated and almost impossible to single out any specific entity to blame for it. It will take time to find out what exactly is going on. I can see a poor cooler contact a cause for overheating, but according to these users, they are reporting super hot cooler enclosures telling me that the cooler IS making good contact with the core and other respective parts that need cooling. If the cooler was NOT making good contact, I don't think the cooler would be so hot to the touch.

The ones that I was talking about were the ones over at Xtreme, and the one guy found an airgap in the Thermal Material between the Copper Contact Pad, and The GPU, and two or three that had loose Screws that when tightened fixed the issue.. Unfortunately no pictures..

While I could see a poorly seated cooler can be tightened to improve cooler to card contact, I don't see how anyone can say they found an "airgap" without physically removing the cooler from the core, all the while the actual "act" of removing the cooler from the card can cause the thermal compound to "appear" like it had an airgap. I'm not saying that it isn't possible, but I am saying that it is not very conclusive.

 

thilanliyan

Lifer
Jun 21, 2005
12,048
2,262
126
Originally posted by: Ocguy31
From some of the complaints, they are only getting to the 90s. Most GPU's will still operate in the 90C range. Are they counting on the fact that not everyone is constantly checking the tempurature of thier GPUs like enthusiasts do?

A bunch of the posts I saw from the links said that their GPUs got to exactly 105C (maybe the temperature diode isn't measuring properly passed that temp?) then crashed or started throttling (indicated by sudden stuttering during gaming).

I'm surprised that some people are getting multiple defective cards. The odds of that happening are pretty slim...I myself have only ever had to RMA one card an ATI 3870.

Like the laptop problem, I don't think we'll find out exactly what happened unless it's a massive problem or they do some sort of recall.
 

Dadofamunky

Platinum Member
Jan 4, 2005
2,184
0
0
My guess is that the OEMs/resellers will be able to address this issue pretty quickly.
 

n7

Elite Member
Jan 4, 2004
21,281
4
81
Originally posted by: thilan29
Originally posted by: Ocguy31
From some of the complaints, they are only getting to the 90s. Most GPU's will still operate in the 90C range. Are they counting on the fact that not everyone is constantly checking the tempurature of thier GPUs like enthusiasts do?

A bunch of the posts I saw from the links said that their GPUs got to exactly 105C (maybe the temperature diode isn't measuring properly passed that temp?) then crashed or started throttling (indicated by sudden stuttering during gaming).

I'm surprised that some people are getting multiple defective cards. The odds of that happening are pretty slim...I myself have only ever had to RMA one card an ATI 3870.

Like the laptop problem, I don't think we'll find out exactly what happened unless it's a massive problem or they do some sort of recall.

As soon as the card hit 105c, it starts throttling bigtime.

It usually will stutter/lag/chug a little bit starting around 95-100C.

Once it hits 105C, it starts lagging like mad, obviously clocks/voltages are being forced lower majorly, as it'll do whatever necessary to keep things @ 105C or lower.

Unfortunately, 105C or lower doesn't help, since games are unplayable when you are seeing lag & stuttering like mad, not to mention the artifacting.

My two defective cards do the exact same thing...the second one takes a wee bit longer to cook itself up to 105C, but it's still only a couple minutes of steady temps increase before it hits barbeque time.

So i can play GRID for about 1.5-2 minutes before becomes it unplayable.

Or i can run Furmark for a one minute benchmark & it'll have hit 100+C in less than a minute.