|
|
 |
07-17-2012, 12:49 PM
|
#1
|
|
Diamond Member
Join Date: Feb 2011
Posts: 3,475
|
Folding Question about GPUs
I know that nVidia cards are better at folding. But why are they better at folding?
I mean, what do they do differently? Two cards with similar gaming performance, the AMD will pull in about half the PPD of the nVidia.
But theoretical GFLOPs are usually similar, and in other GPGPU or OpenCL benchmarks, AMD's cards results will vary a lot - often beating the pants off of nVidia's cards.
So what gives? Is there an important architectural difference? Is the F@H Client coded better for CUDA than for OpenCL?
|
|
|
07-17-2012, 01:47 PM
|
#2
|
|
Lifer
Join Date: Aug 2001
Posts: 22,235
|
Quote:
Originally Posted by dave_the_nerd
I know that nVidia cards are better at folding. But why are they better at folding?
Is the F@H Client coded better for CUDA than for OpenCL?
|
That's basically what it comes down to. The AMD client doesn't get enough dev support and is relatively un-optimized.
__________________
Rig(s) not listed, because I change computers, like some people change their socks.
|
|
|
07-17-2012, 03:16 PM
|
#3
|
|
Golden Member
Join Date: Aug 2007
Location: Calgary, Canada
Posts: 1,095
|
Yeah, it boils down to Nvidia dumping a ton more money into software than AMD.
__________________
PLAY, WORK & CRUNCH
(1) i7 3770K @ 4.2 GHz | 7950 @ 1000/1300 MHz (2) i7 2600K @ 4.2 GHz | GTX 580 @ 850/1700/1050 MHz (3) i7 950 @ 4.0 GHz | 7870 @ 1100/1500 MHz
CRUNCH
(1) i5 2500K @ 4.3 GHz | GT 240 (2) i7 970 @ 3.9 GHz | GTX 480 @ 775/1550/924 MHz (3) Q9550 @ 3.4 GHz | 5850 @ 900/1250 MHz (4) X6 1055T @ 3.5 GHz | 2 x GTX 460 @ 825/1650/900 MHz
|
|
|
07-18-2012, 03:24 AM
|
#4
|
|
Senior Member
Join Date: Mar 2011
Location: Augusta, GA
Posts: 916
|
If bitcoin mining is a representation of anything, I'd suspect folding would benefit greatly if AMDs cards were optimized on the software side. I'm just speculating, of course.
__________________
Custom Cost effectiveness spreadsheet.
Power usage detail thread found here.
We are but shadows of our achievements and dust of the stars; empowering the universe to have conciousness.
"Fighting ignorance is bliss." - Me
|
|
|
07-18-2012, 08:06 AM
|
#5
|
|
Diamond Member
Join Date: May 2003
Location: Blaine, MN
Posts: 6,574
|
Quote:
Originally Posted by ZipSpeed
Yeah, it boils down to Nvidia dumping a ton more money into software than AMD.
|
Yup, there are a lot of CUDA wu's that get good PPD, not so much with OpenCL.
ASAIK, there different "strands" of research and the scientist either take the "easy" way out and do a CPU Gromacs (?) based simulation or put the extra grunt in and build a GPU optimized one. Nvidia is helping them do this using CUDA, and so the scientists can spend less time building the simulation and more time doing sciency things. Like quibbling with the local comic book shop owner or figuring out the optimal couch placement for the airflow in their apartment.
Since AMD chooses not to do this, OpenCL just kind of sits there, at least we've seen a little movement there.
For some time, I thought it would be quite the coup if AMD worked with the F@H on even one massively GCN optimized batch of work. Even one project would be enough to get a lot of attention and I think it could probably move enough cards to fund a full time AMD helper.
Who knows, maybe AMD will actually get serious about these things themselves when they have APUs with unified CPU/GPU memory space. I am not holding my breath, though.
Bit coin mining isn't the only place where AMD OpenCL has proven powerful, look at Milkway @ Home or POEM, both work very well. It does seem like CUDA is more CPU effecient - most of the work must occur on the GPU. With the AMD OpenCL tasks we've seen, it appears that a lot of grunt still happens on the CPU and the GPU is truly a co-processor dealing with a subset of the calculation functions, albeit very effectively.
|
|
|
08-13-2012, 01:43 AM
|
#6
|
|
Golden Member
Join Date: Mar 2009
Posts: 1,026
|
I've heard that ATI works great on simple repetitive parallel tasks, like Collatz, and that nVidia is better for more complex work. Something about ATI having lots of little GPU brains vs nVidia having fewer bigger GPU brains or some such...
|
|
|
08-17-2012, 10:40 PM
|
#7
|
|
Elite Member
Join Date: Dec 1999
Location: Colorado
Posts: 7,326
|
Let's see...
From what I remember, AMD - er, it was ATI then - got an early start on GPGPU. They had that advantage, the CAL low-level interface, and more total core-cycles. They had some disadvantages, though. First, they had a high-level language available, instead of CAL, that was absolutely impossible to figure out. (I forget its name, but it wasn't OpenCL.)
Second, they had sets of five cores operated by one instruction. Not five cores running five instances of one instruction with different data; five cores together working on one instruction. If you could access the hardware at a low enough level (CAL), and keep your register use low enough that all five (or at least four) cores could work on some data at the same time (usually with a very few vector variables), you could make ATI cards run insanely fast. But it's hard to set up software to run like that.
nVidia, on the other hand, had one core dedicated to one instruction, a much simpler model to program for, and CUDA, a much simpler way to program, which is still well-suited to its hardware. nVidia also had better documentation and an emulator, so people without any video card could try writing stuff. So everybody started writing in CUDA.
Finally, AMD added OpenCL. But OpenCL is not well-suited to its hardware, even if you use vectors, which you basically have to to make OpenCL at all fast on AMD. So AMD was still harder to program for and less efficient than nVidia.
Eventually with GF104 and GK104, 1.5 cores were dedicated to one instruction, and AMD moved down to 4 cores per instruction with the 6000 series, but these didn't make much difference. The 7000 series is much better for computing and better-suited to OpenCL, but not that many people have 7000 series cards yet.
__________________
Look for really BIG primes with TeAm AnandTech PrimeGrid! 
Next race: June 14-17
vBulletin looking run down? Get your script fix here.
|
|
|
08-18-2012, 07:55 AM
|
#8
|
|
Elite Member
Join Date: Apr 2004
Posts: 6,009
|
Quote:
Originally Posted by Ken g6
First, they had a high-level language available, instead of CAL, that was absolutely impossible to figure out. (I forget its name, but it wasn't OpenCL.)
|
Was the name Brooke? It only worked on ATI 1950 cards, if you could get it working! Gave Top PPD for a while 
Until they dropped it completely for nVidia.
__________________
Remember to spend time with loved ones.
|
|
|
08-23-2012, 05:17 PM
|
#9
|
|
Elite Member
Join Date: Nov 1999
Location: UK,Surrey,Guildford
Posts: 21,643
|
Yea I remember folding with my 1950 Pro for a while  , until, like you said, it was dropped like a stone  .
|
|
|
Posting Rules
|
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
HTML code is Off
|
|
|
All times are GMT -5. The time now is 05:02 AM.
|