Go Back   AnandTech Forums > Software > Distributed Computing

Forums
· Hardware and Technology
· CPUs and Overclocking
· Motherboards
· Video Cards and Graphics
· Memory and Storage
· Power Supplies
· Cases & Cooling
· SFF, Notebooks, Pre-Built/Barebones PCs
· Networking
· Peripherals
· General Hardware
· Highly Technical
· Computer Help
· Home Theater PCs
· Consumer Electronics
· Digital and Video Cameras
· Mobile Devices & Gadgets
· Audio/Video & Home Theater
· Software
· Software for Windows
· All Things Apple
· *nix Software
· Operating Systems
· Programming
· PC Gaming
· Console Gaming
· Distributed Computing
· Security
· Social
· Off Topic
· Politics and News
· Discussion Club
· Love and Relationships
· The Garage
· Health and Fitness
· Merchandise and Shopping
· For Sale/Trade
· Hot Deals
· Free Stuff
· Contests and Sweepstakes
· Black Friday 2012
· Forum Issues
· Technical Forum Issues
· Personal Forum Issues
· Suggestion Box
· Moderator Resources
· Moderator Discussions
   

Reply
 
Thread Tools
Old 07-17-2012, 12:49 PM   #1
dave_the_nerd
Diamond Member
 
dave_the_nerd's Avatar
 
Join Date: Feb 2011
Posts: 3,475
Default Folding Question about GPUs

I know that nVidia cards are better at folding. But why are they better at folding?

I mean, what do they do differently? Two cards with similar gaming performance, the AMD will pull in about half the PPD of the nVidia.

But theoretical GFLOPs are usually similar, and in other GPGPU or OpenCL benchmarks, AMD's cards results will vary a lot - often beating the pants off of nVidia's cards.

So what gives? Is there an important architectural difference? Is the F@H Client coded better for CUDA than for OpenCL?
__________________
Scientific progress goes "Boink?"

http://boxenbase.com/browse/profile/dave_the_nerd
dave_the_nerd is offline   Reply With Quote
Old 07-17-2012, 01:47 PM   #2
VirtualLarry
Lifer
 
VirtualLarry's Avatar
 
Join Date: Aug 2001
Posts: 22,235
Default

Quote:
Originally Posted by dave_the_nerd View Post
I know that nVidia cards are better at folding. But why are they better at folding?

Is the F@H Client coded better for CUDA than for OpenCL?
That's basically what it comes down to. The AMD client doesn't get enough dev support and is relatively un-optimized.
__________________
Rig(s) not listed, because I change computers, like some people change their socks.
VirtualLarry is offline   Reply With Quote
Old 07-17-2012, 03:16 PM   #3
ZipSpeed
Golden Member
 
ZipSpeed's Avatar
 
Join Date: Aug 2007
Location: Calgary, Canada
Posts: 1,095
Default

Yeah, it boils down to Nvidia dumping a ton more money into software than AMD.
__________________
PLAY, WORK & CRUNCH
(1) i7 3770K @ 4.2 GHz | 7950 @ 1000/1300 MHz (2) i7 2600K @ 4.2 GHz | GTX 580 @ 850/1700/1050 MHz (3) i7 950 @ 4.0 GHz | 7870 @ 1100/1500 MHz
CRUNCH
(1) i5 2500K @ 4.3 GHz | GT 240 (2) i7 970 @ 3.9 GHz | GTX 480 @ 775/1550/924 MHz (3) Q9550 @ 3.4 GHz | 5850 @ 900/1250 MHz (4) X6 1055T @ 3.5 GHz | 2 x GTX 460 @ 825/1650/900 MHz
ZipSpeed is offline   Reply With Quote
Old 07-18-2012, 03:24 AM   #4
pandemonium
Senior Member
 
pandemonium's Avatar
 
Join Date: Mar 2011
Location: Augusta, GA
Posts: 916
Default

If bitcoin mining is a representation of anything, I'd suspect folding would benefit greatly if AMDs cards were optimized on the software side. I'm just speculating, of course.
__________________
Custom Cost effectiveness spreadsheet.
Power usage detail thread found here.
We are but shadows of our achievements and dust of the stars; empowering the universe to have conciousness.

"Fighting ignorance is bliss." - Me
pandemonium is offline   Reply With Quote
Old 07-18-2012, 08:06 AM   #5
blckgrffn
Diamond Member
 
blckgrffn's Avatar
 
Join Date: May 2003
Location: Blaine, MN
Posts: 6,574
Default

Quote:
Originally Posted by ZipSpeed View Post
Yeah, it boils down to Nvidia dumping a ton more money into software than AMD.
Yup, there are a lot of CUDA wu's that get good PPD, not so much with OpenCL.

ASAIK, there different "strands" of research and the scientist either take the "easy" way out and do a CPU Gromacs (?) based simulation or put the extra grunt in and build a GPU optimized one. Nvidia is helping them do this using CUDA, and so the scientists can spend less time building the simulation and more time doing sciency things. Like quibbling with the local comic book shop owner or figuring out the optimal couch placement for the airflow in their apartment.

Since AMD chooses not to do this, OpenCL just kind of sits there, at least we've seen a little movement there.

For some time, I thought it would be quite the coup if AMD worked with the F@H on even one massively GCN optimized batch of work. Even one project would be enough to get a lot of attention and I think it could probably move enough cards to fund a full time AMD helper.

Who knows, maybe AMD will actually get serious about these things themselves when they have APUs with unified CPU/GPU memory space. I am not holding my breath, though.

Bit coin mining isn't the only place where AMD OpenCL has proven powerful, look at Milkway @ Home or POEM, both work very well. It does seem like CUDA is more CPU effecient - most of the work must occur on the GPU. With the AMD OpenCL tasks we've seen, it appears that a lot of grunt still happens on the CPU and the GPU is truly a co-processor dealing with a subset of the calculation functions, albeit very effectively.
blckgrffn is offline   Reply With Quote
Old 08-13-2012, 01:43 AM   #6
lakedude
Golden Member
 
Join Date: Mar 2009
Posts: 1,026
Default

I've heard that ATI works great on simple repetitive parallel tasks, like Collatz, and that nVidia is better for more complex work. Something about ATI having lots of little GPU brains vs nVidia having fewer bigger GPU brains or some such...
lakedude is offline   Reply With Quote
Old 08-17-2012, 10:40 PM   #7
Ken g6
Elite Member
 
Ken g6's Avatar
 
Join Date: Dec 1999
Location: Colorado
Posts: 7,326
Default

Let's see...

From what I remember, AMD - er, it was ATI then - got an early start on GPGPU. They had that advantage, the CAL low-level interface, and more total core-cycles. They had some disadvantages, though. First, they had a high-level language available, instead of CAL, that was absolutely impossible to figure out. (I forget its name, but it wasn't OpenCL.)

Second, they had sets of five cores operated by one instruction. Not five cores running five instances of one instruction with different data; five cores together working on one instruction. If you could access the hardware at a low enough level (CAL), and keep your register use low enough that all five (or at least four) cores could work on some data at the same time (usually with a very few vector variables), you could make ATI cards run insanely fast. But it's hard to set up software to run like that.

nVidia, on the other hand, had one core dedicated to one instruction, a much simpler model to program for, and CUDA, a much simpler way to program, which is still well-suited to its hardware. nVidia also had better documentation and an emulator, so people without any video card could try writing stuff. So everybody started writing in CUDA.

Finally, AMD added OpenCL. But OpenCL is not well-suited to its hardware, even if you use vectors, which you basically have to to make OpenCL at all fast on AMD. So AMD was still harder to program for and less efficient than nVidia.

Eventually with GF104 and GK104, 1.5 cores were dedicated to one instruction, and AMD moved down to 4 cores per instruction with the 6000 series, but these didn't make much difference. The 7000 series is much better for computing and better-suited to OpenCL, but not that many people have 7000 series cards yet.
__________________
Look for really BIG primes with TeAm AnandTech PrimeGrid!
Next race: June 14-17

vBulletin looking run down? Get your script fix here.
Ken g6 is offline   Reply With Quote
Old 08-18-2012, 07:55 AM   #8
GLeeM
Elite Member
 
GLeeM's Avatar
 
Join Date: Apr 2004
Posts: 6,009
Default

Quote:
Originally Posted by Ken g6 View Post
First, they had a high-level language available, instead of CAL, that was absolutely impossible to figure out. (I forget its name, but it wasn't OpenCL.)
Was the name Brooke? It only worked on ATI 1950 cards, if you could get it working! Gave Top PPD for a while
Until they dropped it completely for nVidia.
__________________
Remember to spend time with loved ones.
GLeeM is offline   Reply With Quote
Old 08-23-2012, 05:17 PM   #9
Assimilator1
Elite Member
 
Assimilator1's Avatar
 
Join Date: Nov 1999
Location: UK,Surrey,Guildford
Posts: 21,643
Default

Yea I remember folding with my 1950 Pro for a while , until, like you said, it was dropped like a stone .
__________________
Want to join mankinds ultimate search using just a PC? - SETI@Home.
Or help design a new particle accelerator to study neutrinos? - DPAD.
Or help research diseases? - Folding@Home.
Checkout these & other DC projects here,& help advance scientific research through Team AnandTech
My rigs:-
Q6600 @3.24GHz(1.3875v), 405MHz FSB tRD7, 2x2GB DDR2 1066 @1083MHz, HD4870 1GB
E5200 @3.6GHz(1.35v), 343MHz FSB tRD7, 2x1GB DDR2 800 @824MHz, GTX 260 c216
Assimilator1 is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT -5. The time now is 05:02 AM.


Powered by vBulletin® Version 3.8.7
Copyright ©2000 - 2013, vBulletin Solutions, Inc.