[computerworld] Nvidia, AMD release graphics processors for supercomputing

Final8ty

Golden Member
Jun 13, 2007
1,172
13
81
Nvidia's K20 and K20X, and AMD's FirePro SM10000 GPUs deliver more than a teraflop of double-precision performance

Nvidia and Advanced Micro Devices on Monday announced high-performance graphics chips for supercomputers.

Nvidia announced GPUs (graphics processing units) called K20 and K20X, with the latter being used in Titan, a 20-petaflop supercomputer at the U.S. Department of Energy's Oak Ridge National Laboratory. AMD announced the FirePro SM10000 graphics processor, which is targeted at high-performance computers and servers in virtualized environments.

Co-processors like GPUs are considered more powerful that CPUs for specific tasks such as scientific and math applications. The GPUs are important in providing more computing power for simulation and experimentation in research areas such as biosciences, climate, energy and space. IBM and Intel also offer accelerators for supercomputers.


Some of the world's fastest supercomputers today harness the processing power of CPUs and graphics chips for complex calculations. The Titan supercomputer pairs 18,688 Nvidia Tesla K20X GPUs with 18,688 AMD 16-core Opteron 6274 CPUs, with the GPUs handling 80 to 90 percent of the processing load. Other supercomputers that pair CPUs and GPUs include the Tianhe-1A at the National Supercomputer Center in Tianjin, China.

Nvidia has a big lead over AMD in GPUs for supercomputing, said Dan Olds, principal analyst at Gabriel Consulting Group.

Nvidia pushed parallel programming tools many years ago so coders could write applications for GPUs, Olds said. AMD has virtually no presence in the supercomputing market and needs to foster a programming environment for parallel frameworks like OpenCL to be a worthy alternative to Nvidia, Intel and other companies, Olds said.

Nvidia's K20 has 5GB of memory and delivers 1.17 teraflops of double-precision performance and 3.52 teraflops of single-precision performance. Double-precision performance is more important for supercomputing applications as it carries higher precision for floating-point calculations than single-precision calculations. The faster K20X has 6GB of memory and delivers 1.31 teraflops of double-precision performance. The K20X is three times faster than its predecessor, the Tesla M2090, which was released in the middle of last year.

The K20 products will be available in computers from companies such as Hewlett-Packard, IBM, Asus, Fujitsu, Tyan, Quanta Computer and Cray. Nvidia declined to provide pricing, saying the GPUs would be sold through server vendors.

The new chips have thousands of small processing cores that will be able to more effectively execute application code simultaneously. The Hyper-Q feature will speed up execution of legacy code through smarter scheduling of code execution.

AMD claimed that its FirePro SM10000 delivered 1.48 teraflops of peak double-precision performance. The graphics card has 6GB of memory.

The SM10000 is designed for multiple server deployments, AMD said in a slide presentation. GPUs in servers are capable of deploying virtual desktops to client devices like PCs and tablets. The GPU can accelerate graphics on the server side for full high-definition virtual desktops on client devices.

The faster processing speed of SM10000 could also help deploy virtual machines at a faster rate in computing environments, AMD said. A single graphics card will be able to deploy many virtual machines, and AMD has worked with Citrix, VMware and Microsoft to boost virtualization performance on the GPU.

AMD is also targeting the FirePro SM10000 at workstations. The company did not immediately comment on questions related to single-precision performance and pricing for the graphics card.
http://www.computerworld.com.au/art...d_release_graphics_processors_supercomputing/
 

Plimogz

Senior member
Oct 3, 2009
678
0
71
K20X? so I guess ORNL received the best chips, those with 14/15 SMX functional, for Titan while there'll be a K20 with 13/15 and a memory controller disabled.

It'll be interesting to see if and when nV revise and re-spin big Kepler to achieve what they did going from 480 to 580, in order to get some fully functional chips out to market.

But that Sea Islands based SM10000 sounds like fun. And it bodes well for the 89xx Radeons being here well before next summer, too.
 

Alusan

Member
Mar 5, 2010
33
0
66
But that Sea Islands based SM10000 sounds like fun. And it bodes well for the 89xx Radeons being here well before next summer, too.
I think it's more likely that it's that dual-Tahiti board with downclocked GPUs they showed a few months back. Wouldn't we have heard some more rumblings about Sea Islands if they already had a FirePro product ready?
 

DiogoDX

Senior member
Oct 11, 2012
757
336
136
I think it's more likely that it's that dual-Tahiti board with downclocked GPUs they showed a few months back. Wouldn't we have heard some more rumblings about Sea Islands if they already had a FirePro product ready?
I agree. SM10000 is a Seamicro Server not an evolution of the S9000.

AMD will be showcasing its virtualization solutions at VMworld 2012 through October 12 at Barcelona’s Fira Barcelona Gran Via, hall 7, level 0, stand G312. AMD will be demonstrating an HP ProLiant DL385p Gen8 server with VMware vSphere™ and View, the graphics path capabilities of vSphere with AMD FirePro™ graphics boards, a static SeaMicro SM10000™ server, as well as the AMD Opteron and Intel Ivy Bridge modules for the newly announced SeaMicro SM15000™ system.

http://www.amd.com/us/press-releases/Pages/amd-based-hp-servers-2012oct9.aspx
 

Homeles

Platinum Member
Dec 9, 2011
2,580
0
0
I think it's more likely that it's that dual-Tahiti board with downclocked GPUs they showed a few months back. Wouldn't we have heard some more rumblings about Sea Islands if they already had a FirePro product ready?
It's a fully enabled Tahiti chip, running at lower clocks than the 7970. It should be S10000, not SM10000.
 

ViRGE

Elite Member, Moderator Emeritus
Oct 9, 1999
31,516
167
106
No. GCN has FP64 up to ½ of its FP32 rate. Consumer cards max out at ¼.
When AMD calls FP64 performance configurable, they don't mean in the "flip a software switch sense". It's configurable in that it's easy to modify a design to have higher or lower FP64 performance based on your design goals (higher performance requiring larger hardware blocks). Pitcairn and Cape Verde are 1/16, Tahiti is 1/4. There is not a Southern Islands chip with 1/2. That's merely an architectural option for AMD if they want to go that way on future chips.

So based on this article, it has to be a dual-GPU card. 1.48 TFLOPS is not possible on a single Tahiti GPU.
 

3DVagabond

Lifer
Aug 10, 2009
11,951
204
106
TechPowerUp has an article on the S10000 server graphics card.

amd_firepro_s10000_01.jpg


Dual GPU card.
 

Lonbjerg

Diamond Member
Dec 6, 2009
4,419
0
0
TechPowerUp has an article on the S10000 server graphics card.



Dual GPU card.


Nice PR spin form your link:
OpenCL Support: OpenCL has become the compute programming language of choice among developers looking to take full advantage of the combined parallel processing capabilities of the FirePro S10000. This has accelerated computer-aided design (CAD), computer-aided engineering (CAE), and media and entertainment (M&E) software, changing the way professionals work thanks to performance and functionality improvements.

I guuess CUDA dosn't get recognized by AMD...go figure ;)



No fanboy-fueled trolling please.

In the future, I hope you won't feel obligated to troll everything AMD-related.

Moderator jvroig
 
Last edited by a moderator:

Keysplayr

Elite Member
Jan 16, 2003
21,211
50
91
Nice PR spin form your link:


I guuess CUDA dosn't get recognized by AMD...go figure ;)

Ok, ya lost me here. What does the AMD GPU(s) 3dVagabond listed have to do with CUDA? Should it? No idea where you were coming from with this. Blame it on my first cup of coffee if you wish.

Can you clear this up for the bleary eyed?
 

Jaydip

Diamond Member
Mar 29, 2010
3,691
21
81
S10000 has better DP and SP performance but at the expense of TDP.
 
Last edited:

sontin

Diamond Member
Sep 12, 2011
3,273
149
106
S10000 has only 13% more DP performance and needs ~66% more power. That's not impressive.
 

Jaydip

Diamond Member
Mar 29, 2010
3,691
21
81
AMD needed to create a Tesla equivalent can't blame them for trying.
 
Feb 19, 2009
10,457
10
76
Doesnt matter, AMD wont even make a blip in HPC, ppl are already used to CUDA.

At least Intel has the muscle and $$ to hammer away at its x86 for their HPC attempt. AMD has nothing of the sort to dent away at NV's dominance.
 

Jaydip

Diamond Member
Mar 29, 2010
3,691
21
81
Gotta disagree.You will get nowhere unless you try. Amd needs to offer more than hardware, they need to offer a complete package.The potential is there,they got to take advantage of it.
 

3DVagabond

Lifer
Aug 10, 2009
11,951
204
106
Gotta disagree.You will get nowhere unless you try. Amd needs to offer more than hardware, they need to offer a complete package.The potential is there,they got to take advantage of it.

CUDA is proprietary. nVidia has to support it. OpenCL is open source. In theory anyway, it doesn't need vendor support.
 

Lonbjerg

Diamond Member
Dec 6, 2009
4,419
0
0
CUDA is proprietary. nVidia has to support it. OpenCL is open source. In theory anyway, it doesn't need vendor support.

So just like Direct3D vs OpenGL....that went well for the open source...oh wait.

It's more complicated that just tossing hardware out there....hence why CUDA is used in schools ^^