There's a lot of mixing up of whats what in this thread. Here's clarification of what we know and what the slides at xbit labs show.
Chip: Intel Polaris
Timeframe: 2007
Process Node: 65nm
Power: 62W unknown for chip or for board
Cores: 80 simple floating point unit + register + control, not x86
Performance: claimed 1 teraflop, unknown single or double precision
Commercial Availability: None
Status: Unknown
Purpose: research and development experiment of stacked dies, on chip routing network
Chip: Intel Larrabee
Timeframe: 2009
Process Node: unknown, assumed 45nm
Power: unknown
Cores: 32-64 x86 derived from P54C Pentium
Performance: claimed 1 teraflop single precision on SGEMM
Commercial Availability: None
Status: Terminated
Purpose: GPGPU, graphics acceleration
Chip: Intel Knight's Ferry
Timeframe: 2011
Process Node: 22nm
Power: unknown
Cores: 50+ x86, possibly derived from Larrabee
Performance: claimed 1 teraflop double precision on DGEMM
Commercial Availability: planned availability in 2012 or later as Knight's Corner, planned into be placed into 10 petaflop TACC supercomputer in 2013
Status: ongoing
Purpose: high performance computing accelerator
Chip: NVIDIA GF100
Timeframe: 2010-2011
Process Node: 40nm
Power: 225W TDP claimed for board
Cores: 448 CUDA cores
Performance: theoretical 1.03 teraflops single precision, 0.515 teraflops double precision, 635 gigaflops in SGEMM, 305 gigaflops in DGEMM (source:
http://www.netlib.org/utk/people/JackDongarra/SLIDES/gpu-0711.pdf)
Commercial Availability: now as NVIDIA Tesla M2070 and M2050
Status: ongoing
Purpose: GPGPU, graphics acceleration
Chip: AMD RV870
Timegrame: 2010-2011
Process Node: 40nm
Power: 225W TDP claimed for board
Cores: 320 SIMD cores (1600 processing elements)
Performance: theoretical 2.72 teraflop single precision, 0.544 double precision, claimed 87% peak efficiency on DGEMM for a 473 gigaflops double precision (source:
http://www.cse.scitech.ac.uk/disco/mew21/presentations/AMD.pdf)
Commercial Availability: now as AMD Radeon 5870
Status: ongoing
Purpose: GPGPU, graphics acceleration
Chip: Fujitsu SPARC64 VIIIfx
Timeframe: 2011
Process Node: 45nm
Power: 58W for chip?
Cores: 8
Performance: theoretical 128 gigaflops double precision, 93% efficiency in Linpack 119 gigaflops double precision, 111 gigaflops DGEMM (source:
http://icl.cs.utk.edu/hpcc/hpcc_record.cgi?id=459)
Commercial Availability: Deployed now in 10 petaflop RIKEN K supercomputer, follow on 16 core chip to be commercially available in 2012
Status: ongoing
Purpose: high performance supercomputing
Chip: (Future Exascale Processor)
Timeframe: unknown, possibly 2018 based on previous stated goals from Intel
Process Node: unknown
Power: Projected 5W for computation, 20W for an entire system (which includes computation, disk/storage, memory, communication, and miscellaneous)
Cores: unknown
Performance: unknown, performance target of 1 exaflop double precision on 20 megawatts of power for the entire supercomputer
Commercial Availability: unknown, targeting 2018
Status: ongoing
Purpose: high performance supercomputing
Here is Michael Feldmen's take on Intel's SC11 Knight's Ferry presentation:
http://www.hpcwire.com/hpcwire/2011...ark_of_one_teraflop_with_knights_corner_.html