Update:
I redid my tests on windows XP, with/without HT. I used similar compiler flags as SPEC 2006.
I also overclock Core i7 from 2.66G to 3.5G. Below are the setup and comparison.
Core i7 computer
CPU: core i7 920 at 2.66G
MB: ASUS P6T
memory: 6G DDR3 1600
OS: Fedora 10 64bit
Compiler: Intel Fortran Compiler Professional 11.0.069
Compiler flags: -r8 -xSSE4.2 -ip -openmp -opt-prefetch
OS: Window XP 64bit
Compiler: Intel Visual Fortran Compiler Professional 11.0.066
Compiler flags: /Qautodouble /QxSSE4.2 /Qip /Qopenmp /Qopt-prefetch /F1000000000
Core 2 Quad computer
CPU: Core 2 Q6600 at 2.4G
MB: gigabyte p35
memory: 4G DDR2 800
OS: fedora 9 64bit
Compiler: Intel Fortran Compiler Professional 11.0.069
Compiler flags: -r8 -ip -openmp
Dual Xeon computer
CPU: dual Xeon E5410 at 2.33G
MB: Tyan S5396A2NRF
memory: 8G DDR2 667 ECC Fully Buffered
OS: fedora 9 64bit
Compiler: Intel Fortran Compiler Professional 11.0.069
Compiler flags: -r8 -ip -openmp
The testing code is a openmp CFD(computational fluid dynamics) code. The code used about 500MB memory. The testing results:
results
*For Core i7, Turbo mode is OFF.
For four threads, Core i7 920@3.55G is slower than Q6600@2.4G!
The scaling of Xeon E5410 is the best.
----------------------------------------------------------------------------------------------
I read a lot reviews and forums about the new Intel CPU core i7. The new Intel CPU looks great. For example, according to spec cfp2006,
http://www.amdzone.com/phpbb3/...opic.php?f=52&t=135802, Core i7 is about 100% faster than core 2 for float point performance. Although someone suggested that Intel might optimize the CPU for the benchmark codes, I still decided to upgrade my computer from core 2(Q6600) to core i7(920). Because I thought the better bandwidth of core i7 would help the multithread computation. Well, I was wrong. My testing results made me very disappoint to the new core i7 CPU.
My new computer
CPU: core i7 920 at 2.66G
MB: asus p6t
memory: 6G DDR3 1600
OS: Linux fedora 10 64bit
My old computer
CPU: core 2 Q6600 at 2.4G
MB: gigabyte p35
memory: 4G DDR2 800
OS: Linux fedora 9 64bit
compiler: Inter ifort 11
The testing code is a openmp CFD(computational fluid dynamics) code. The code used about 500MB memory. The testing results:
1 thread : 202.7s (core i7 920) , 213.1s (q6600), 5.13%(core i7 advantage)
2 threads: 109.0s (core i7 920) , 109.3s (q6600), 0.27%(core i7 advantage)
4 threads: 96.1s (core i7 920) , 68.3s (q6600), -28.93%(core i7 advantage)
(core i7 HT is off, it is slower if HT is on. the clock of core i7 920 is about 10% higher than core 2 Q6600)
The scaling of core i7 for 4 threads is really bad. Considering the bandwidth of core i7 is about twice of core 2, this result is really strange. I don't know why, maybe the OS didn't use all the potentials of the new CPU?
I redid my tests on windows XP, with/without HT. I used similar compiler flags as SPEC 2006.
I also overclock Core i7 from 2.66G to 3.5G. Below are the setup and comparison.
Core i7 computer
CPU: core i7 920 at 2.66G
MB: ASUS P6T
memory: 6G DDR3 1600
OS: Fedora 10 64bit
Compiler: Intel Fortran Compiler Professional 11.0.069
Compiler flags: -r8 -xSSE4.2 -ip -openmp -opt-prefetch
OS: Window XP 64bit
Compiler: Intel Visual Fortran Compiler Professional 11.0.066
Compiler flags: /Qautodouble /QxSSE4.2 /Qip /Qopenmp /Qopt-prefetch /F1000000000
Core 2 Quad computer
CPU: Core 2 Q6600 at 2.4G
MB: gigabyte p35
memory: 4G DDR2 800
OS: fedora 9 64bit
Compiler: Intel Fortran Compiler Professional 11.0.069
Compiler flags: -r8 -ip -openmp
Dual Xeon computer
CPU: dual Xeon E5410 at 2.33G
MB: Tyan S5396A2NRF
memory: 8G DDR2 667 ECC Fully Buffered
OS: fedora 9 64bit
Compiler: Intel Fortran Compiler Professional 11.0.069
Compiler flags: -r8 -ip -openmp
The testing code is a openmp CFD(computational fluid dynamics) code. The code used about 500MB memory. The testing results:
results
*For Core i7, Turbo mode is OFF.
For four threads, Core i7 920@3.55G is slower than Q6600@2.4G!
The scaling of Xeon E5410 is the best.
----------------------------------------------------------------------------------------------
I read a lot reviews and forums about the new Intel CPU core i7. The new Intel CPU looks great. For example, according to spec cfp2006,
http://www.amdzone.com/phpbb3/...opic.php?f=52&t=135802, Core i7 is about 100% faster than core 2 for float point performance. Although someone suggested that Intel might optimize the CPU for the benchmark codes, I still decided to upgrade my computer from core 2(Q6600) to core i7(920). Because I thought the better bandwidth of core i7 would help the multithread computation. Well, I was wrong. My testing results made me very disappoint to the new core i7 CPU.
My new computer
CPU: core i7 920 at 2.66G
MB: asus p6t
memory: 6G DDR3 1600
OS: Linux fedora 10 64bit
My old computer
CPU: core 2 Q6600 at 2.4G
MB: gigabyte p35
memory: 4G DDR2 800
OS: Linux fedora 9 64bit
compiler: Inter ifort 11
The testing code is a openmp CFD(computational fluid dynamics) code. The code used about 500MB memory. The testing results:
1 thread : 202.7s (core i7 920) , 213.1s (q6600), 5.13%(core i7 advantage)
2 threads: 109.0s (core i7 920) , 109.3s (q6600), 0.27%(core i7 advantage)
4 threads: 96.1s (core i7 920) , 68.3s (q6600), -28.93%(core i7 advantage)
(core i7 HT is off, it is slower if HT is on. the clock of core i7 920 is about 10% higher than core 2 Q6600)
The scaling of core i7 for 4 threads is really bad. Considering the bandwidth of core i7 is about twice of core 2, this result is really strange. I don't know why, maybe the OS didn't use all the potentials of the new CPU?