StrangerGuy
Diamond Member
- May 9, 2004
- 8,443
- 124
- 106
In before how x86 doesnt matter anymore because ARM is going to rule over Intel but somehow AMD remains unaffected so AMD wins.
anybody see that new benchmark earlier today about the 0x55 masked thread scheduling in win7 for the 8150, made all the <=4 thread benchmarks 15-20% faster.
A Korean site had done a pretty thorough investigation of this behavior on release day.
http://udteam.tistory.com/442 [comparison of 2M/4C versus 4M/4C)
http://udteam.tistory.com/440 (scores from other CPUs of the benches used above)
http://udteam.tistory.com/441
In short:
Pro: Measurable performance increase
Con: At <= 4 threads, still loses to the lower clocked Phenom II 980 in everything except the WinRar benchmark.
Well, that's actually pretty interesting, at least for heavily threaded apps. It would be nice if M$ would add a revised scheduler to Win7 SP2.
It's easier to see why Cray didn't ditch Interlagos, with their custom OS and custom compiler, they can probably really tweak the typically heavily threaded apps to run very fast on Bulldozer.
Well, that's actually pretty interesting, at least for heavily threaded apps. It would be nice if M$ would add a revised scheduler to Win7 SP2.
It's easier to see why Cray didn't ditch Interlagos, with their custom OS and custom compiler, they can probably really tweak the typically heavily threaded apps to run very fast on Bulldozer.
By only using half the "cores?"
You realize what you're saying here, right?
Unless they have some magic power-saving bullet, however, they will also be spending a lot more money to run the machine.
Since when did performance/watt become king? Why are we not looking at absolute performance?
Surely I must have missed some of the finer points of previous fanboy vs fanboy debates. Thank you for the explanation. It actually makes sense by its sheer absurdity.Since AMD used it as a selling point in the gpu divisions. All over the forums there has been a huge focus on performance per watt because AMD had the advantageover nvidia. the advantage isnt really that big now with cayman vs gtx500 series but its still thrown up all the time. AMD and the fanboys made it a big deal, the biggest deal possible, and now its come back to bite them in the....... buttocks
Since AMD used it as a selling point in the gpu divisions. All over the forums there has been a huge focus on performance per watt because AMD had the advantageover nvidia. the advantage isnt really that big now with cayman vs gtx500 series but its still thrown up all the time. AMD and the fanboys made it a big deal, the biggest deal possible, and now its come back to bite them in the....... buttocks
HPCwire: Why Appro and why AMD Opteron™-powered servers?
McLaughlin: Appro offers innovative supercomputing solutions by combining the performance advantages of AMD Opteron processors such as higher core count, significant memory enhancements, and improved floating point processing with optimized power management features. Being on the forefront of system cooling and optimization, Appro offers many configuration options and having features such as performance-on-demand by dynamically adjusting performance based on CPU utilization – helps systems to run at optimum performance and power levels, reducing electricity costs while maximizing IT budget dollars. Appro platforms based on AMD ”Interlagos” processors will deliver greater performance per watt with improved memory bandwidth while reducing memory latency. AMD has been a great technology partner and has provided us with the tools and expertise to accelerate HPC business results. Working together, Appro and AMD provide customer-focused innovations that our High-Performance computing customers require.
HPCwire: Can we talk about the Appro Xtreme-X Supercomputer and new servers based on “Interlagos”? What can we expect?
McLaughlin: The Appro next generation, server platforms and the Xtreme-X Supercomputer will increase memory performance for high-performance computing while adding reliability, high availability, and flexibility for the end-users with the best performance/value. In addition, the new AMD architecture drastically improves the memory and I/O latency. It provides a much easier way to address larger chunks of memory faster, yielding huge performance boosts.
HPCwire: Who will benefit from Appro Xtreme-X Supercomputer and server platforms based on AMD Opteron™ processors?
McLaughlin: Many industries will benefit, in particular industries that are using memory-intensive applications, such as those used in scientific research and engineering. Compute-intensive multi-threaded applications will also see an immediate benefit.
HPCwire: Many people in the industry are eager to see what the AMD “Interlagos” processors can offer. Where do things stand with the Appro Xtreme-X Supercomputer based on these processors?
McLaughlin: We're on target for a Q4 launch with “Interlagos”-based systems. We're actually pretty excited about the product, particularly with respect to some of the features that we're going to enable. We started out Appro’s next generation Xtreme-X, with a certain feature set. After gathering customer feedback on the design, we changed the features to address faster hybrid multi-core technologies with improved power and cooling, flexible Interconnect options with fast I/O bandwidth, superior HPC software stack integration with complementary Appro Cluster Engine™ (ACE) management suite, including capabilities such as job scheduling, revision control, and fault tolerance. Taken together, these features are very important to the HPC industry. As a result, we think that our next generation product will have a bigger impact than we originally anticipated.
HPCwire: The AMD “Interlagos” processor offers some interesting innovations. How are you able to take full advantage of the processor’s capabilities?
McLaughlin: The Appro Xtreme-X supercomputer will offer enhancements to match the new processor and future generations. To accomplish this, we made significant changes to the Xtreme-X architecture to complement the new AMD “Interlagos” processor. We looked at the overall platform including thermal, mechanical, and chassis cooling design, as well as various interconnects so we can continue to deliver on quality, value, and performance.
HPCwire: As we know, whenever a new processor is announced customers need more than just a spec sheet to understand how the processor will help their specific HPC applications. What’s been done to prepare the market for the AMD “Interlagos” processors?
McLaughlin: The AMD Seed program is the key. It is definitely important and critical for our industry. People want to kick the tires, especially when they are getting a lot more cores to make sure their applications can benefit and can reach a quicker ROI. Through the seed program, we’ve provided samples to customers and we also performed customer benchmark testing in Appro’s Houston and Milpitas facilities. In response to that, we already have customers lined up and more announcements will be made in the near future.
Software not properly coded? If AMD is ahead of its time, why does its multithreaded performance benchmarks(all the way to 8 threads) only lie btwn the 4 cores 2500k and 2600k? Can faulty Win7 scheduling problems account for most of the shortfall?
One reviewer (hardocp) I think also said that his Intel setup was more responsive in stress testing (full cpu/memory load) compared to the BD rig.
Hmm I was thinking that maybe windows dedicates a thread to explorer among other things so on a heavily loaded system, the multicore PC with the faster single threaded perf wins out in terms of responsiveness. But if explorer itself was multithreaded....That's the advantage of having higher single-threaded performance when comparing two multi-core CPUs. In almost all modern Operating Systems you gain an advantage in overall responsiveness and multitasking when you go from a single-core to a dual-core, but after a dual-core it's all single-threaded performance that makes the big difference......
Explorer doesn't do enough actual work to need to be multithreaded. It will be mostly limited by disk and network performance, even with a fast SSD. For tasks which are not embarrassingly parallel, diminishing returns typically begin after two threads (making the assumption that if it was made to use more than one thread, going from one to two threads was was worthwhile). Things like games can keep wringing out more performance because at a low level, they are themselves multitasking.Hmm I was thinking that maybe windows dedicates a thread to explorer among other things so on a heavily loaded system, the multicore PC with the faster single threaded perf wins out in terms of responsiveness. But if explorer itself was multithreaded....
Well that doesn't matter, can't keep shifting the goalposts.