Know the truth
1) Bulldozer full product line announcement, FX-4170 up to 4.2GHz frequency. http://diybbs.zol.com.cn/11/11_100683.html
2) from coolaler with corescn (970 replacement motherboard after) Bulldozer B2 stepping score on the "I know the results," the premise is true.
2.1) with 2 sets of NB coolaler frequency of testing, found that when the frequency of 2.2GHz and 2.6GHz NB When, WPrime results better than NB frequency of 2.0GHz and 2.4GHz (see above B2 stepping known BUG).
2.2) Another test showed that: When the FX-8150 default rate with 1866MT / s memory, Cinebench R11.5 score of 5.95. The overclocked to 4.8GHz, and NB 2.2GHz after, Cinebench R11.5 performance upgrade to 7.8. (I can only guarantee the authenticity of the default results.)
3) C0 stepping overclocking ability is improved, and to reduce some of the products of the TDP.
4) shopblt.com Bulldozer with provantage.com scheduled start, the truth is unknown.
5) The so-called Bulldozer official results from AMD. http://diybbs.zol.com.cn/11/11_100797.html
6) AGESA is to develop the necessary BIOS code, now part of the BIOS does not use the latest version of AGESA, Bulldozer will ultimately limit the performance.
7) retail chip (B2G) has started production, the relative performance of engineering samples (B2) has been raised. (Please read my microblogging)
8) had a final statement chips Fritz Chess Game score as high as 17000, veracity study. (Refer to section 2 of this item)
9) all messages are pointing October 13 date for the NDA lifted, authenticity is difficult to trace. Probably like 6 September, rumors, etc. August 26 the same. After all, only to eventually determine the release of AMD.
But those cores have the module penalty, right?
So those 16 cores, at 80% are only 12.8 cores - so those 35% would have to come from IPC and clock speed.
But since the clock speed is the same either the IPC is way higher or the performance won't be 35% up.
You could say the same thing about the Apple A6. How many transistors does it have? How much does it cost to make? We know nothing about it, yet there must be 10 million of those chips somewhere.
Good point and agreed! :biggrin::hmm: about as wise as seeking information on pre-release hardware from a troll-laden forum in the firstplace? :|
We all get what we pay for here, the info is free and it's worth every penny![]()
I don't mean to imply anything about BD's competitiveness; I'm as skeptical as anyone else (though I remain somewhat hopeful). It just seems like people have been appropriately skeptical of performance leaks (both good and bad), but have accepted these prices without question. Even those who believe the benchmark leaks are fake/not representative are formulating their arguments on the basis that these prices are correct ("AMD has a history of pricing this way...", and so on) rather than simply questioning the prices themselves.Even if we ignore all the pricing rumors, if BD is so competitive, why was it delayed so many times, why did it require so many revisions to get higher frequencies, why is it 9 months late since SB arrived, why hasn't there been a single positive leak from anyone?
Your first sentence there makes sense and I think is the explanation I was looking for.Its not to say that the pricing is reliable but any bit of information brings more clues and answers. I was also referring to the price of Opteron 6272 being lower than Opteron 6176 SE earlier from another site. :hmm:
Its a collection of information that's already been posted/discussed earlier. The Fritz chess was debunked. If not mistaken, that "C0 stepping" speculation came from SemiAccurate forums. Other details not posted here is that Cinebench score 7.8 was with a FX8150 overclocked to 4.8GHz mentioned on XS forums (by a poster from Vietnam who supposedly have one). :hmm:More yoda talk from mainland China.
http://translate.google.com/transla...p%3A//diybbs.zol.com.cn/11/11_100864.html
*snip*
They said "up to" 35% higher performance. If they got that "up to" figure using an application that only requires integer execution then they may lose very little performance, if any.
You're forgetting the part about pricing coming from stores that are gonna sell the CPU. Pre-order pricing is always very close to launch pricing.
So yes, it's pretty much a fact that the FX-8150 will cost from $~250-275.
But by all means, be optimist. The reality of it is that it's not as competitive as they hoped for, so they're pricing lower and trying desperately to raise clock speeds. Bulldozer, like Thuban, will be good for multi-threaded applications and at a big deficit in anything that's not.
They said "up to 80%" performance of a traditional dual-core design.
The integer portion seems to be the one more prone to lose performance - the FPU has more theoretical capability than 2x K10.5's FPU.
They said "up to" 35% higher performance. If they got that "up to" figure using an application that only requires integer execution then they may lose very little performance, if any.
They said "up to 80%" performance of a traditional dual-core design.
The integer portion seems to be the one more prone to lose performance - the FPU has more theoretical capability than 2x K10.5's FPU.
If there's two threads being executed, then each one is forced to share resources. How much real-world performance they lose from it, we don't know. Every module has 2x 128-bit FMAC.
160% vs 200% to be exactAlso, they meant to say 180% scaling in comparison to two typical cores, or 200%.
not 160% 180% of 2 full cores. id link it but its in all these damn threads so im not gonna bother looking for it
Dual-Core CMP => 100%, 200%
Dual-Core CMT => 80%, 160%
Owing to the shared-resource design, I thought the single-thread performance was supposed to be better than 1/2 the dual-thread performance on the dual-core CMT microarchitecture?
Because how could a single thread on a single core outperform a single thread on a dual-core module when that thread now has access to 2x the resources in those instances where a second thread is not scheduled on the module?
Or are they saying that the comparitor CMP architecture is to take the CMT module, remove ONLY the int core, leave all the shared resources as is, and then make two cores out of that?
Actually, it's the other way around. A module has two full, normal integer cores that do not share any resources.
Considering that MC can do 0 (ZERO) AVX instructions and 0 (ZERO) 256-bit instructions, a BD module should lose no FP performance compared to 2 MC cores. In fact, a BD module should gain performance compared to 2 K10.5 cores if such instructions are used.When it comes to that it's the same as all other CPUs in execution. Where it changes is the FPU, since if only a single thread is executed on the module it'll use all the resources. That means it'll be able to handle 256-bit AVX (used for encryption) and FP SSE. If there's two threads being executed, then each one is forced to share resources. How much real-world performance they lose from it, we don't know. Every module has 2x 128-bit FMAC.
Also, they meant to say 180% scaling in comparison to two typical cores, or 200%.
The 35% was for the HPC market which is a downplayed number because it was talking about current apps
33% more core + 2% improvement(.125% improvement per core)
Unfortunately, HPC is one of the few places where developers program to metal
The applications that will be built for Interlagos will use its FMA and 256bit capabilities
Bulldozer CMP = 100% performance
Bulldozer CMT = 80% performance with less mm^2 and less power(watts)
The 80% number wasn't aimed at current products but at a hypothetical Bulldozer using CMP
If two threads are required to execute on the Floating Point aka 256bit....an extra cycle will occur for the extra 256bit op
160% vs 200% to be exact
160% w/ Bulldozer Module vs 200% w/ Bulldozer Dual-core w/ Seperate Fetch/Decode and Floating Point and L2
The integer cores do share resources, just look at AMD slides posted a few posts above.
Considering that MC can do 0 (ZERO) AVX instructions and 0 (ZERO) 256-bit instructions, a BD module should lose no FP performance compared to 2 MC cores. In fact, a BD module should gain performance compared to 2 K10.5 cores if such instructions are used.
I've read both, but the 80% is the only one showing up on AMD marketing slides.
The FPUs share resources, the integer units do not. I've looked at that slide, and that's exactly what it says.
I guess they didn't write shared fetch and shared decode, although as you can see they are shared, but they did write "shared L2$"
Additionally I didn't say the FPU isn't shared by the 2 threads and I also said MC can do 0 256-bit FP instructions and 0 AVX.
As for the FPU, what's being compared is not the performance improvement from K10.5 in FP SSE and 256-bit AVX (this one in particular K10.5 can't do at all), but how much performance it loses in comparison to having everything in the unit dedicated. Of course, FP SSE performance should be improved in comparison to K10.5 when only one thread is being run in the module.
As for the FPU, what's being compared is not the performance improvement from K10.5 in FP SSE and 256-bit AVX (this one in particular K10.5 can't do at all), but how much performance it loses in comparison to having everything in the unit dedicated. Of course, FP SSE performance should be improved in comparison to K10.5 when only one thread is being run in the module.
No, 180%. AMD means 100% for the first core, and 80% performance added from the second core. Therefore, 180% overall. Of course, this is probably an average, and it may very well be that AMD is using this estimate taking into account how much performance will be lost in FP workloads in comparison to having it completely dedicated and using it to round out. It may very well be, like another poster said: 200% integer performance, 160% floating point performance when two threads are being run.
And the number said by AMD when it comes to performance improvements in HPC from Interlagos in comparison to Magny Cours is "UP TO 35% higher", which means it's a best-case scenario. It could definitely be true in integer workloads, though. Needless to say, there's been more info in terms of servers because that's what Bulldozer is mainly aimed at.