WhoBeDaPlaya
Diamond Member
- Sep 15, 2000
- 7,415
- 404
- 126
Llano(32nm) Stars => 32 mm² for two cores & 2 MB L2.Stars cores were small. The comparison is relevant based on cost.
A 8150 was 315mm2 on 32nm.
Thuban x6 was 346mm2 on 45nm.
A zen core is 193mm2 btw.
Bd was a epic failure. And mostly because of sad efficiency and its huge size and not because of its lackluster performance from witch it was famous. It was worse than its reputation.
Except, AMD didn't make Zen. It was developed first at Apple, which was shot down. SMT consumed too much power in a mobile phone.Its not a blamegame on the engineers making the stuff on the ground because they also made zen. And yeaa bd really learned amd a bit about efficiency that we already saw in excavator or even the clockmesh in pd. Hard lessons.
Sandy Bridge-EP-4 was 270 mm². So, the large dies wasn't exclusive to just AMD. AMD at the time was using larger components that were more optimal in 0.7-1.1v range. It isn't really all that shocking FX was at 315 mm² and later 319 mm² with Piledriver. AMD should have launched with the 45nm PDSOI version of K10 @ Phenom FX & APU Falcon. Back in 2008/2009, this would have shown the progress from standard voltage to low voltage design. 45nm version of Bulldozer being effectively smaller than 32nm Bulldozer, etc.But bd didnt arive as 2 cores on 32mm2 sans l3. It came as 315mm2. Thats the reality that hit the market. Yes it was evident from the die shots most of the area was wasted. But its just another technical reason of a long string that went completely wrong. Tons of top level designs decisions were incompetent. You cant hand optimize away from that.
Probably it would have been better if it was worse so it could have been scrapped at 32nm tapeout! - it was that bad
For all intents and purposes the 32nm PDSOI version should have consumed significantly more power than it did. More pipeline stages with each stage being wider(~2.3x to be approximately exact) than K8! (the ! is the actual name for K8 cores with 128-bit LSQ & 128-bit FPU). So, the fact that FX-8150 and later FX-8350 out clock Phenom II X6/Phenom II X4, is pretty significant. Especially, when you consider than 32nm PDSOI had very little power/perf improvement over 45nm PDSOI in initial yields(2010-2012; 2013 being the year 32nm PDSOI finally got faster than 45nm PDSOI).Verification of Bulldozer cores (microprocessor based on K10 micro architecture & M-SPACE design methodology)
The design and verification of the 45 nm clock skew and clock tree test chip for the K10 (Bulldozer)
{{Citation needed}}Except, AMD didn't make Zen. It was developed first at Apple, which was shot down. SMT consumed too much power in a mobile phone.
That theory is ridiculous and is completely based on Keller working at Apple for the A5X design. Apple doesn't and will never have an X86 license. There is no reason they would have Keller designing a x86 CPU that they could never use.{{Citation needed}}
Apple A7 has SMT logic in Cyclone which was disabled/fused-off at release. With A8 that logic was completely removed. That defunked path was moved to AMD via IP theft(early A7/Cyclone concept) by Keller. Zen is based off Cyclone from A7, but not the subsequent releases in A8/A9/A10. K12 which is ARM was a complete rip of the core, Zen was considered the not so desired sister-core to it. Except, Zen was different enough to bypass any engineering scrutiny cuz x86. Which, ultimately leads to Zen being released and K12 being killed off do to a cease and desist order by Apple.{{Citation needed}}
Hey,finished blueprints AND open software (other suckers doing work for them)IBM is using the word open to describe this project in three ways.[3]
- They are licensing the microprocessor technology openly to its partners. They are sharing the blueprints to their hardware and software to their partners, so they can hire IBM or other companies to manufacture processors or other related chips.
- They will collaborate openly in an open-collaboration business model where participants share technologies and innovations with each other.
- Advantages via open source software such as the Linux operating system.
That defunked path was moved to AMD via IP theft(early A7/Cyclone concept) by Keller.
Which, ultimately leads to Zen being released and K12 being killed off do to a cease and desist order by Apple.
So, ideally Bulldozer was good because in most part it was built Internally.
Zen is awful because it was built from another companies existing core.
It was built grounds-up internally by teams already at AMD. AMD received baseline MCMT & concept Bulldozer between 2002-2004. First Bulldozer test-chip was taped out in 2006. Which, then would go through a very weird development trend.. . . what?
NexGen was purchased by AMD. So, effectively it was then purely AMD in origin. It also lead to superior design within one-ish year of K5s shipping.Gee, I guess k6 was awful too . . .
It was built grounds-up internally by teams already at AMD. AMD received baseline MCMT & concept Bulldozer between 2002-2004. First Bulldozer test-chip was taped out in 2006. Which, then would go through a very weird development trend.NexGen was purchased by AMD. So, effectively it was then purely AMD in origin. It also lead to superior design within one-ish year of K5s shipping.
Apple A7 has SMT logic in Cyclone which was disabled/fused-off at release. With A8 that logic was completely removed. That defunked path was moved to AMD via IP theft(early A7/Cyclone concept) by Keller. Zen is based off Cyclone from A7, but not the subsequent releases in A8/A9/A10. K12 which is ARM was a complete rip of the core, Zen was considered the not so desired sister-core to it. Except, Zen was different enough to bypass any engineering scrutiny cuz x86. Which, ultimately leads to Zen being released and K12 being killed off do to a cease and desist order by Apple.
So, ideally Bulldozer was good because in most part it was built Internally. Zen is awful because it was built from another companies existing core. While components are different, largely because Zen is built from shrinked Excavator IP/patents/etc. So, really have fun with your recycled Excavator cores. (The ARM version had none of this differing from Cyclone only the x86 ver.)
Trolling is not allowed
Markfw
Anandtech Moderator
I feel the same way about Mongoose. At least, Samsung evolved the Cat design rather than doing a 1:1 rip with Apple, but with Excavator IP.@NostaSeronx
I'm curious, what you think about Mongoose architecture? "Obviously" all signs point that Samsung "stole" it from AMD?
But bd didnt arive as 2 cores on 32mm2 sans l3. It came as 315mm2. Thats the reality that hit the market.
One thing Nosta has brought up though is the comparisons of 32nm SOI to 45nm, It really was a dog for some time.. Llano was a disaster IMO, with FMAX well below 45nm, and perf/watt was barely any better even at lower freq/power levels. So if you actually compare Llano with piledriver based APU's, the later looks more impressive.
I'd say Bulldozer and all its variants had good performance, but total junk features. Boost and throttle capabilities didn't add value, if you turned them off you had a much better basis... On Bulldozer, single-core - dual, in light of modular design - boosting occurs when....
GeAPM setting cannot be switched off directly after selecting a profile in msrtweaker dos utility, since it needs to be done in Stilt's Devastator Powertune if it constantly shuffles between profiles as it does in its usual operation, so I suppose you aren't correct on this one... Stilt has tested how much it impacts to throttle to p3 from p0 and it was 1%.I disagree on all three counts:
The saving grace for BD1 and BD2 was their multithread and having 8 cores. BD1 (and possibly PD) was (were) the first consumer octacore(s) that hit market and it had a great niche to fill. A quadcore BD1 as flagship would have been a real joke going against Phenom II quads and hexacores.
I don't know anything about Llano desktop, but Llano laptop was marvelous; I suppose K10 was meant for lower frequencies and power profiles.
It's natural to expect a performance increase going to a new generation. And however small, that's what we got going from Llano to BD2 APUs.
Boost and single thread is what saved the dozers; the unimpressive IPC can only be offset by frequency gains.
You could clock the individual cores separately (and thus get very effective single core boosts) starting with Steamroller. Piledriver still had a shared decoder+dispatcher, which might have been what prevented a single core to boost alone. Steamroller eliminated this shared front end and you can set different frequencies for cores on the same module; had steamroller kept the shared front end, steamroller might have been (significantly?) more power efficient. Steamroller is not impressive at all power efficiency wise (neither desktop nor laptop) but overall they made for decent quadcores.
Turbo boost can be annoying at times (on 95W and up power profiles) and I've turned this features off at times for the duration of a summer. Mostly a poor OS setup or frequency governor is to blame. On linux, BOINC loads go from being a nuissance (spinning up the fan) to being near unnoticeable, if you set edit (set to 1) the file /sys/devices/system/cpu/cpufreq/ondemand/ignore_nice_load .
This executes niced tasks at low power efficient frequencies rather than maximal frequency and makes a world of difference.