IPC is hard because clock targets are high. The lower you set your clock targets, the more you can do per clock, without heroic effort. If you go and read that post again you can note the caveat of one-MHz clock target.
The point of my statement is that talking solely of IPC as if it's a primary measure of performance and capability of a CPU is exactly as wrong as doing the same for clock speed. The two can be traded against each other in design, almost without limit.
Generally ignore everything Fruehe said. He knew very little of anything he talked about. Looking at the cache, and decreased throughputs for muls, etc, the expectation could not have been greater ipc.
IPC is a design target you (with some caveats) get to set. If you want 30% greater IPC, you can just decide to do that. If you want 100% greater IPC, you can just decide to do that.
The hard part is how fast, in clock speed, that high-ipc design is going to be. IPC increases come after heroic effort and in small increments because they want to get the gains without backsliding on the clocks.
AMD is not trying to eke out a few percent out of their old design. They are doing a completely new core. Will that core be as fast as Intel's best? I don't think so. Will it have X ipc? Regardless of X, the answer is "if they want to", or "if they think the best tradeoff between clocks and IPC they can get is there".
I perfectly follow that.
But what process is amd then designing for?
For me it seems optimally designing for process that the ecosystem uses no matter else eg. Very low freq mobile designs. Imo i think if amd design like the old days when they could tailor process they are damn wrong. They dont shape any more. They have to follow and design for that.
Under those asumtions i would expect high ipc but very low freq hdl designs. Very much synthesized. Lower cost than Intel (and so perf) and fit for consoles embeded and mobile x86 solution.
They are not going to use 14nm FF LPE, they will use the 14nm FF LPP that is suited for high performance devices. And im betting they are not going to use HDL for Desktop/Server ZEN.
The "LP" in "LPP" is still "low power."
14LPP is just an enhanced 14LPE, just like 16FF+ is an enhanced 16FF.
AMD cant win on CPU performance, and they shouldnt be trying. What they need to do is deliver a core that can process gaming-related instructions an order of magnitude faster than an intel design. That is where the biggest bottleneck is in terms of gaming performance. Its all about connecting the cpu and gpu in ways that require serious out of box thinking. Major graphics breakthroughs have occured regularly throughout history. The next one simply needs to come from the CPU. A texture compression module, a command processor, vertex processor, texture samplers, a transform core, rasterization core, tessellator, all of it needs to be connected to the CPU at a very low level. They all need to be components inside the GPU just as INT and FPU are today. Their future depends almost entirely on how far they have come in designing a true APU that can process all of those functions alongside x86 instructions, AND how far they have come in writing drivers that can run today's games on such a core to deliver an order of magnitude more FPS. They had almost 10 years to come up with something and they so far have failed to do so. What they need is basically something nearly akin to a miracle, but that doesnt mean it is impossible. It's only impossible if they havent at least been trying for the last almost 10 years. If indeed all they have done in the last 10 years is move the GPU closer to the CPU without actually integrating any of the functionality, then they should just declare bk today because they are done.
Low Power Performance, it has way different characteristics than LPE.
It is like the old Low Power vs High Performance. They just call it Low Power because of the FF design.
Hmm, if the core itself is around the size of Broadwell core (~7mm^2) then one unit with L2+L3 cache might be around 40mm^2?
512KB Cache are only ~1mm² in Samsungs 14nm. A core with caches could fit in 12mm².
insider2015
Excavator has 14,48 sqmm w/o cache in 28nm, Jaguar's L2 is ~2 sqmm.
Just because I said <10 sqmm for Zen doesn't mean it's exactly 10. It's smaller.
If AMD is truly "starting from scratch," then that only makes their job more difficult; it's far easier to take what is already great and to make it better than to start with nothing and produce greatness, IMO.
You're both correct and wrong at the same time. Yes achievable IPC is impacted by the clock. But the theoretical IPC isn't infinite and the amount of required hardware grows extremely fast. As an example, register file area grows quadratically with number of write ports (and you basically need one write port per instruction executed).
So getting high IPC is certainly difficult and increasing it by 100% isn't just a matter of deciding it.
I would argue that playing catch up is far easier than blazing the initial trail. How do you think Apple was able to catch up so quickly?
New information:
They bought PA Semi, threw lots and lots of money at it to hire top notch talent from the industry, and basically develop a single chip for a product that Apple itself builds.
Oh, and they bought PA Semi in 2008; Apple didn't debut a custom CPU core until 2012.
Do any of those points argue against catching up being easier than blazing the trail? I would reckon that AMD's design capability is still far greater than Apple's, and look what Apple was able to accomplish in a few short years, starting from relatively little.
Apple has been at the CPU design thing for like 7 years now -- hardly a "few short years." Also, have you not noticed that AMD has lost a lot of talent to the likes of Apple, Samsung, and Qualcomm?
Don't underestimate the importance of money to getting competitive projects done in a reasonable time frame.
Do any of those points argue against catching up being easier than blazing the trail? I would reckon that AMD's design capability is still far greater than Apple's, and look what Apple was able to accomplish in a few short years, starting from relatively little.
Apple would only dream to have the CPU IP AMD has.
You do realize that Apple sells their brand & not the A8X, even if they replaced the Ax SoC's with something tier 2 people would still buy it in droves. They have a cult like following but in terms of tech leadership they've never been a pioneer, except arguably with the original iphone, & have always followed the leading trends in (market) areas where they operate.I don't think so:
![]()
Apple's CPU IP is helping the company achieve ungodly amounts of profit. Apple's operating profit last quarter was more than even Intel generated in operating profit last year! Apple's R&D budget this year will be higher than AMD's revenue in its best revenue year.
Apple spent >50% more than AMD's current market cap on a headphone company. Believe me, if Apple wants something, it doesn't sit there "dreaming" or "wishing" that it had it -- it would go and buy it.
Apple bought PA Semi and used that as a foundation to build one of the finest low power chip design houses on the planet. Apple also has deep software expertise (it controls iOS), which no doubt informs their CPU designs.
Apple would only dream to have the CPU IP AMD has. Also, as was mentioned before, Keller explained they used the best things from Bulldozer and Jaguar mArchitectures to create ZEN. Jaguar is a high IPC low power design when Bulldozer is a high clock high throughput design.
Add a 14nm FF and you can do miracles in 4 years. As keller said, they already know how to make a high clock design, now they will have the process node to fully exploit what IP they have.