Regarding Zen5, people have blown their own bubble based on the Epyc (Zen5) results.
Yes, that 40% IPC bubble is Zen 5's trademark.
Zen5 is a massively, but cautiously, expanded (widened and deepened) x86 core. It features a super-advanced BPU and prefetching. It's very extensive and significantly deeper than previous Zen generations. The BPU in Zen5 anticipates two consecutive branches and can have three branch windows. Zen5 sees very long and complex branch patterns.
Yes, but... reading the Chips'n'Cheese profiled data, majority of stalls within the Zen 5 core are caused by the frontend latency.
Zen5 has about 26% more transistors per core, or 218 million more than Zen4. The difference is practically the entire Skylake+L2 core (217 million transistors).
This metric is hugely flawed. Capacitance/frequency optimization, weird IO stuff, niche features, etc. Specifically, Zen 5 invested a notable amount of transistors to support the AVX-512 full-speed data flow.
The IPC increase is an average of +16% (average +13% Int and average +24% FP).
Compared to Zen2, it's about 50-55% (average IPC increase).
Even that 13% is not so impressive for such ambitious design, right? I know the relative base value of Zen 4 was higher than Zen 1's but still - there are Apple and Qualcomm.
Those sizes are cool, but how about the int RF or ROB size?
At 448 slots Zen 5's ROB is still smaller than Golden Cove's 512, let alone Lion Cove's 576.
The same applies to 240 int RF vs 280-300 of Intel's.
Both limits manifest themselves in the profiled data.