NostaSeronx
Diamond Member
- Sep 18, 2011
- 3,811
- 1,290
- 136
Well I don't think I've seen a delidded 300GE, but I'd imagine that is also the Raven2 die.
Also, with Dali coming in < a year clearing as much stock as possible is probably a good idea.
Raven2 is going to Dali though on 12LP(+).That dual core APU is so long overdue. I think this means they are now finally about to phase out the Stoney low end product. I'm thinking a cut down Raven2 with 1c/2t might be a good fit for eMMC craptop, tablet throwaways and Chromebook laptops. And, they would beat Stoney at 6W tdp is my guess.
They still do it just doesn't involve FinFETs. Especially with the problems at Malta going out of control.that they may no longer have Glo Fo obligations in the near future (beyond end 2020).
I would agree in case of evolution of Zen 2. However Zen 3 isn't an evolution.IPC is not a singular figure, mere boosts to the memory/cache system of Cortex-A12 led to its quick revision/renaming to A17 and a boost from 3.5 to 4 DMIPS per clock.
It's not impossible that a boost to the L1 could increase IPC all by itself by allowing current resources to be better utilised, though likely not a huge change, probably Zen+ level at most without further changes.
What about completely new architecture as Norrod said? AMD engineers run hundreds of simulations so they increased bandwidth due to some specific reason. To avoid some bottlenecks. Zen2 increased bandwidth due to doubling FPUs width. So increased bandwidth might indicate Zen 3 will be wider again, maybe FPU wider as RedGamigTech leak suggested. Maybe wider in ALUs too.There will be IPC increases. We just don't know where they are coming from just yet.
Zen 2 has 4 pipes FPU (2xFADD, 2xFMUL), take a look here: https://www.anandtech.com/show/14525/amd-zen-2-microarchitecture-analysis-ryzen-3000-and-epyc-rome/9It is already 8 pipes in Zen2, the problem is there is only four AVX256 issue ports. Supporting 4x AVX128 or 4x AVX256.
View attachment 13756
I think you are taking the "completely new architecture" part a bit too seriously.I would agree in case of evolution of Zen 2. However Zen 3 isn't an evolution.
Unified L3 cache is the main reason to call it as completely new architecture??? Maybe in your Intel world. AMD isn't playing this Intel's +3% IPC game. As Keller said in Intel, they made a plan for 50x larger CPUs in next 20 years. I'm pretty sure he did the same in AMD. And they will choose the most effective configuration/technologies for given manufacturing process. And this is what Apple is doing for a long time, AMD is doing that since Keller brought it from Apple. That's why Zen 3 could be quite a different uarch from Zen1/2. Zen 3 could be something like Apple's A11 Hurricane. Small performance jump despite of 6xALUs however new uarch brought solid base for much better performing A12 Vortex (Zen4).Well, AMD is already moving from a 2x4 core CCX, with its split cache and the inter-IF overhead to an 8 core CCD with none of those problems. So that's a new architecture right there. Obviously, there will be other improvements as well.
That's a very generalized patent. You'd have to search TSMC patents, though I suspect many process details to be held as trade secrets.New AMD patent, very similar to intel's 10nm COAG... probably the main reason N7+ have 20% more density
GATE CONTACT OVER ACTIVE REGION IN CELL
Complete Patent Searching Database and Patent Data Analytics Services.www.freepatentsonline.com
Apple A6 was their first fully custom CPU core, this was only 7 years + 2 months ago.And this is what Apple is doing for a long time
How do you know TSMC license it?New AMD patent, very similar to intel's 10nm COAG... probably the main reason N7+ have 20% more density
None... it just makes sense, the density boost is similar to intels claimsHow do you know TSMC license it?
Zen2 has 8 pipes.Zen 2 has 4 pipes FPU (2xFADD, 2xFMUL), take a look here: https://www.anandtech.com/show/14525/amd-zen-2-microarchitecture-analysis-ryzen-3000-and-epyc-rome/9
Raven2 is going to Dali though on 12LP(+).
Stoney still exists, there is still the 12FDX(Small Dali(Stoney successor(A6-9220C/A4-9120C))) product below 12LP(Big Dali(Raven2 successor(Ryzen 3 3200u/Athlon Gold 3150U/Athlon 300U))).They still do it just doesn't involve FinFETs. Especially with the problems at Malta going out of control.
Stoney is approximately 125 mm2 with the 28nm node.I think with Raven2 existing nowadays, a dozer based Stoney successor isn't really needed. The iGPU port to FDX might also be too much work for this low margin market that is getting very crowded (atom, along with chinese Centaur x86 SoC's as well as acorn SoC).
8 macro-ops can support 8 ALUs and 8 FPUs ops. However, imho if they use the shrink given with 5nm it would be simple to slap a second retire queue(RQ0 = A/B threads, RQ1 = C/D threads) on.The Zen back end was always oversized for some reason with 8 wide retire. The integer core is 7 wide in Zen2. If it grows to 8 wide, would the back end still handle it or would it be widened?
It is already 8 pipes in Zen2, the problem is there is only four AVX256 issue ports. Supporting 4x AVX128 or 4x AVX256.
View attachment 13756
4 FMULs, 4 FADDsWhat is the image supposed to be pointing out?
That's exactly my point of view either. Making CPU wider is the most complex work IMHO. That's why AMD might decided to develop completely new uarch as a solid and wide base for their future CPU evolutions. Zen 4 and 5 can add more complexity to gain much more performance without being limited by uarch (to pick the lowest hanging fruits, however they need to build solid platform to reach those fruits, because those low hanging fruits are quite high actually).Doubling everything to 4 MUL and 4 ADD requires a large upgrade to scheduling and instruction issue/retire. If they want to do AVX512 with two 256-bit micro ops, then doubling makes sense. If they aren’t doing that, then an increase up to 3 MUL + 3 ADD could makes sense for just larger AVX256 throughput.
The L1 cache is 2x32 byte load (2x256-bit) and 32 byte write (256-bits) per clock. The cache bandwidth could be a bottleneck, but saying increased by 40% doesn’t make much sense. You would expect that they would double the bandwidth to 4x32 bytes. I suppose they could go up to 3x32 bytes, but that isn’t 40%, it is 50%. Perhaps some mixture of read and write increases.
I think we should probably take it like Intel's old "tock" changes, with the Zen1 and Zen3 being in that area, and Zen+ and Zen2 being in the "tick" change area.Completely new uarch doesn't mean something radical.
Intel traded on this for years with SSSE3 and full SSE4.x - I'd say it's a big part of the reason AVX got announced by Intel when AMD announced SSE5, it's about perception of being in front of innovation.This makes little sense to me and it seems more like a "Hey my chip's got this one, you should have bought Intel!"
Unified L3 cache is the main reason to call it as completely new architecture??? Maybe in your Intel world. AMD isn't playing this Intel's +3% IPC game. As Keller said in Intel, they made a plan for 50x larger CPUs in next 20 years. I'm pretty sure he did the same in AMD. And they will choose the most effective configuration/technologies for given manufacturing process. And this is what Apple is doing for a long time, AMD is doing that since Keller brought it from Apple. That's why Zen 3 could be quite a different uarch from Zen1/2. Zen 3 could be something like Apple's A11 Hurricane. Small performance jump despite of 6xALUs however new uarch brought solid base for much better performing A12 Vortex (Zen4).
I can only assume he means 50x transistors total, rather than per core transistor count.50x larger CPU? 50x as large as now or 50 variations? When and where was this said? This is the first time I'm hearing of it.
50x transistor rate spread over 20 years?I can only assume he means 50x transistors total, rather than per core transistor count.
Because a core with 50x the transistor budget would be insane even at <1nm.