- Mar 3, 2017
- 1,777
- 6,791
- 136
The word heterogenous is not exclusive to AMDAMD calls it heterogenous
Hence the quotation marks in my original post.Which people can't buy en masse till July or August....
Neither is CPU, or Zen. In fact my favorite sports bar is named Zen.The word heterogenous is not exclusive to AMD
IMHO there are four main aspects:Interesting thing today with RDNA 3 launch is that N31 uses EFB as interconnect. And Angstronomics ( who happens to get things right on N31) also says N32 will use chiplets as well.
N32 will likely be priced in the range of ~600 USD and containing 4 interconnects.
N32 will be high volume parts so would this mean AMD has sorted out the economics of EFB? EFB is done at AMD's packaging facilities
Interesting thing was that the guy presenting the chiplet tech in N31 is Sam Naffziger who also was a key person for Zen chiplets.
If they were to use EFB for CPUs, for two CCDs, they would need only two EFB interconnects to connect the IOD to CCDs.
I am just wondering if this is the tech they would use to replace the SerDes links.
Then whats wrong in calling AMD's hybrid approach big little?Neither is CPU, or Zen. In fact my favorite sports bar is named Zen.
Does not make it any more invalid than if AMD calls their CPU ... CPU
In the original context it seems he was correcting the rumor that it was NOT big little. The nomenclature isn't important but AMD calling it heterogeneous in their own documentation more or less confirms at least 2 different core types. While some old rumors from Chinese forums say it isn't should probably be treated as less reputable.Then whats wrong in calling AMD's hybrid approach big little?
N32 will be high volume parts ...
I am just wondering if this is the tech they would use to replace the SerDes links.
Mr. Naffziger seems to be the person driving AMD's advanced power-related techniques. His effort was aimed mainly at getting Bulldozer's successors power-efficient, then adapting the power saving tech for both GCN and APUs (e.g. Bristol Ridge), followed by Zen and RDNA.Interesting thing was that the guy presenting the chiplet tech in N31 is Sam Naffziger who also was a key person for Zen chiplets.
Well, seems AnandTech retracted their article on N31 using EFB interconnect. So quite likely Angstronomics is right again on N31 using InFO-oS.IMHO there are four main aspects:
So it pretty much boils down to priorities and the question if InFo-R would give them enough total area for stitching all those Zen5 EPYC CCDs together.
- Bandwidth - The IFoP only has around 1/10th of the bandwidth of what N3x has between each MCD and the GCD (900Gbyte/s). Something like InFo-R should be enough in this regard.
- Reticle Limit - EFB gives you total freedom. But even InFo-R should provide multiple times the Reticle Limit through reticle stitching.
- Energy consumption - As I understand it EFB should be much better compared to InFo-R - but maybe I am wrong as there are not a lot of figures available.
- Costs - EFB should have come down on costs but might still be much more expensive than InFo-R.
Might as well be that they deem IFoP sufficient for yet another generation.
Also from LinkedIn we can surmise GMI4 runs at 64 Gbps on N3 nodes which is only possible if there is an interconnect chip with repeaters instead of high energy medium range PHYs. Otherwise they will burn even more power than what they current do on GMI3.In some embodiments, the first semiconductor chip is a core complex die, the second semiconductor chip is a core complex die, and the third semiconductor chip is an input/output die.
In the patent, they mentioned TSVs for allowing power and some contacts in case the CCD needs to reach the substrate, when it is blocked by interconnect chip.But they also mention TSVs for the bridge chip.
In the patent, the main idea is to overcome the reach of short range interconnects (obviously not using medium range PHYs like GMI2/3/4)Maybe they want to increase bandwidth massively in order to make the L3 of neighboring CCDs accessible to each other?
GMI3 is 32 GT/s/lane. We are talking about different things here I believe. BW is not as critical for CPUs as latency. The higher transfers per second it has per lane the lower the latency.But to my knowledge IFoP already is at 64Gbyte/s
Looks like plain InFO-R or AMD's equivalent of this tech. At best 4 copper layers, but definitely miles better than driving something through the substrate.
This is interesting. Why do I post here instead than in the RDNA3 thread? Because this may be a hint about the interconnect speeds that could be achievable on advanced packaging and then these could be applicable on Zen5, too. We have seen that IF links are becoming a limit already on Zen4, so improving these in Zen5 could remove a performance limitation.
The GLink-2.5D IP utilizes single-ended signaling on parallel bus with DDR clock forwarding. This allows for up to 8/16Gbps per pin consuming only 0.25pJ/bit on TSMC’s RDL-based InFO (Integrated-Fan-Out) or CoWoS (Chip-on-Wafer-on-Substrate). One slice has 32 full-duplex lanes and one PHY has 8 slices with 2/4Tbps maximum bandwidth. For the next generation GLink, one slice will have 56 full-duplex lanes and one PHY has 8 slices with 7.5 Tbps maximum bandwidth.
This leak is the one I strongly believe would be for Zen5 desktop CPU architecture. With removal of L3, AMD can double up Zen5 cores while maintaining similar die size which is important for Turin server CPU. And by sharing all L2 cache AMD can remedy latency issue with external L3 cache.Rgt saying zen 5 will have a unified l2 cache around the ccx and stacked l3.. will have zen 5 + zen 4 cores 🤔🤔![]()
Leak suggests AMD Zen 5 CPUs to pack impressive IPC performance gains and hugely increased core counts
RedGamingTech has leaked some early AMD Zen 5 info. The leak suggests Zen 5 chips will come with a hybrid design similar to Intel Alder Lake CPUs and will pack impressive gen-on-gen IPC performance gains.www.notebookcheck.net
This whole thing sounds like complete nonsense. Doubling the cores? +30% IPC? Unified L2 cache? Unified stacked L3 for everything? Yeah, I'm calling BS.Rgt saying zen 5 will have a unified l2 cache around the ccx and stacked l3.. will have zen 5 + zen 4 cores 🤔🤔![]()
Leak suggests AMD Zen 5 CPUs to pack impressive IPC performance gains and hugely increased core counts
RedGamingTech has leaked some early AMD Zen 5 info. The leak suggests Zen 5 chips will come with a hybrid design similar to Intel Alder Lake CPUs and will pack impressive gen-on-gen IPC performance gains.www.notebookcheck.net
man Apple's ex-cheif designer was way ahead. They already moved L3 cache ages ago and already moved to 8 wide decode in 2017 i think?With removal of L3, AMD can double up Zen5 cores while maintaining similar die size which is important for Turin server CPU. And by sharing all L2 cache AMD can remedy latency issue with external L3 cache.
That RGT video was posted more than 7 months ago. Pretty sure, it was already discussed back then...Rgt saying zen 5 will have a unified l2 cache around the ccx and stacked l3.. will have zen 5 + zen 4 cores 🤔🤔![]()
Leak suggests AMD Zen 5 CPUs to pack impressive IPC performance gains and hugely increased core counts
RedGamingTech has leaked some early AMD Zen 5 info. The leak suggests Zen 5 chips will come with a hybrid design similar to Intel Alder Lake CPUs and will pack impressive gen-on-gen IPC performance gains.www.notebookcheck.net
The term Beachfront is actually used?Unrelated to AMD, GLink provides similar inter die link like AMD's IFOP/GMI at 0.3pJ/bit on InFO-R(_oS)
View attachment 71129
AMD's IFOP seems more advanced than this scheme, at least based on open architecture. (low swing single ended signaling)
If they migrate their link to InFO-R they should be able to match this if not better.
IFOP via substrate --> ~2pJ/bit.
IFOP via RDL --> ~0.3pJ/bit
Up to ~7x reduction in pJ/bit.
BW is important, but latency is even more important in CPUs. How high they can clock would be very critical.
Wouldn't a unified large L2 cache be the next evolution in cache performance? A big slab of cache in the middle and cores placed on all sides of it?This whole thing sounds like complete nonsense. Doubling the cores? +30% IPC? Unified L2 cache? Unified stacked L3 for everything? Yeah, I'm calling BS.
Seems so, and IMHO it is a quite fitting term - just as real Beachfront it is rather limited on a die.The term Beachfront is actually used?