AMD can't use N6 if they want to do stacking because the TSMC tech is N7 on N7 or N5 on N5
Yeah 99% 6nm is 7nm equivalent in terms of stacking they already are really similar. TSMC predicts over 50% of 7nm users to pivot to 6nm. No way they'll just shut them out of stackingI think for stacking purposes, N6 might be considered N7 variant.
And when N5 on N5 becomes available, I think N5 on N7 and N7 on N5 stacking can also be used. But I am sure we will get some progress updates from TSMC along the way. We are still approx. one year away from N5 stacking.
In the case of AMD stacking SRAM on top on L3 on CCD, the SRAM stacked and the L3 should have the same surface area, so it should be made both in same process. How the TSV points will match between bottom SRAM and L3 cache CCD if it's in different process ? Maybe TSMC will bring too N6 on N6...
I think I am going to disagree here.
AMD specifically emphasized gaming performance. AMD is now fully on board in keeping that crown. The best gaming performance could be obtained from single chiplet 8 core CPU with high L3 cache.
2 reasons:
- higher power to cores in 8 core vs. 16 core config, can lead to higher sustained turbo clocks
- 1 chiplet eliminates coherency traffic and some memory latency hit
The fact that 5950 has a higher boost clock just means someone was not thinking things through fully.
The easiest way for AMD to keep gaming performance crown is to release highest clock 8 core CPU with 1-4 layers of V-Cache.
You may be right. But I am assuming there is a cache increase in Genoa anyways, don't think they need to race themselves in cache sizes at this point (need to stay ahead but they are by miles already). But its more likely AMD would launch as a second or third batch of sku's 6 months or more into the Genoa 2+ years of offering (specially if Zen 5 is Big.little and not meant for enterprise) 2 years into Milan. I think its for Ryzen and maybe TR at first. RDNA3 second then Genoa. Two to three opportunities and uses cases to make sure it works and yields are good before trying to apply it to your server product.There's no guarantee that Genoa will have vcache options at launch. It's entirely possible those come later.
You may be right. But I am assuming there is a cache increase in Genoa anyways, don't think they need to race themselves in cache sizes at this point (need to stay ahead but they are by miles already). But its more likely AMD would launch as a second or third batch of sku's 6 months or more into the Genoa 2+ years of offering (specially if Zen 5 is Big.little and not meant for enterprise) 2 years into Milan. I think its for Ryzen and maybe TR at first. RDNA3 second then Genoa. Two to three opportunities and uses cases to make sure it works and yields are good before trying to apply it to your server product.
I don't think the two are mutually exclusive in this instance.They don't need to be massively ahead, but if they could with some product they could sell it for arm and leg. Sure staying ahead isn't AMD main priority but making buttloads of money......
I don't think the two are mutually exclusive in this instance.
But they are already ahead by a huge margin 3x more than Intel's glued CPU. More than a 2s system of socket. They could have half a gig of cache in a 2s system right now. So unless a company signed a contract saying they would buy X of Milan with 1GB of L3 for insane amounts of money, AMD wouldn't do it. The margin is pretty damn great there but the process for getting that out the door, testing and dev wise would be pretty high for a CPU that otherwise already be getting sold (again Milan is sooooooo far ahead if it mattered these guys are already purchasing Milan for it). So yeah Semi custom and a customer buying thousands at 50k-100k pop, I could buy it. But I don't think it would have been a normal part of the dev plan as again Milan3d considering Milan's lead would just be canabalizing sales.They don't need to be massively ahead, but if they could with some product they could sell it for arm and leg. Sure staying ahead isn't AMD main priority but making buttloads of money......
The server market moves so slowly though, Rome is still their majority selling platform. Im not sure they would cannibalize sales at all by announcing faster skus. And after all there is no such thing as a free lunch, the extra cache will help many workloads but it will require more power if not exceptional binning, or lowered clocks. Also don't forget it will absolutely add cost over regular milan. Not all customers are going to have that need and I assume most will keep buying skus from the mainstream enterprise lineup.But I don't think it would have been a normal part of the dev plan as again Milan3d considering Milan's lead would just be canabalizing sales.
The server market moves so slowly though, Rome is still their majority selling platform. Im not sure they would cannibalize sales at all by announcing faster skus. And after all there is no such thing as a free lunch, the extra cache will help many workloads but it will require more power if not exceptional binning, or lowered clocks. Also don't forget it will absolutely add cost over regular milan. Not all customers are going to have that need and I assume most will keep buying skus from the mainstream enterprise lineup.
Lurker here chiming in, forgive me if this thought is silly.
Might it be that Zen 3 with stacking is coming to AM5? AMD demonstrated/showed off a sample on an AM4 sample (at least as far as I am aware), but would there be anything major stopping AMD from using a new AM5/DDR5 IO die with these stacked zen 3 chiplets?
Maybe even launch it on AM4 first and then AM5 shortly after to get the new platform rolling? Or both at once?
Could also help smooth out any DDR5 or other platform teething issues ahead of Zen 4 by getting early adopters/enthusiasts to stress test AM5.
This has been brought up previously. If the Zen 4 IO die uses the same IFIS (SerDes) then it should be backward compatible. It is like pci-express backward comparability; there wouldn’t be any technological reason why they wouldn’t be able to make such a part. It might be useful to get AM5 shipping before Zen 4 is available. Would you buy it though? With Zen 4 likely coming soon after it, it doesn’t seem like there is that much reason to make it.Lurker here chiming in, forgive me if this thought is silly.
Might it be that Zen 3 with stacking is coming to AM5? AMD demonstrated/showed off a sample on an AM4 package, (at least as far as I am aware), but would there be anything major stopping AMD from using a new AM5/DDR5 IO die with these stacked zen 3 chiplets?
Maybe even launch it on AM4 first and then AM5 shortly after to get the new platform rolling? Or both at once?
Could also help smooth out any DDR5 or other platform teething issues ahead of Zen 4 by getting early adopters/enthusiasts to stress test AM5.
(Edit for grammar/clarity)
They might use one cache die all the way down to 6 core parts. Anything lower should probably just be an APU anyway. M The cache die are probably relatively cheap. It is only 36 mm2, which is tiny. It probably doesn’t use very many metal layers and it might be made on N6 for actual volume production. That will have more EUV which avoids double and quad patterning (multiplies the number of mask and process steps by 2 or 4 for some layers). They may be making CCD with stacked cache all of the way down to 1 active core for Epyc, like the current 8 core 72F3 (8-core, 256 MB of cache). I kind of hope that we get a full N6 version for Zen 3D. I still doubt that we will see 4 high stacking in anything other than very high end Epyc. The full 288 MB per CCD version, if it exists, would have 2304 MB total L3 cache, which is truly ridiculous.My, completely wild guess would be that there would be 6950x and 6800x, both 2x8 and 1x8 core versions, and no 6 core version (as was demonstrated by Lisa Su).
Because, Zen 3D stacking is based on known good die, so why waste stacking on die that has only 6 good cores, if there are plentiful die with 8 good cores?
Well pretty much yeah. Except it was the 1.13GHz model that was recalled, not the 1GHz. Small detail, not trying to nitpick just wanted to clarify in case someone tried to go search for info on it.
AMD did beat Intel to 1GHz by a matter of a week or two maybe, but the Pentium 3 model was a paper launch whereas you could get the Athlon 1 GHz.
Kyle Bennett, there's a name I haven't heard in awhile. He was right up there with Anand. Anandtech was involved in the recalled P3 as well. With so many reputable sources confirming the problem Intel had no choice but to go into damage control mode.
About stacking different nodes I don't see an obvious problem to do that, the contacts are 900nm pitch metal layer, I'm pretty sure we can make that with all the latest process nodes even 12nm, the only 'problem' that I could see is different transistor properties when you want them to be as equal as possible on both ends of the wire, but you can take that in your design.Yeah 99% 6nm is 7nm equivalent in terms of stacking they already are really similar. TSMC predicts over 50% of 7nm users to pivot to 6nm. No way they'll just shut them out of stacking
Given the “chonky” package renderings that have been showing up, which may be completely bogus, I am wondering if the chiplet based parts will actually have an integrated vapor chamber. Spreading the chips apart rather than monolithic helps with the thermal density, but we are talking about a lot of power out of a very small chip at 5 nm. Perhaps they will have APUs with just a cheap lid.I suspect that there is still space in the SKU stack for a 12 core part and an 8 core part. It makes sense to me to bin the CCDs for stacking against thermal efficiency, in that, the stacking will likely reduce the thermal disipation rate of the CCDs, so binning the dies for lowest power consumption at the target all core boost so that they can achieve a clock bump over the existing parts while also getting the L3 stack as well seems doable. I think that the 12 core part could be a particularly interesting part as the individual cores could boost higher together and, with 96MB of L3 with the 64MB stack will give 16MB of L3 per core if the cache is being thrashed. On the desktop, and currently on low end HEDT, the 12 core part will be unparalleled in cache per core and likely all core clocks.
They might use one cache die all the way down to 6 core parts. Anything lower should probably just be an APU anyway. M The cache die are probably relatively cheap. It is only 36 mm2, which is tiny. It probably doesn’t use very many metal layers and it might be made on N6 for actual volume production. That will have more EUV which avoids double and quad patterning (multiplies the number of mask and process steps by 2 or 4 for some layers). They may be making CCD with stacked cache all of the way down to 1 active core for Epyc, like the current 8 core 72F3 (8-core, 256 MB of cache). I kind of hope that we get a full N6 version for Zen 3D. I still doubt that we will see 4 high stacking in anything other than very high end Epyc. The full 288 MB per CCD version, if it exists, would have 2304 MB total L3 cache, which is truly ridiculous.
7nm to 6nm is a production improvement not a product improvement.I would like to be surprised and have AMD use N6 for these CCDs. The reduction in energy draw and additional performance available on that node would likely be enough to push Zen3d into a performance leadership position against Alder Lake's best parts.