Discussion Speculation: Zen 4 (EPYC 4 "Genoa", Ryzen 7000, etc.)

Vattila · Oct 6, 2019

Except for the details about the improvements in the microarchitecture, we now know pretty well what to expect with Zen 3.

The leaked presentation by AMD Senior Manager Martin Hilgeman shows that EPYC 3 "Milan" will, as promised and expected, reuse the current platform (SP3), and the system architecture and packaging looks to be the same, with the same 9-die chiplet design and the same maximum core and thread-count (no SMT-4, contrary to rumour). The biggest change revealed so far is the enlargement of the compute complex from 4 cores to 8 cores, all sharing a larger L3 cache ("32+ MB", likely to double to 64 MB, I think).

Hilgeman's slides did also show that EPYC 4 "Genoa" is in the definition phase (or was at the time of the presentation in September, at least), and will come with a new platform (SP5), with new memory support (likely DDR5).

What else do you think we will see with Zen 4? PCI-Express 5 support? Increased core-count? 4-way SMT? New packaging (interposer, 2.5D, 3D)? Integrated memory on package (HBM)?

Vote in the poll and share your thoughts!

Joe NYC · Aug 8, 2021

Thibsie said:
Of course they are. you don't even try to answer my point.

Actually, I did not understood your point.

Because if CCDs are the limiting factor (AMD capability to obtain sufficient number from TSMC), you can't sell 2x 5900x or 2x 5950x instead of 2x5800x

Unless you meant to say that the substrate is the limit, which you did not say.

Joe NYC · Aug 8, 2021

moinmoin said:
You just described the 5800X. That aside I can only second @Vattila's post above.

Well, not exactly, because 5800x boost clock is 4.7 GHz, and 5950x is 4.9 GHz. While 5800x has more thermal headroom to run more cores at or close to their boost clock than 5950x

LightningZ71 · Aug 8, 2021

Joe NYC said:
Well, AMD is able to sell a lot of the 5800x despite the fact that it is arguably the worst deal for the customers (and the best deal for AMD) of the 5000x line of processors.

8:1 in proportion to 5950x

View attachment 48436

TechEpiphany on Twitter: "🔥 CPU Sales Week 30 (mindfactory) 🔥 AMD: 2575 units sold, 76.98%, ASP: 320.84 Intel: 770, 23.02%, ASP: 233.89 AMD Revenue: 826'175, 82.1% Intel Revenue: 180'092, 17.9% Please share 🙏This is a product of weeks of work #AMD #Intel #AMDRyzen https://t.co/3C0LbV2GTK" / Twitter

This isn't really supporting your argument very well. Given what we are seeing with availability, they are selling every chip they make, and what you are seeing is, roughly, what they are sending to the market.

DrMrLordX · Aug 8, 2021

Joe NYC said:
Interesting idea.

Slight problem is that if you have 2 cores highly utilized, you would want them to be on different CCDs, to fully take advantage of L3s and thermals.

And the 16 cores still need to adhere to minimum all core frequency within the power limit...

AMD already does this. It doesn't really pose a problem for them, especially since "bad" CCDs often have only 2-4 cores that put the CCD into a bad bin position. The minimum all-core frequencies are quite low compared to max boost.

amd6502 · Aug 8, 2021

All I really hope for with Zen4 is that they branch out from the limitations of SMT to a more general multithreading, which would allow a mode of operation where one thread is given a priority over all other threads (which would be 1 other thread, or 3 other threads per core, depending on whether the core is upgraded to 4-way MT or stays 2-way MT).

Vattila · Aug 8, 2021

Joe NYC said:
My problem is that AMD is NOT selling a version of 5800x, say 5810x with 4.9 GHz or higher boost clock.

Just buy a 5950X, disable the lowest boosting CCX, and slap a home-made "5810X inside" sticker on your DIY PC.

Note that by reserving the highest binning CCDs for the 5950X low-volume flagship, AMD could push frequency to the max, finding the optimal frequency where the limited supply of parts binning at said high frequency was sufficient to satisfy the target sales at the corresponding price point. They then set the 8-core volume parts at a frequency that was binning well.

PS. On V-Cache — if AMD really needs 2, 3 or 4 layers of V-Cache to compete with "Alder Lake", it would of course be nice if they could launch around the same time. Assuming multi-layer V-Cache is in the works, then like you, I see no technical barriers. The CCDs with V-Cache should be usable across server, HEDT/workstation and desktop SKUs (barring any Z-height issues). That said, one layer of V-Cache (i.e. triple the amount of current L3!) plus a little higher boost frequency may be enough to match or beat "Alder Lake" (especially at the same power consumption, rumours suggest). We'll see!

Joe NYC · Aug 8, 2021

Vattila said:
Just buy a 5950X, disable the lowest boosting CCX, and slap a home-made "5810X inside" sticker on your DIY PC.

That would be more like old Intel style of segmentation and rationing of performance to force people to buy something they don't want in order to get something they want....

Vattila said:
Note that by reserving the highest binning CCDs for the 5950X low-volume flagship, AMD could push frequency to the max, finding the optimal frequency where the limited supply of parts binning at said high frequency was sufficient to satisfy the target sales at the corresponding price point. They then set the 8-core volume parts at a frequency that was binning well.

Considering how easy it is for AMD to create an SKU, I am surprised they have not covered more ground. There is no reason not to have a flagship 8 core along with flagship 16 core.

Vattila said:
PS. On V-Cache — if AMD really needs 2, 3 or 4 layers of V-Cache to compete with "Alder Lake", it would of course be nice if they could launch around the same time. Assuming multi-layer V-Cache is in the works, then like you, I see no technical barriers. The CCDs with V-Cache should be usable across server, HEDT/workstation and desktop SKUs (barring any Z-height issues).

My guess is that AMD is planning on doing exactly that - announcing and launching within the time window of Alder Lake launch.
And then launching Milan X around the time of Sapphire Rapids launch

Vattila said:
That said, one layer of V-Cache (i.e. triple the amount of current L3!) plus a little higher boost frequency may be enough to match or beat "Alder Lake" (especially at the same power consumption, rumours suggest). We'll see!

Since it looks like Zen 4 Ryzen may be some time away, I think this Zen 3D should launch with a bang, not a whimper as far as what SKUs are brought to market...

We will see what is revealed at Hot Chips...

Thunder 57 · Aug 9, 2021

amd6502 said:
All I really hope for with Zen4 is that they branch out from the limitations of SMT to a more general multithreading, which would allow a mode of operation where one thread is given a priority over all other threads (which would be 1 other thread, or 3 other threads per core, depending on whether the core is upgraded to 4-way MT or stays 2-way MT).

Oh no, not SMT4 again. How would it know which thread to give priority too, though? Presumably it's possible, as Intel is counting on less demanding threads being sent to Atom cores.

DisEnchantment · Aug 9, 2021

Vattila said:
On V-Cache — if AMD really needs 2, 3 or 4 layers of V-Cache to compete with "Alder Lake"

For the client computing,
Zen2 --> Zen3 has 9% increase in effective die area for average 19% increase in IPC across all loads.
Zen3 --> Zen3D with 4 layers of V-Cache would result in 2.8x increase in effective die area for a questionable gain in general purpose compute outside of gaming.
Not to mention 2.8x die area + packaging cost would result in almost 3x more production cost per CCD. N6/7 might have gotten cheaper but I doubt 3x cheaper.
Just plain lackluster engineering if its entire purpose is to defeat Alder Lake. On the same lines like NetBurst going for the MHz with no improvements, if not regression, elsewhere.
And what about that projected 46% GM in the Earnings call?

Zen3D was not developed to address gaming or as a response to Alder Lake, Ryzen with Zen3D is, most likely, simply some rejected dies from Milan-X because the HPC/DC/Server market can sustain those high costs and they are really finding the excellent use for those huge caches.
Don't expect good availability.

DisEnchantment · Aug 9, 2021

amd6502 said:
All I really hope for with Zen4 is that they branch out from the limitations of SMT to a more general multithreading, which would allow a mode of operation where one thread is given a priority over all other threads (which would be 1 other thread, or 3 other threads per core, depending on whether the core is upgraded to 4-way MT or stays 2-way MT).

Seems AMD have been thinking on those lines with one thread having priority of resources in SMT mode. The threads in the core compete for the resources and gets some priority over the other and the Core resources are not evenly split.
Seems like a good idea, provided the algorithm for resource priority assignment and allocation is working well.

20210096914

https://www.freepatentsonline.com/y2021/0096914.html

SOFT WATERMARKING IN THREAD SHARED RESOURCES IMPLEMENTED THROUGH THREAD MEDIATION

20210096920

https://www.freepatentsonline.com/y2021/0096920.html

SHARED RESOURCE ALLOCATION IN A MULTI-THREADED MICROPROCESSOR

moinmoin · Aug 9, 2021

Joe NYC said:
My problem is not that AMD is selling 5950x with 4.9 GHz boost clock.

My problem is that AMD is NOT selling a version of 5800x, say 5810x with 4.9 GHz or higher boost clock.

In addition to current 5800x CPU with 4.7 GHz boost clock.

I hope you are aware you're talking about a frequency difference of 4,26%.

Joe NYC said:
That would be more like old Intel style of segmentation and rationing of performance to force people to buy something they don't want in order to get something they want....

AMD apparently prefers to have as few DIY desktop SKUs as possible. The segmentation between the SKUs then has to work on several parameters to make the more costly SKUs seem worth the premium. AMD however doesn't segment functionality (aside the PRO features) so that leaves amount of cores and frequencies as available parameters, which is exactly what AMD makes use of. And the boost clock difference between 5600X, 5800X, 5900X and 5950X is exactly 100MHz each. Maybe you'd prefer the difference to be 25MHz? To me the difference is honestly minuscule enough that to me all this discussion is a whole lot of whatever...

DisEnchantment said:
For the client computing,
Zen2 --> Zen3 has 9% increase in effective die area for average 19% increase in IPC across all loads.
Zen3 --> Zen3D with 4 layers of V-Cache would result in 2.8x increase in effective die area for a questionable gain in general purpose compute outside of gaming.
Not to mention 2.8x die area + packaging cost would result in almost 3x more production cost per CCD. N6/7 might have gotten cheaper but I doubt 3x cheaper.
Just plain lackluster engineering if its entire purpose is to defeat Alder Lake. On the same lines like NetBurst going for the MHz with no improvements, if not regression, elsewhere.
And what about that projected 46% GM in the Earnings call?

Zen3D was not developed to address gaming or as a response to Alder Lake, Ryzen with Zen3D is, most likely, simply some rejected dies from Milan-X because the HPC/DC/Server market can sustain those high costs and they are really finding the excellent use for those huge caches.
Don't expect good availability.

While this may be true I think the per area cost for the SRAM wafers are significantly lower: Far smaller dies, higher yield also due to far more repetitive patterns, and I think those pure SRAM V-Cache dies needs fewer layers than a full CPU design (not sure about that)?

So the advantage of this approach at this point very likely is rejuvenating an already paid off CPU die design without the whole cost for a new mask, validation etc. for the CPU die itself. The SRAM die may even be stock TSMC (so no added cost)?

Joe NYC · Aug 9, 2021

DisEnchantment said:
For the client computing,
Zen2 --> Zen3 has 9% increase in effective die area for average 19% increase in IPC across all loads.
Zen3 --> Zen3D with 4 layers of V-Cache would result in 2.8x increase in effective die area for a questionable gain in general purpose compute outside of gaming.
Not to mention 2.8x die area + packaging cost would result in almost 3x more production cost per CCD. N6/7 might have gotten cheaper but I doubt 3x cheaper.

As a reality check, the current lowest end GPU AMD is making is 237 mm2 of a logic die + board + memory + power management. All this effort for gaming, all for $379

Why would does it seem insurmountable to have less die area of much cheaper SRAM die, with higher yields in a CPU that will sell more, with lower bill of materials?

If 224mm2 is your ceiling, should both AMD and NVidia immediately stop producing graphics cards?

DisEnchantment said:
Just plain lackluster engineering if its entire purpose is to defeat Alder Lake. On the same lines like NetBurst going for the MHz with no improvements, if not regression, elsewhere.
And what about that projected 46% GM in the Earnings call?

NetBurst is known (infamous) for low IPC.
V-Cache will be known for improving IPC

DisEnchantment said:
Zen3D was not developed to address gaming or as a response to Alder Lake, Ryzen with Zen3D is, most likely, simply some rejected dies from Milan-X because the HPC/DC/Server market can sustain those high costs and they are really finding the excellent use for those huge caches.
Don't expect good availability.

How different is it from Zen 3, which was impossible to buy at MSRP for 6 months?
Should AMD have not released Zen 3 for client / desktop?

All of the Ryzen 5000x product line has dies that could be sold in Milan server chips.
Should AMD have not released Zen 3 for client / desktop?

I am wondering why people are inventing these random bars to clear that apply only to Zen 3D and no other product.

Joe NYC · Aug 9, 2021

moinmoin said:
I hope you are aware you're talking about a frequency difference of 4,26%.

It is also symbolic, as in making it a priority, taking a segment which most benefits from such a CPU - gaming - seriously.

And BTW, if 4.9 GHz is the limit in 16 core CPU, it is possible that 5 GHz may be achievable in 8 core CPU

moinmoin said:
AMD apparently prefers to have as few DIY desktop SKUs as possible. The segmentation between the SKUs then has to work on several parameters to make the more costly SKUs seem worth the premium.

I noticed the few SKUs.

moinmoin said:
AMD however doesn't segment functionality (aside the PRO features) so that leaves amount of cores and frequencies as available parameters, which is exactly what AMD makes use of. And the boost clock difference between 5600X, 5800X, 5900X and 5950X is exactly 100MHz each. Maybe you'd prefer the difference to be 25MHz? To me the difference is honestly minuscule enough that to me all this discussion is a whole lot of whatever...

No, I would go from 1 SKU for 8 core to 2 (high clock and lower clock)
Also 2 SKUs for 16 core (high clock and lower clock).

That's all. All together, from 4 5000x SKUs to 6.

DisEnchantment · Aug 9, 2021

moinmoin said:
While this may be true I think the per area cost for the SRAM wafers are significantly lower: Far smaller dies, higher yield also due to far more repetitive patterns, and I think those pure SRAM V-Cache dies needs fewer layers than a full CPU design (not sure about that)?

So the advantage of this approach at this point very likely is rejuvenating an already paid off CPU die design without the whole cost for a new mask, validation etc. for the CPU die itself. The SRAM die may even be stock TSMC (so no added cost)?

I don't think anybody on the forums is trying to argue about the advantages because that is the whole purpose of the architectural design for chiplets and stacking in the first place. AMD has been beating this trumpet since a while.

But to me the noise about Zen 3D being a primary response to Alder Lake does not make sense, it was designed for something else. And trickles down to Desktop.
Daytona platform already have BIOS support for V-Cache since a while because it was designed for that market all along.
Regarding cost of each SRAM die, I won't argue since I don't have data.
But stacking SRAM, it is not going to be cheap, a similar scenario is there with DRAM, there is a reason why DRAM memory of some capacity is much cheaper than HBM even though they use the similar underlying DRAM dies, simply because of the amount TSVs needed and the binning needed to get the final KGSDs

jpiniero · Aug 9, 2021

DisEnchantment said:
But to me the noise about Zen 3D being a primary response to Alder Lake does not make sense, it was designed for something else. And trickles down to Desktop.

It's possible that AMD chose this over Zen 3(+) on AM5 because of Alder Lake's gaming performance.

Joe NYC · Aug 9, 2021

DisEnchantment said:
Daytona platform already have BIOS support for V-Cache since a while because it was designed for that market all along.

Speaking of that BIOS, the "Auto" option apparently stands for 8 stacks high

eek2121 · Aug 9, 2021

Joe NYC said:
It is also symbolic, as in making it a priority, taking a segment which most benefits from such a CPU - gaming - seriously.

And BTW, if 4.9 GHz is the limit in 16 core CPU, it is possible that 5 GHz may be achievable in 8 core CPU

I noticed the few SKUs.

No, I would go from 1 SKU for 8 core to 2 (high clock and lower clock)
Also 2 SKUs for 16 core (high clock and lower clock).

That's all. All together, from 4 5000x SKUs to 6.

The 5950X already hits 5050 mhz (5.05 ghz) out of the box.

Joe NYC · Aug 9, 2021

eek2121 said:
The 5950X already hits 5050 mhz (5.05 ghz) out of the box.

But that's not a stock frequency, is it?

eek2121 · Aug 9, 2021

Joe NYC said:
But that's not a stock frequency, is it?

Yes, it is stock. I said “out of the box”.

AMD does not advertise it because some cores don’t quite reach it. On my machine, 4 cores only reach 4.95 ghz, the rest will reach 5.05 ghz.

Joe NYC · Aug 9, 2021

eek2121 said:
Yes, it is stock. I said “out of the box”.

AMD does not advertise it because some cores don’t quite reach it. On my machine, 4 cores only reach 4.95 ghz, the rest will reach 5.05 ghz.

That's VERY nice....

I have to check my CPU, what clock speeds I am getting....

BTW, but this may explain some of the oddities of testing, if a CPU can optionally exceed its official boost frequencies out of the box.

Or is this only the feature of 5950x?

LightningZ71 · Aug 9, 2021

Joe NYC said:
It is also symbolic, as in making it a priority, taking a segment which most benefits from such a CPU - gaming - seriously.

And BTW, if 4.9 GHz is the limit in 16 core CPU, it is possible that 5 GHz may be achievable in 8 core CPU

I noticed the few SKUs.

No, I would go from 1 SKU for 8 core to 2 (high clock and lower clock)
Also 2 SKUs for 16 core (high clock and lower clock).

That's all. All together, from 4 5000x SKUs to 6.

They ALREADY have more than 4 SKUs. There are lower wattage non-x SKUs that are offered to OEMs. The 5900 and 5800 have been out for a while now.

moinmoin · Aug 9, 2021

Joe NYC said:
It is also symbolic, as in making it a priority, taking a segment which most benefits from such a CPU - gaming - seriously.

🤦

Joe NYC said:
And BTW, if 4.9 GHz is the limit in 16 core CPU, it is possible that 5 GHz may be achievable in 8 core CPU

Seems you don't know how AMD's boost frequencies work with Zen 3 chips: Unlike for previous gens these are guaranteed frequencies, and most chips pass them at stock. And 5 GHz is already achievable depending on the chip.

DisEnchantment said:
But to me the noise about Zen 3D being a primary response to Alder Lake does not make sense, it was designed for something else. And trickles down to Desktop.
Daytona platform already have BIOS support for V-Cache since a while because it was designed for that market all along.

Of course Zen 3D being a gaming performance response to Alder Lake is nonsense, I sure hope everybody sane sees it's a PR move first and foremost. One targeted at the consumer audience, so obviously no talk about servers. But of course it was first designed for use in server like everything Zen was. My previous guess was that Zen 3 was originally planned together with 3D stacking, and either markets circumstances (weak competition) didn't require AMD to go all out with the most costly version, or the launch schedule for 3D stacking slipped, or Zen 3 sans stacking was actually moved ahead. Maybe a mix of all that.

Joe NYC · Aug 10, 2021

moinmoin said:
Of course Zen 3D being a gaming performance response to Alder Lake is nonsense, I sure hope everybody sane sees it's a PR move first and foremost. One targeted at the consumer audience, so obviously no talk about servers.

The talk is about Ryzen class SKUs of Zen 3D. That the SKUs are going to be designated as a gaming response to Alder Lake gaming SKU launch.

moinmoin said:
But of course it was first designed for use in server like everything Zen was.

No one is arguing otherwise.

But the chiplet approach allows AMD to share the server chiplet with high end desktop, so the chiplet based Ryzen processors get to benefit from it.

moinmoin said:
My previous guess was that Zen 3 was originally planned together with 3D stacking, and either markets circumstances (weak competition) didn't require AMD to go all out with the most costly version, or the launch schedule for 3D stacking slipped, or Zen 3 sans stacking was actually moved ahead. Maybe a mix of all that.

The launch dates did not align for Zen 3 to be launched with V-Cache, but since Zen 3 is such a flexible design, it is doing exceedingly well in the market place even without V-Cache.

The automated assembly facility (of TSMC) that will specialize in this sort of stacking was due to come online in May 2021. I am not sure what the actual completion date is (or was). But the stars are now all aligned for the Zen 3D.

jpiniero · Aug 10, 2021

moinmoin said:
Of course Zen 3D being a gaming performance response to Alder Lake is nonsense,

The 3D stacking in general, yes, that was Milan-X. But AMD must have decided that doing a 1 layer version of the chiplet (for desktop gaming perf) was better than the alternatives given the time frame.

Magic Carpet · Aug 17, 2021

Looks like power goes up for AMD as well. Liquid cooling the new norm, lol?

More cores, the war is approaching. 12/24 the new mainstream. Can’t wait, the future is bright 😈

Discussion Speculation: Zen 4 (EPYC 4 "Genoa", Ryzen 7000, etc.)

Senior member

Diamond Member

Diamond Member

Platinum Member

Lifer

Senior member

Senior member

Diamond Member

Diamond Member

Golden Member

Golden Member

Diamond Member

Diamond Member

Diamond Member

Golden Member

Lifer

Diamond Member

Diamond Member

Diamond Member

Diamond Member

Diamond Member

Platinum Member

Diamond Member

Diamond Member

Lifer

Diamond Member