Discussion Zen 7 speculation thread

Page 15 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

marees

Golden Member
Apr 28, 2024
1,840
2,460
96
So it seems, but patent specifies it can be any non-local cache, so they could be coupled to Shader Engine private cache, L2, or MALL.
If the reduced L2 (AT2 = 24MB L2 vs Navi 48 = 64MB MALL) is accurate and CCU is leveraged for RDNA 5 then those being coupled to L2 would shrink L2 for other processes since they require sizeable dedicated. Wonder how AMD engineers would tackle this. A SE cache implementation could happen as well, but it would require a much bigger SE cache, but some other benefits like superior cache latency, closer integration with CUs (routing and latency) etc... Latter seems more likely given the entire thing about the supposedly (not confirmed IIRC) overhaul with autonomous SE scheduling and dispatch (WGS and ADC). In this case CCUs outside SEs would complicate things alot.

Remember CCU offloads work from CU, so overhead might be lower than what it seems. No need to duplicate instructions, but yeah still some overhead, but how much?

Highly doubt that. But some BW heavy RT instructions could be offloaded to CCUs.

Again lots of unaswered questions but one of the most interesting AMD patents in a long time and a massive departure architecturally from anything previous.

Edit: Forgot this is Zen 7 thread, so would this be feasible at all for a CPU architecture? Maybe AMD processing in-cache 3D V-Cache for zen7 xD
We have a RDNA 5 thread
 
  • Like
Reactions: MrMPFR

dangerman1337

Senior member
Sep 16, 2010
396
45
91
Reviving this thread to avoid going off topic in the Zen 6 thread, since @Kepler_L2 said A14 was way too off IIRC for Zen 7 timeframe (does that mean Zen 7 is a RZL Competitor than say Titan/Hammer Lake, early 2028 or even late 2027?) 6 months ago. If it's A16 process is it possible they can still do 16 Cores per CCD (A16 & A14 aren't that both far off in terms of density and performance) and doubling of L2 Cache per core? I mean maybe the difference will be A16 being a slightly bigger CCD and slightly lower clockspeeds. I mean if AMD really wants to they could do DDR6/AM6 Zen 8 in late 2029 on A14 SPR probably.

If there is a Zen 7 product on A14 I think that'll be some 2H of 2028 server product. If A16 CCD Zen 7 is like 30% cheaper than one on A14 and only few % slower (maybe still able to do 16 ore CCD Zen 7X3D at 7Ghz :p) and it's on AM5 anyways now... I'd just do that if I was AMD. Do Zen 8 on A14 SPR late 2029 and just have that sold along with Zen 7 (which will probably be done for a long time, Zen 7 is going to be likely "good enough" for a very long time).
 
Last edited:

marees

Golden Member
Apr 28, 2024
1,840
2,460
96
Reviving this thread to avoid going off topic in the Zen 6 thread, since @Kepler_L2 said A14 was way too off IIRC for Zen 7 timeframe (does that mean Zen 7 is a RZL Competitor than say Titan/Hammer Lake, early 2028 or even late 2027?) 6 months ago. If it's A16 process is it possible they can still do 16 Cores per CCD (A16 & A14 aren't that both far off in terms of density and performance) and doubling of L2 Cache per core? I mean maybe the difference will be A16 being a slightly bigger CCD and slightly lower clockspeeds. I mean if AMD really wants to they could do DDR6/AM6 Zen 8 in late 2029 on A14 SPR probably.

If there is a Zen 7 product on A14 I think that'll be some 2H of 2028 server product. If A16 CCD Zen 7 is like 30% cheaper than one on A14 and only few % slower (maybe still able to do 16 ore CCD Zen 7X3D at 7Ghz :p) and it's on AM5 anyways now... I'd just do that if I was AMD. Do Zen 8 on A14 SPR late 2029 and just have that sold along with Zen 7 (which will probably be done for a long time, Zen 7 is going to be likely "good enough" for a very long time).
Nvidia will take-up one node completely (to maintain their AI dominance, as they don't have a 2nm product as of now, I think )

Zen 7 will have to use what's left after that
 

dangerman1337

Senior member
Sep 16, 2010
396
45
91
By late 2027 AMD would most probably still have a backlog of Zen 6 launches - think of Threadrippers, various APU SKUs, etc.
True, but with AMD still supporting Zen 2 and Zen 3 with rebranding into the new numbering scheme. We're going to see probably SKUs of different architectures cross over each other because Zen 7 will be tad more costly than low end Zen 6 (let's say Zen 6 CCD is like 65mm2 on N2P, A16 16 Core Zen 7 could be like 80+ mm2). I mean Zen 6 low end and Zen 7 X3D could very well co-exist as launches (IMV if I was AMD I'd prioritize Zen 7 X3D launching first before other lower end client Zen 7, hell I won't be surprised if Zen 6 X3D launches on time Vs Nova Lake bLLC if that is Q4 2026 next year).

Like a 9/10 Core Zen 6 non-X3D SKU is going to be cheaper than a Zen 7 A16 X3D SKU that uses 4nm cache tiles.
 

marees

Golden Member
Apr 28, 2024
1,840
2,460
96
Reviving this thread to avoid going off topic in the Zen 6 thread, since @Kepler_L2 said A14 was way too off IIRC for Zen 7 timeframe (does that mean Zen 7 is a RZL Competitor than say Titan/Hammer Lake, early 2028 or even late 2027?) 6 months ago. If it's A16 process is it possible they can still do 16 Cores per CCD (A16 & A14 aren't that both far off in terms of density and performance) and doubling of L2 Cache per core? I mean maybe the difference will be A16 being a slightly bigger CCD and slightly lower clockspeeds. I mean if AMD really wants to they could do DDR6/AM6 Zen 8 in late 2029 on A14 SPR probably.

If there is a Zen 7 product on A14 I think that'll be some 2H of 2028 server product. If A16 CCD Zen 7 is like 30% cheaper than one on A14 and only few % slower (maybe still able to do 16 ore CCD Zen 7X3D at 7Ghz :p) and it's on AM5 anyways now... I'd just do that if I was AMD. Do Zen 8 on A14 SPR late 2029 and just have that sold along with Zen 7 (which will probably be done for a long time, Zen 7 is going to be likely "good enough" for a very long time).

This is what I was referring to

Nvidia will take-up one node completely (to maintain their AI dominance, as they don't have a 2nm product as of now, I think )

Zen 7 will have to use what's left after that

The A16 process, scheduled for mass production in 2027, is TSMC’s first 2-nanometer node to incorporate backside power delivery network (BSPDN) technology — one of the most advanced innovations in semiconductor manufacturing.

BSPDN is a groundbreaking process technology with no commercial precedent. Traditionally, both power and signal interconnects are placed on the front side of a chip. However, as circuit dimensions shrink, interference increases, complicating design and fabrication. BSPDN flips this structure by routing the power network on the backside and the signal network on the front, thereby alleviating interconnect bottlenecks and improving power efficiency.

Samsung Electronics and Intel are also preparing BSPDN adoption, and industry consensus expects both companies to implement it at the 2-nanometer node as well.

NVIDIA’s GPU roadmap follows the sequence Hopper → Blackwell → Rubin → Feynman. The Blackwell series is currently in shipment, with Rubin expected next year. The Feynman GPU, planned for release in 2028, is believed to be the first to use TSMC’s A16 process. Although the product launch is slated for 2028, production using A16 will likely begin in the second half of 2027, allowing about a year for ramp-up to improve yield and productivity.




 

Joe NYC

Diamond Member
Jun 26, 2021
3,734
5,296
136
Nvidia will take-up one node completely (to maintain their AI dominance, as they don't have a 2nm product as of now, I think )

Zen 7 will have to use what's left after that

I don't think that's how TSMC operates. There is no benefit to TSMC in letting one customer push out the rest of TSMC customers. More like the opposite of that.
 
  • Like
Reactions: Tlh97 and RnR_au

Doug S

Diamond Member
Feb 8, 2020
3,652
6,465
136
I don't think that's how TSMC operates. There is no benefit to TSMC in letting one customer push out the rest of TSMC customers. More like the opposite of that.

There isn't any way for one customer to push out the rest. If TSMC was planning on 100K wafer starts for A14 when mass production began and Nvidia came along and wrote them a megacheck for 100K wafer starts TSMC would use that money to expand capacity and they'd have 150K or 200K wafer starts when mass production begins.

That's why the claims that Apple is "buying up so much N2 capacity to hurt the competition" is stupid. Apple is buying the capacity they NEED. They aren't going to pay for huge numbers of N2 wafers just to toss a portion of them in the trash to screw the competition. Throwing away billions of dollars is no way to beat the competition! The fact it is ~50% of N2 capacity is because TSMC's had enough other committed or anticipated N2 orders for the rest. If Nvidia had come along and bought the same amount as Apple at the same time they did then it isn't like Apple would have 50% and Nvidia would have 50%. Apple would have 33%, Nvidia would have 33%, and there would be 33% for everyone else, because TSMC would have built more initial capacity.

There are some limits to this, TSMC couldn't jump from 100K wpm they are producing for N2 next year to 500K if someone had come along a few years ago with a check for 400K wpm, because at some point they'll be resource limited in building fab shells, equipping them with everything from plumbing to EUV machines, etc. But such concens are theoretical, TSMC would be able to handle any real world surge so long as orders are placed far enough in advance.
 

Joe NYC

Diamond Member
Jun 26, 2021
3,734
5,296
136
There isn't any way for one customer to push out the rest. If TSMC was planning on 100K wafer starts for A14 when mass production began and Nvidia came along and wrote them a megacheck for 100K wafer starts TSMC would use that money to expand capacity and they'd have 150K or 200K wafer starts when mass production begins.

That's why the claims that Apple is "buying up so much N2 capacity to hurt the competition" is stupid. Apple is buying the capacity they NEED. They aren't going to pay for huge numbers of N2 wafers just to toss a portion of them in the trash to screw the competition. Throwing away billions of dollars is no way to beat the competition! The fact it is ~50% of N2 capacity is because TSMC's had enough other committed or anticipated N2 orders for the rest. If Nvidia had come along and bought the same amount as Apple at the same time they did then it isn't like Apple would have 50% and Nvidia would have 50%. Apple would have 33%, Nvidia would have 33%, and there would be 33% for everyone else, because TSMC would have built more initial capacity.

There are some limits to this, TSMC couldn't jump from 100K wpm they are producing for N2 next year to 500K if someone had come along a few years ago with a check for 400K wpm, because at some point they'll be resource limited in building fab shells, equipping them with everything from plumbing to EUV machines, etc. But such concens are theoretical, TSMC would be able to handle any real world surge so long as orders are placed far enough in advance.

100% to this. TSMC, when facing strong demand from one customer, always seeks to expand the capacity, rather subtract it from other customers. And this practice has served TSMC very well. Not just TSMC, the entire fabless ecosystem as well.

Most of the "Magnificent 7" and number of others right behind them are all beneficiaries of this.