Question Zen 6 Speculation Thread

Page 364 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Joe NYC

Diamond Member
Jun 26, 2021
4,253
5,879
136
Why not on both sides for cache galore?

So it's 48MB L3 in one plane and only 2*64MB underneath the whole CCD area (L3+12 cores)?
Doesn't seem logical or am I missing something...

Edit: correction

AMD likes to use 4MB L3 on CCD die and 8 MB on V-Cache die. Which will result in Zen 6 with 12 cores having 48MB + 96 MB = 144 MB of L3

If there were to be another layer of L3, it would likely be identical V-Cache die with 96 MB of SRAM.
 

Doug S

Diamond Member
Feb 8, 2020
3,856
6,810
136
They are not. The DQ pins are not the only pins on the interface. LPDDR5X has 72 active signals per 32-bit dual channel controller, while LPDDR6 has 84 active signals per 48-bit dual channel (4x half channel) interface. 3/2 data signals but only 7/6 times the pins. Or, 96-bit LPDDR6 uses only 168 signals, while 96-bit LPDDR5X uses 216. Even after you adjust for the 8/9 loss of efficiency from sharing the DQ pins, LPDDR6 comes out ahead.

LPDDR6 is a neat and efficient design.

Wow I didn't realize they'd economized on the pins so much. Will that efficiency help the physical size of an LPDDR6 only controller vs the equivalent width of LPDDR5X only controllers, or just shoreline? I'm guessing Synopsys probably doesn't have LPDDR6 only IP yet to directly make such a comparison. Or do they?
 
  • Like
Reactions: lightmanek

Tuna-Fish

Golden Member
Mar 4, 2011
1,697
2,617
136
Wow I didn't realize they'd economized on the pins so much. Will that efficiency help the physical size of an LPDDR6 only controller vs the equivalent width of LPDDR5X only controllers, or just shoreline? I'm guessing Synopsys probably doesn't have LPDDR6 only IP yet to directly make such a comparison. Or do they?
Yeah, I'm not aware of any apples-to-apples comparisons. I would assume that LPDDR6 PHYs need to be a bit bigger on old process tech, because the CA signals are driven at full speed so there are more high-speed single-ended pins, but I'm really not sure. The controllers are probably a bit bigger because they are probably doing more things.
 

MoistOintment

Member
Jul 31, 2024
164
263
96
That would certainly be a dream for AMD if they managed to sucker Intel into L3 size competition, where AMD would be stacking cheaper dies with low latency SRAM, while Intel is ballooning the N2 die size and increasing latency.
I imagine Intel's goal is to eventually stack the cache, but they needed a large L3 sooner than they could implement stacking
 

OneEng2

Golden Member
Sep 19, 2022
1,013
1,212
106
I imagine Intel's goal is to eventually stack the cache, but they needed a large L3 sooner than they could implement stacking
Is there any AMD patent that would prevent them from doing that? Since the process is being done at TSMC, it is possible that AMD has exclusive rights to the process at TSMC for some time window as well which would force Intel to either wait for the time window to expire, or implement it on their own (already troubled) process.
 

adroc_thurston

Diamond Member
Jul 2, 2023
8,671
11,373
106
Is there any AMD patent that would prevent them from doing that? Since the process is being done at TSMC, it is possible that AMD has exclusive rights to the process at TSMC for some time window as well which would force Intel to either wait for the time window to expire, or implement it on their own (already troubled) process.
No they don't, and Intel stacks caches for CWF/DMR anyway.
It's just their hybrid bonding yield sucks, so client guys NOPE'd out of eLLC in favour of normal 2D slab.
 

Joe NYC

Diamond Member
Jun 26, 2021
4,253
5,879
136
Is there any AMD patent that would prevent them from doing that? Since the process is being done at TSMC, it is possible that AMD has exclusive rights to the process at TSMC for some time window as well which would force Intel to either wait for the time window to expire, or implement it on their own (already troubled) process.

Intel and AMD also have a patent cross license agreement.
 
  • Like
Reactions: lightmanek

LightningZ71

Platinum Member
Mar 10, 2017
2,704
3,397
136
I have to wonder if thermals were also an issue for Intel client. It STILL looks like Intel needs to pull a lot of juice to hit their performance targets and I have to wonder if they could reliably stack cache at the thermal loads required for their Fmax targets?
 
  • Like
Reactions: lightmanek

adroc_thurston

Diamond Member
Jul 2, 2023
8,671
11,373
106
I have to wonder if thermals were also an issue for Intel client. It STILL looks like Intel needs to pull a lot of juice to hit their performance targets and I have to wonder if they could reliably stack cache at the thermal loads required for their Fmax targets?
Relaxing the PT a bit for stacked parts is pretty easy.
Also they did build PVC and are doing CWF et al so they're familiar with thermals in 3D at least.
 
  • Like
Reactions: booklib28

Doug S

Diamond Member
Feb 8, 2020
3,856
6,810
136
Is there any AMD patent that would prevent them from doing that? Since the process is being done at TSMC, it is possible that AMD has exclusive rights to the process at TSMC for some time window as well which would force Intel to either wait for the time window to expire, or implement it on their own (already troubled) process.

If there are any patent issues, wouldn't TSMC patents be more likely the hurdle - at least in the future when Intel wishes to reduce reliance on TSMC fabs?
 
  • Like
Reactions: OneEng2

Joe NYC

Diamond Member
Jun 26, 2021
4,253
5,879
136
It is always an exact match since they moved to wafer on wafer stacking.

BTW, strange that AMD never announced it (as far as I know), nobody asked in any interview.

Last time Max from High Yield did a video on this, he only speculated, and his conclusion was that AMD is still not using Wafer on Wafer packaging, and still using intermediate step of using carrier wafers.
 

OneEng2

Golden Member
Sep 19, 2022
1,013
1,212
106
I have to wonder if thermals were also an issue for Intel client. It STILL looks like Intel needs to pull a lot of juice to hit their performance targets and I have to wonder if they could reliably stack cache at the thermal loads required for their Fmax targets?
Good point. It's currently stacked on top correct?
 

adroc_thurston

Diamond Member
Jul 2, 2023
8,671
11,373
106
BTW, strange that AMD never announced it (as far as I know), nobody asked in any interview.
Pretty sure someone (maybe Cheese) did and they pokerfaced it.
Last time Max from High Yield did a video on this, he only speculated, and his conclusion was that AMD is still not using Wafer on Wafer packaging, and still using intermediate step of using carrier wafers.
AMD doesn't do "intermediate steps", they drive the goddamn roadmap at TSMC.
 
  • Like
Reactions: lightmanek

Joe NYC

Diamond Member
Jun 26, 2021
4,253
5,879
136
Pretty sure someone (maybe Cheese) did and they pokerfaced it.

AMD doesn't do "intermediate steps", they drive the goddamn roadmap at TSMC.

If that's the case, then there should further proliferation of V-Cache CPUs. With Zen 6, V-Cache die will have higher utilization of the V-Cache die, with 96MB on roughly the same die as 64MB on Zen 5.

From the recent presentations, looks like 12 core CCD models will likely have it, 32 core CCD models will likely not have it. Competing with AMD in server space will so much harder.
 
  • Like
Reactions: lightmanek

Joe NYC

Diamond Member
Jun 26, 2021
4,253
5,879
136
It means they get higher margins and you get uhhh.

you have no idea the nightmares about to happen.

I wonder how much there is to this:
- Subtweet: A comment on competitiveness of Arm vs. x86, especially in orchestrating GPU tasks to improve GPU utilization
- Substack article about hyperscalers vertical integration ability to compete with Merchant silicon in both CPU and GPU