Det0x
Golden Member
- Sep 11, 2014
- 1,465
- 4,999
- 136
Alright, thanks. So they need to have a clock domain crossing between the cores and the ring. For Intel, the ring doesn't peak as high, but it's still in the ballpark by a GHz or so. Unlikely to make a huge difference either way.L3 / ring runs at peak core-clockspeed for any given CCD
View attachment 83176
Guess i showed the wrong screenshot if you gonna take everything in bad faith. Oncourse the difference is not that big in this scenario since the 7800x3d is among the lowest clocking Zen4.
View attachment 83178
Do i need to dig up a 6ghz screenshot of plain old regular Zen4 ?
That second sentence was "For Intel"....Guess i showed the wrong screenshot if you gonna take everything in bad faith. Oncourse the difference is not that big in this scenario since the 7800x3d is among the lowest clocking Zen4.
View attachment 83178
Do i need to dig up a 6ghz screenshot of plain old regular Zen4 ?
My mistake, I were too quick on the trigger, need to read more thoroughly next timeThat second sentence was "For Intel"....
I have the opposite impression. GLC is a terrible core. Bloated and power hungry compared to its competitors. Add on a less than ideal process and core count deficit, plus a sprinkling of bugs, and you can more or less explain SPR's problems without needing to look at the memory subsystem.
Ah, so you're thinking that the current L1 size will remain the floor, but they'll add a larger cache tier / readjust the hierarchy above it? Because when I hear "L0 cache", I'm thinking something smaller and lower latency than the L1, with the rest more or less left the same.Original incarnation in ADL was not that good, RPL increased L2 cache to 2MBs, improved L3 speeds a little bit and it was decent already. Vs Zen3 on similar process.
Massive AVX512 machinery that got disabled is of course plain stupid, even more so when Zen4 showed how it's done properly with 256bit units without blowing up area and power that much.
GLC scales real well with memory latency, when it's OoO capacity is used to execute instructions instead of waiting for memory. It would really shine coupled with AMD's speedy L3, even more so with stacked L3.
And we arrive to starting point of this whole discussion - Intel is not blind to these problems, and they are doing something about it with additional cache level and most likely rebalance of current memory subsystem.
There are multiple ways to arrive to their desired destination of "feeding GLC or even wider core", but i think fundamentals remain the same:
1) L1 cache has to stay 48 if TLB basic unit stays 4KB, they can't do ARM hack of 128KB L1D cache
2) They need massive "2nd" level cache to increase performance and increase power efficiency by not hitting "uncore". At some point increasing core private L2 stops making sense i guess, it becomes better to share cache transistors between cores and burn tiny amount of transistors to ensure "fair" sharing between cores in cluster to please the cloud crowd.
So 16-32MB "L2" cluster => 2X cycle latency => need to insert additional cache level to not destroy GLC performance completely. Something like 10-12 cycle L2 of 256-512kb is just right for this task, and it makes some sense to make it inclusive in next level.
That's my reasoning and attempt to make sense of that "L0" news.
Xino claims it's what @JoeRambo is claiming is going to happen. L1 as the floor, an 'old capacity L2 cache' (I'm guessing 512KB or something), then the regular L2 which is going to be, prob, at least 2MB and then the L3.Ah, so you're thinking that the current L1 size will remain the floor, but they'll add a larger cache tier / readjust the hierarchy above it? Because when I hear "L0 cache", I'm thinking something smaller and lower latency than the L1, with the rest more or less left the same.
Yep, this is my thought. But don't hold your breath.A pitty they don't work on a MTL refresh using Intel 3, or is there a chance MTL refresh could use Intel 3? I guess not. If they really intend to use Intel 4 MTL for ARL-U we might see ARL with a big variation of process node and architectures. Some people are telling 20A is still planned for some SKUs. Maybe N3B for 8+16, 20A for 6+8 and Intel 4 for 2+8.
Likely little more than ASUS licensing/buying the brand. At best they keep using the same ODM that Intel used (though with the move to plastic housing I can't say I care much, the aluminium ones before that were nice).NUC isn't dead, kinda?
"L0 cache below L1" => the real question is how would that cache really operate.Ah, so you're thinking that the current L1 size will remain the floor, but they'll add a larger cache tier / readjust the hierarchy above it? Because when I hear "L0 cache", I'm thinking something smaller and lower latency than the L1, with the rest more or less left the same.
Xino claims it's what @JoeRambo is claiming is going to happen. L1 as the floor, an 'old capacity L2 cache' (I'm guessing 512KB or something), then the regular L2 which is going to be, prob, at least 2MB and then the L3.
- Wonder if Intel's reluctance to talk about ARL (even in comparison to LNL) is due to them having troubles with the development of ARL.
Ye, I think 2MB really is the minimum here. Interesting how both Zen 5 and LNC look to be getting some big changes in their cache subsystem (Zen 5 increased L1 and 'ladder' (istg if it's just mesh lol) cache.Same latency/size considerations apply i think to this small L2, without making next level way larger than current 2MB it does not make much sense to insert that additional levels IMO.
ARL-P is like the one sku rumored to use 20A lol. The 6+8 die. Are you referring to the ARL-U not being real ARL bcuz of MTL-R or something filling up that lineup? In that case, I wouldn't be surprised if LNL fills up the premium 'U' segment, considering it looks like the 'U' segment base TDP is 15 watts.I always wondered why they talked much more about LNL and basically nothing about ARL. And now it turns out ARL-U won't even using a real ARL. What about ARL-P, maybe it's the same?
ARL-P is like the one sku rumored to use 20A lol. The 6+8 die. Are you referring to the ARL-U not being real ARL bcuz of MTL-R or something filling up that lineup? In that case, I wouldn't be surprised if LNL fills up the premium 'U' segment, considering it looks like the 'U' segment base TDP is 15 watts.
More evidence that some of these 'leakers' are literally just making guesses.
Eh, even there, they propose a way to make a bigger L1. They just dismiss it as too big and power hungry. In general, it's a poor bet to assume there's no engineering solution to these sorts of scaling problems.is good read if you want to know why. Apple is getting out of it by having 16KB pages and therefore 128KB L1D works for them, they'd have 32KB L1D if they were on x86 with similar setup.
I'm not convinced the engineers are unaffected. Certainly they've made drastic cuts to engineering as well. But at a certain point, hard to distinguish what "wave" of layoffs they're at, and how many actually remain.More Intel cost cutting. At least it's not the engineers...
That would make sense, since LNL would clearly be the more premium of the two. Hard to say what TDP range this "MX" line will target, but if nothing else, 4+4 vs (a theoretical) 2+8, probably isn't a hugely meaningful difference for this market.I wouldn't be surprised if LNL fills up the premium 'U' segment, considering it looks like the 'U' segment base TDP is 15 watts.
If there are unique troubles with one of the two, it would have to be LNL. I think they're just talking about LNL more because it's so different from MTL.Wonder if Intel's reluctance to talk about ARL (even in comparison to LNL) is due to them having troubles with the development of ARL.
Ian reported a new round of marketing lay offs in the past 24 hrs (yesterday). I hope it doesn't affect engineers in this 'round'. Not that I hate the people who have jobs in marketing, I mean it much suck getting laid off, but like Intel needs their engineers more than ever.'m not convinced the engineers are unaffected. Certainly they've made drastic cuts to engineering as well. But at a certain point, hard to distinguish what "wave" of layoffs they're at, and how many actually remain.
That would make sense. It appears as if LNL's uncore changes are the main star of the show, so even if LNC is a bit lackluster, LNL as a whole could still be really impressive.If there are unique troubles with one of the two, it would have to be LNL. I think they're just talking about LNL more because it's so different from MTL.