IntelUser2000
Elite Member
- Oct 14, 2003
- 8,686
- 3,786
- 136
But Golden Coves cache bandwidth is far greater than Sky Lake's right? Usually higher bandwidth also means higher latency, though correct me if I'm wrong.
There isn't a direct relation.
However, you could optimize in a certain direction. Sacrificing latency to get higher bandwidth is a switch in mindset, from single thread to multi-thread optimization. So when they moved from L2 being the LLC to L2 being private and L3 being the LLC with Nehalem, it was a push towards better multi-threaded performance. L3 on Nehalem would be slower than L2 on Core 2 in single threaded applications for example.
With multi-threaded applications, you need the bandwidth of the caches to scale with cores to keep it scaling.
They are on a transitional phase to move away from the endless Skylake stagnation so there will be a lot of improvements, but on theory what I said would apply.
That's the crux. Intel had some catching up to do there (all the Skylake clones had excellent cache latency) so it's important it does just that.
Another thing is that Skylake likely wasn't meant to scale above certain core counts. The greatest benefit of the ring bus was it's simplicity. So for relatively low core counts it beat other implementations even though in theory it didn't sound so fast.
The greatest single ring stop they've done was 12 I think? Not so simple anymore. In Raptorlake they'll reach that number again. Also the push for ridiculous frequency doesn't help.