Skylake-EP/EX 28 Cores L3-Cache 38.5 MB

csbin

Senior member
Feb 4, 2013
908
614
136
https://www.computerbase.de/2016-09/skylake-ep-28-kerne-server-cpu-cache/



 

Lepton87

Platinum Member
Jul 28, 2009
2,544
9
81
Why does it have so little L3 cache? Broadwell Xeons top out at 60 MB of L3, so this is a serious reduction. Maybe they increased the speed of the cache: Xeons with lots of L3 cache have slower L3 than the lower-end SKUs.
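
For a rough sense of the per-core difference, here is a back-of-the-envelope sketch. The 28 cores and 38.5 MB come from the thread title and the 60 MB from the post above; the 24-core count for the top Broadwell part is an assumption not stated in this thread, and `l3_per_core_mb` is just an illustrative helper name.

```python
def l3_per_core_mb(total_l3_mb: float, cores: int) -> float:
    """Return the L3 capacity per core in MB, assuming an even split."""
    return total_l3_mb / cores

# Figures from the thread title.
skylake = l3_per_core_mb(38.5, 28)

# 60 MB is from the post above; 24 cores is an assumed core count.
broadwell = l3_per_core_mb(60.0, 24)

print(f"Skylake-EP/EX: {skylake} MB of L3 per core")
print(f"Broadwell:     {broadwell} MB of L3 per core")
```

Under those assumptions the per-core slice shrinks as well, not just the total.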
 

daniel1926

Junior Member
Feb 18, 2015
23
1
11
I strongly suspect this means the chip will have a healthy helping of L4 eDRAM. From a performance perspective, it makes sense to have a smaller but faster L3 and a larger but slower L4. The real question is what the communication penalty is for an off-chip, but on-package, eDRAM cache, and whether the overall performance of the memory subsystem is substantially higher despite this penalty (in typical server workloads). For my own purposes (highly parallel, memory-intensive scientific-ish computing), it seems like a trade-off that I would be more than happy to make. And, as someone who has only recently branched out into this area from theory, it would certainly make it easier to cache-optimize my data structures.
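
To make the trade-off concrete, here is a minimal sketch of the usual average-access-latency model. Every latency and hit rate below is a made-up placeholder, not a measured or leaked figure for any real chip, and `average_access_latency` is a hypothetical helper name.

```python
def average_access_latency(levels, dram_latency_ns):
    """Average latency of an access that has already missed the inner caches.

    levels: list of (lookup_latency_ns, hit_rate) tuples, ordered from the
            nearest cache level to the farthest. hit_rate is the local hit
            rate of that level.
    dram_latency_ns: extra cost paid when every listed level misses.
    """
    total = 0.0
    reach = 1.0  # fraction of accesses that get as far as the current level
    for latency_ns, hit_rate in levels:
        total += reach * latency_ns   # everyone reaching this level pays its lookup
        reach *= 1.0 - hit_rate       # only the misses continue outward
    return total + reach * dram_latency_ns

# Hypothetical configuration A: one big, slower L3, no L4.
big_l3 = average_access_latency([(20.0, 0.5)], dram_latency_ns=100.0)

# Hypothetical configuration B: smaller, faster L3 backed by an eDRAM L4.
small_l3_plus_l4 = average_access_latency(
    [(16.0, 0.25), (40.0, 0.5)], dram_latency_ns=100.0
)

print(big_l3, small_l3_plus_l4)
```

Which configuration wins depends entirely on the hit rates and on the L4 penalty, which is exactly the open question about an off-chip, on-package cache.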
 

Edrick

Golden Member
Feb 18, 2010
1,939
230
106
IBM's Z series (mainframe) and Power series systems already use this approach. I would not be surprised to see Intel follow suit.

I also suspect that Skylake Xeon will have 512 KB of L2, which is needed for AVX-512.
 

2blzd

Senior member
May 16, 2016
318
41
91
So the 1151 Kaby Lakes get Thunderbolt 3 and the higher-end Kaby Lake-X parts do not?

makes sense.