I would strongly suspect that this means that the chip will have a healthy helping of L4 edram. From a performance perspective, it makes sense to have a smaller, but faster, L3 and a larger, but slower, L4. The real question is what is the communication penalty for an off chip, but on package, eDram cache and is the overall performance of the memory subsystem substantially higher despite this penalty (in typical server workloads). For my own purposes (highly parallel, memory intensive scientific-ish computing), it seems like a trade off that I would be more than happy making. And, as someone who has only recently branched out into this area from theory, it would certainly make it easier to cache optimize my data structures.