Zambezi's and Vishera's L3 is shared between all cores, and is a "mostly exclusive" cache, meaning it basically is a big victim cache for the L2s. What do you mean that a 1:1 size ratio is bad for a victim cache? Using a 1:1 sized L3 as anything but a victim cache would be absurd.
On an unrelated note, the place where large caches help the most for server workloads is in caching instructions, not data. Generally not a lot of data locality in server workloads.
Intel's eDRAM LLC is 128 MB and is a victim cache. IIRC, you want victim caches to be >> 1:1 - then again, my memory may just be off. 1:1 L3$:L2$ seems to be absurd to me - generally speaking. Apparently, based on preliminary results, Kaveri loses some FP performance without the victim cache, even compared to 1:1. That said, a small inclusive L3$ would be pointless.