Depending on how the cache is laid out, you can increase the capacity without impacting the clock speeds overly much, but there are design trade offs.
Probably the easiest change would be to double the line length, which wouldn't affect the timing too much assuming you double the bus size so that loading or evicting an entry doesn't take any longer. Of course that bigger bus is more power hungry and needs more transistors.
Of course there's probably not that much of a performance uplift from increasing the line size. Most x86 CPUs have been using 64 byte cache lines for decades. If there was a big enough performance gain to justify doubling it, someone would have done so by now.
I suspect that a big part of why Apple has designed their L1 cache to be so large is because they want to be able to keep more bits of code from a larger number of applications in the cache so that when people move between them, the performance is much snappier. Their clock speed being more moderate enables this, but there aren't that many applications that need a vast L1 cache whereas every application needs a fast L1 cache. You only really want to increase the size when this can be done without paying any cost to access time. Otherwise it's not worth it.