Socket7: The L2 cache was on the motherboard and ran at FSB speed.
When CPUs started hitting getting fast this became very slow, a 200Mhz P55C ran the L2 cache at 66mhz..that's a 1/3 divider..remember what happened to AMD with teh classic athlon when it got it's 1/3 divider..the performance got choked out....sooo enter:
Slot1: The L2 cache is moved into the processor cartridge so it can run at 1/2 CPU speed and be much faster than the 66Mhz the FSB allowed (it would be 133 today, but that would still be horribly slow for a P3-1000)
This helped them alot for performance, but later it became evident that even 1/2 speed cache was to slow so they transferred it ondie to full speed. When they did this the PCB in the CPU cartridge no longer had anything on it but the CPU...so they transferred over to a new socket so that they could cut manufacturing costs, by not using that PCB and cartridge.
Thus we are now at Socket370.
Replace Slot1 with SlotA, and Socket370 with SocketA and you get the AMD story, it's much the same (so long as you ignore the K6-3, which had ondie cache already, but because it had massive 256k L2 and 64k L1 caches it was rather costly to produce on a .25um process, especially with the bad yeilds AMD was cursed with for the K6-x)
so P-MMX, P2/3, Coppermine or K6-2, K7, Tbird it goes Socket/Slot/Socket but each step increases cache speed.