8+8 ADL and 8+16 RPL and the hybrid approach in general has worked out well for Intel. Before you jump on me let me explain.
I'm just doing some Friday night random rambling here...
Intel is/was behind in process technology. The reality of this means they simply cannot afford the transistor count that AMD can with TMSC and have the resulting part make economic sense.
Now let me be clear here. Intel decided to put power efficiency on the back burner in favor of area efficiency. There is no free lunch, choices have to be made.
So they relied on the fact that many applications fall into two categories:
Category 1 include the application that rely on ST performance and only utilize a handful of cores. Games, DAW's, and many applications today still only really use 6, maybe 8 cores. The proof of this is that ADL and RPL with only 8 P cores compete well against AMD and their 16 core parts in to many applications. What's the ideal number of P cores? TBD. But I do know a core with 50 E cores would get smashed by AMD in most applications that aren't ridiculously threaded applications. Is 4 enough? Maybe. 6 might be where the market is now if Intel can dig another 15 or 20% IPC out of Raptor Cove.
The point is that as software MT gets better and better the need for the P core numbers decreases.
Category 2 are the applications that will load every core available like rendering and come encoding algorithms. So there is a need for the E cores in order to be competitive with these applications.
Now here's something to think about. What if many of these applications score just as well with 6 P cores as with 8P's?
This would lead one to the conclusion that if you took the transistors available for the P's and fabbed 6 P's, which use the same area but have more transistors than the 8P's, these new P's would have higher IPC and better overall performance than the 8P part where 2 or 3 of the P's are not getting loaded often.
In addition if the 16E's could see a big lift in IPC, say 25%, then we could see a reason for reducing the P count.
The BIG question here is how current software is distributed in terms of MT.
Made up numbers
10% of applications use 2 or less threads.
40% of applications use 3-4 threads.
25% of applications use 5-6 threads.
15% of applications use 7-8 threads.
5% of applications use 9-12 threads.
5% of applications will load every thread available.
So Intel would need to do some work to figure out what these numbers actually are and build chips to fit the curve.
AMD on the other hand can sit back on their superior process technology and just make 16P parts that for the moment fit the market will without any hybrid complexities and idiosyncrasies. Or perhaps it should be "idosyn crazies!"