If the rumors are true for a 440mm2 128bit memory bus etc based Navi 33, then I think is very difficult for Navi 32 (2x) to be below 690mm2 and Navi 31 below 940mm2 (including everything, infinity cache etc). In David Wang presentation there was a slide shown indicating +50% performance/watt initial target for RDNA3 vs RDNA2 (not present in Tech ARP video with the static split screen slides...) and also Rich Bergman indicated similar performance/watt improvements as with RDNA2 vs 1
We have not even got a chance to explore the limits of the new RX 6000 GPUs based on the RDNA 2 architecture, and AMD is already planning some impressive performance gains for the upcoming RDNA 3 cards that should release in late 2021. The performance uplift should be facilitated through MCM...
www.notebookcheck.net
I think AMD will be able to double (+100%) the performance/watt vs RDNA2 for some of the MCM designs so at 420W TBP (which I think is the limit for AMD reference designs with 3x 8pins) we will have 2.8X max increase in performance vs 6900XT.
So according to TPU Nvidia will need 2.6x the performance of 3090 to match Navi 31 at 4K:
The ZOTAC RTX 3090 AMP Extreme Holo is the longest graphics card we ever tested. At more than 350 mm long, this triple-fan, triple-slot monster has enough power to handle all the latest games at 4K resolution. The RTX 3090 GPU is highly overclocked, and the power limits have been raised, too.
www.techpowerup.com
I guess many people they don't realize that Nvidia (if rumors are true also and we are talking about a 5nm TSMC based design with 18432 Cuda cores etc) can do a Maxwell2->Pascal minor features facelift add 96MB of NV cache lol and still be at 471mm2 size EASILY at 5nm TSMC. And also the 2.6X is quiet doable for NV depending the clocks their designs can hit.
My prediction is 2 chip designs in 5nm TSMC and 3 designs with 5nm Samsung probably all with NV cache (for TSMC is certain imo) and we are talking about minor die area anyway.
1)TSMC 5nm 192 ROPs/18432CC 384bit memory bus with 96MB NV cache
2)TSMC 5nm 128 ROPs/12288CC 256bit memory bus with 64MB NV cache
etc...
The 3rd (Samsung based) will have higher than 3090 performance at $499 and the 2nd TSMC based will have around 1.7X higher performance according to my napkin calculations. I'm a little bit worried about the size of cache but if AMD can support 1080p with 16MB for Navi24 and with 32MB 1440p with only 2-3% hit on average then at 4K the 64MB will be enough with similar minor hit (also Nvidia will have much higher additional memory bandwidth through the regular bus vs AMD designs so maybe 96MB for 2880p and 64MB for 2160p will be enough with minor hit) So similar performance in the end at maybe half the die size? (or with sizeable features gains otherwise) is this a success for AMD?