For at least two complete APU generations, AMD has had the ability to easily create an APU that was on par with 50 series NVIDIA GPUs of the prior generation. From Rembrandt, they had RDNA2 with the ability to use 16MB of infinity cache internal to the processor, requiring absolutely nothing extra from the OEM, save for a little extra power and decent processor cooling. It would have been near 2050 4MB performance. Yes, it would have required about 5-10% more die area than the processor they went with, but it would have commanded a comparable price premium. Phoenix and Hawk could have done the same and been quite close to 4MB 3050 performance.
Strix Point, instead of using a 4+8 config with dual CCXs could have used a regular 8 P cores config and used the saved space for a 16MB MALL cache and achieved near 4050 performance while not sacrificing a whole lot of MT performance and been a more flexible and compelling product overall.
Using OEMs as the excuse is a crutch. Yes, back in the Vega days, it would have required a more involved product, but since RDNA2, no. AMD is reaping the costs of not having their own fab and the volume limits that result.