GPU's scale up very well. So I'm not seeing a problem here.
GPU do have problems scaling, they have the same kinds of problems as scaling CPU count does, you just dont understand how GPU's work. If you wanted you could relate a GPU to Ryzen, a Shader engine (a CCX) has X number compute units ( cores), each compute unit has SIMD units within(just like CPU's). increasing any one of these has a scaling effect all the way up the stack. As it stands now AMD are "stuck" at 4 Shader engines and NV's equivalent is 6. This will be because this is the point they join the shared L2 along with all the other stuff that has to join the L2 like ROP's, MTU's this interconnect point is hard and very expensive to scale, as far as im aware they are full or partial meshs, not very nice for scale-abilty. Increasing unit counts within a SE or SIMD within a CU will just lower performance per unit unless you also scale your read/write bandwidth and outstanding request queues all the way up to main memory.
As a result unit count increases has slowed dramatically and per unit performance has been the target for increased performance. if your rolling with a DSBR this is even harder then a IMR (like AMD/NV) because you need a lot more storage space on your chip with tighter integration between separate GPU functional units .
There is no doubt Apple has the budget to do it ( but i doubt they have the revenue in the target market to make it worth while) but it will be a 3-4 generation 6-8 year project to get on par with AMD/NV, just like it was on the CPU side.