Van Gogh in the Steam Deck is about 1.6TFLOPs for about 10W GPU only, and that's with half the ALUs. Rembrandt (next gen APU) is 768ALUs, and with the same GPU power if my estimation's right should hold about 1.3GHz with GPU power locked at 10W, which gives you about 1.9TFLOPs.
And that's assuming it has identical V/f properties to my 6700XT, but realistically it should be safe to assume an extra 200-300MHz worth of clocks from additional optimisations and actual binning, something that doesn't take place with neither Van Gogh nor Navi22 silicon currently.
By which point you're looking at over 2.2TFLOPs, which actually is now actually in line with the 10-15% process improvement from N6 -> N5.
Oh, and let me again remind you that Apple still holds an ALU advantage here, meaning they can clock their iGPU lower and get the same performance, which by nature brings an efficiency improvement on it's own.
So then, where does your 2-3x efficiency advantage come from?