Can't agree with You here. Sandy Bridge WAS revolutionary, even if it was undersold at that time by Intel. It was design that moved from P6 ( PPRO ) legacy, to modern core with different OOP machine based on Physical Reg File ( as in from actual registers in OOO machinery to pointers to said registers in PRF ) and added uOP cache.
Haswell, Skylake, Sunny Cove and Golden Cove have all the same machinery of Sandy Bridge and kept piling "more" on to it.
So in evolution of P6->P4->P6 based C2D->Sandy Bridge we are still on that uArch. Widening instruction fetch and widening decoders is loooong overdue, just like adding 5th ALU, but that does not make any revolutions, rather belated evolution of necessity.
Oh, and lets wait for actual chip release before we evaluate how good that 6-wide decode is in practice. It could be as
bad retarded as having 1-complex + 5 simple decoders, that is 6-wide when all stars align in instruction stream