To contrast GB5 with CBR20 (4.4 GHz, 1.344v-1.38v)
CBR20:
ST: ~49W package power
MT: ~162W package power
GB5:
ST: ~42W average, peaked at 46W in Structure from Motion
MT: All over the place, so an average seems pointless, but it was ~91W. Ray tracing seemed to push power up to around 142-147W, while Structure from Motion hit the 130s.
Pretty sure Primate Labs claims to use AVX, but . . .
On the ST side the power usage is pretty similar. Cinebench is NOT a general purpose benchmark; it tests only one thing and is pretty meaningless if what you do isn't that one thing or closely related to it. Geekbench and SPEC run a variety of tests to try to form more of an average picture of performance across a variety of tasks. Some tests (especially ones with portions that are mostly cache bound, so the memory controller isn't exercised as much) will end up using less power than others.
For instance, if you test a database load versus a heavy streaming load (which I assume Cinebench is, though I haven't really looked at what it tests because it isn't in the realm of stuff I care about), you will see the database load use a lot less power on a CPU with a lot of cores. It isn't because the database load isn't stressing it; it is because databases can't effectively use all cores all the time due to locking and such. Tasks that are considered "embarrassingly parallel", i.e. those that will benefit from more cores assuming they can get enough memory bandwidth, will burn more and more power up to the package max because there are no inter-thread dependencies.
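To put rough numbers on that, here is a quick back-of-the-envelope sketch in Python. It treats the locked portion of a workload as a serial fraction in Amdahl's law and reads the resulting speedup as the average number of busy cores. The function names, the 20W idle figure, the 9W per busy core figure, and the 20% serial fraction are all made up for illustration; they are not measurements from any real CPU or benchmark.

```python
def average_busy_cores(cores: int, serial_fraction: float) -> float:
    """Amdahl's-law speedup, which also equals the average number of busy cores.

    serial_fraction is the share of the work that only one thread can run
    at a time (e.g. time spent holding a lock). 0.0 means embarrassingly parallel.
    """
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / cores)


def estimated_package_watts(
    cores: int,
    serial_fraction: float,
    idle_watts: float = 20.0,          # hypothetical uncore/idle power
    watts_per_busy_core: float = 9.0,  # hypothetical per-core power under load
) -> float:
    """Very crude model: idle power plus a fixed cost per busy core."""
    return idle_watts + watts_per_busy_core * average_busy_cores(cores, serial_fraction)


if __name__ == "__main__":
    cores = 16
    # Streaming/rendering style load: no inter-thread dependencies.
    streaming = estimated_package_watts(cores, serial_fraction=0.0)
    # Database style load: assume 20% of the work is serialized by locking.
    database = estimated_package_watts(cores, serial_fraction=0.2)
    print(f"streaming-like load: ~{streaming:.0f} W")
    print(f"database-like load:  ~{database:.0f} W")
```

With those assumed numbers, the lock-bound load keeps only about 4 of the 16 cores busy on average, so it lands far below the package max even though it is running flat out. Real chips are messier (boost clocks, AVX offsets, memory controller power), so take this as a cartoon of the effect and not a predictor.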