Dresdenboy
Golden Member
The integer cores have no direct control over what the FPU executes. The main point where the FPU handles only one thread during a cycle is when instructions are being dispatched to it (together with those instructions going to the int core belonging to this thread). But after they've been received, the FPU is handling instructions of both threads simultaneously, which means it can execute ops of both threads simultaneously. Only retiring the FP ops seems to be one thread per cycle again. Even AMD calls the FPU's execution mode "SMT":You may be mistaken here. Hasn't AMD said the FP unit can be used by only one thread at a time? In other words, the FP unit can't be 'shared' by the INT units.
I might try it out. But otherwise I might miss some entertaining postingsCHeck his posting history and after that use the neat forum option that has a list .
According to AMD this is with reduced area and power, leaving room for interpretations. So the CMP arch might consume more power to achieve that 2x mark.So the final question I have is this "80%" versus "180%" number. AMD slides clearly only state "80%"...and 80% x 2 = 1.6
So is the performance scaling in going from 1 thread in one module to having two threads in one module going to merely be 1x -> 1.6x for applications which would have otherwise scaled perfectly on a CMP architecture 1x -> 2x?