I think Phynaz meant that AMD should do what you are saying -- send a Zen chip to a popular website and let them go to town showing how it curb stomps Broadwell-E![]()
They will, once it s launched, or did any manfacturer of anything send an ES to any site before the product was launched, tell us when this already happened, i m much interested to see if any other firm ever bowed to such irrealistic and irrational demands...
The question is quite simple: you cannot use all available ports all the time, even with such kind of favorable code.
Not sure that your statement is not self contradictory..
In Blender it is obvious that HW for instance doesnt execute 2FP MUL or 1 FP MUL + 1 FP ADD for a single thread each cycle, otherwise there wouldnt be enough ressource left to gain 50-60% when pushing a second thread in the same core, the ratio suggest that the code has dependencies such that only 1.3 FP ops per thread and per cycle are executed.
This explain both HW huge SMT gain and Zen inability to do much better than BDW despite a more adequate FPU.
If each cycle the first thread does 1.3 FP then Zen could theoricaly execute 2.6 FP ops/cycle when using SMT, that is 30% more than Haswell, but of course this would require that the ops repartition in the code is such that a unit comprising 2 FP MUL + 2 FP ADD could provide 30% more ops/cycle than the unit comprising 2 FP MUL or 1 FP MUL + 1 FP ADD, wich is of course unlikely, for instance if the ops are mainly FP MULs and few FP ADDs then the two cores will yield about the same throughput in both ST and SMT..
Last edited: