- Nov 8, 2011
- 392
- 0
- 0
Beware! All of the K-model Haswell CPUs deliberately lack an important performance feature for multi-threading: TSX. Therefore the 'flagship' i7-4770K may actually be slower than the i7-4770, even when overclocked. D:
http://www.realworldtech.com/haswell-tm/Generally, Intel’s TSX should be helpful for improving the programmability and scalability for concurrent workloads. Even with a modest number of threads, locks can easily limit the benefits from additional cores. While that is not a problem for 2-4 core processors, it is a much bigger factor going forward. Extremely popular applications such as MySQL have well-known locking issues that HLE or RTM could significantly alleviate.
Beware! All of the K-model Haswell CPUs deliberately lack an important performance feature for multi-threading: TSX. Therefore the 'flagship' i7-4770K may actually be slower than the i7-4770, even when overclocked. D:
Not necessarily. TSX speeds up synchronization between threads, and this functionality can be part of a library, a framework, or even the operating system. So software can benefit from without even needing adjustment.Software has to be written to take advantage of the feature.
It is comparable to Hyper-Threading in usefulness. Many of today's multi-threaded applications waste a lot of time trying to synchronize operations between threads. With 8+ threads, that's no easy task and can take up to 30% of all cycles. TSX eliminates the overhead of the cases where no synchronization was necessary. Games can definitely benefit from this too since they have to perform lots of different tasks within a matter of milliseconds.It may become an issue in the long-term, though... but that depends on the rate of adoption and whether there's any benefit in desktop applications and games.
It will matter to a lot more than 0.1%. What they've done is remove a performance feature from what's supposed to be their fastest CPU! :thumbsdown:It's strange how Intel chose to "cripple" the K-series by removing features that don't matter at all for 99.9% of the users, yet could potentially be very important for the remaining 0.1%. I guess they needed to find a way to penalize people for wanting to overclock.
David Kanter from RWT, who has expert knowledge in TSX, says it's marketing.What I would love to know is, did they (Intel) do this on purpose, purely for greedy marketing purposes, to stop people overclocking it, and using it as a cheaper version of their more expensive (sometimes) Xeon and/or 2011-socket chips OR ...
Are there genuine technical reasons for this, such as that the TSX instructions (and/or other disabled options, such as virtualization stuff) need more levels of gate delays (or similar), and so can't work at the 3.5 GHz (the fully enabled chips e.g. 4770 (nonK) are at 3.4 GHz I think) ?
Yup !What I would love to know is, did they (Intel) do this on purpose, purely for greedy marketing purposes, to stop people overclocking it, and using it as a cheaper version of their more expensive (sometimes) Xeon and/or 2011-socket chips OR ...
Nope !Are there genuine technical reasons for this, such as that the TSX instructions (and/or other disabled options, such as virtualization stuff) need more levels of gate delays (or similar), and so can't work at the 3.5 GHz (the fully enabled chips e.g. 4770 (nonK) are at 3.4 GHz I think) ?
David Kanter from RWT, who has expert knowledge in TSX, says it's marketing.
Yup !
Nope !
Exactly. So what's the point in getting a Haswell when you're not getting all of Haswell's features?It's no more "crippled" than Ivy or Sandy Bridge, though, since the feature debuted with Haswell.
So where's the benchies ?
Get some multipurpose lib or video something that's fully threaded to support TSX, and test clock for clock.
If when the first TSX enabled applications in daily use arrive - and the use is not more than 10% - i call red herring.
It'll be useful once we go +10threads, mainstream then and that's still sadly far out :/
In order to be a red herring it has to distract from an issue. What issue is being distracted from by pointing out that Intel intentionally removed a performance feature from a performance CPU? Who cares if nothing uses it yet. They made the feature so that people would use it. They need CPUs with that feature out there for people to use it. By lowering the number of CPUs with that feature out there, they are making it more difficult to make that feature become common. It seems pretty stupid to me.
Nice catch :biggrin:Obviously, you both are already using 4770K chips, if you had TSX it would have stopped you simultaneously posting at the some time (JOKE).
Its all about moar money with Intel, if they could they'd suck you dry instantly without remorseThanks, it is so annoying that they have disabled the functionality, just for marketing reasons (assuming that is the reason why).
So where's the benchies ?
Get some multipurpose lib or video something that's fully threaded to support TSX, and test clock for clock.
If when the first TSX enabled applications in daily use arrive - and the use is not more than 10% - i call red herring.
It'll be useful once we go +10threads, mainstream then and that's still sadly far out :/
Exactly. So what's the point in getting a Haswell when you're not getting all of Haswell's features?
The application also needs to have well-known locking issues for the feature to become relevant. Just because a game spawns multiple threads doesn't mean it has locking issues. It seems it may mostly affect database applications and such.
We'll just have to wait and see on this one. It's a brand new feature. It will probably be time to upgrade again by the time it becomes an issue (if it does).
That's my point.
And the feature will at base only really shine when being applied to 16 20 thread EP\EX Monster CPUs.
Until the pragmatical way of programming for ... "client\desktop" changes.
Intel TSX targets a certain class of shared-memory multi-threaded applications; specifically multi-threaded applications that actively share data. Intel TSX is about allowing programs to achieve fine-grain lock performance without requiring the complexity of reasoning about fine-grain locking.
However, if there is high data contention the algorithm would need to change in order to have an opportunity for high scalability. There are no magic bullets that can solve the problem, since true high data contention implies that the algorithm is effectively serialized.
but I guess you can upgrade
HLE allows the rate of modify-only transactions increase by a massive 5x - blowing both classic and even R/W locks out of the water! Applications that use many threads and locks will see a huge increase in performance when changing their locking to HLE.