Haswell to support transactional memory in hardware


Ajay

Lifer
Jan 8, 2001
16,094
8,114
136
Actually yes, the instructions are backwards compatible. You can run this code today on a Sandy Bridge CPU; the new prefixes are simply ignored and it falls back to the old locking method.

So do compilers/runtimes supporting TSX insert a check for the CPU ID?
 

exar333

Diamond Member
Feb 7, 2004
8,518
8
91
Does TM require additional memory bandwidth over the existing locking methods, or is bandwidth just used more efficiently as the number of threads increases? My understanding is that this would have minimal impact with a small number of threads, but that the performance delta as the thread count goes up can be huge.
 

GammaLaser

Member
May 31, 2011
173
0
0
So do compilers/runtimes supporting TSX insert a check for the CPU ID?

For the HLE part of the extension, a separate codepath is not needed. HLE reuses existing instruction prefixes (REPNE and REPE, in this role named XACQUIRE and XRELEASE) that current hardware will ignore, because those prefixes have no defined meaning on lock-manipulation instructions.

On the other hand, the RTM part of the extension does require a separate codepath. RTM defines new instructions altogether (XBEGIN, XEND, XABORT), and older hardware will raise an invalid-opcode fault if it encounters them.
 

Cerb

Elite Member
Aug 26, 2000
17,484
33
86
Does TM require additional memory bandwidth over the existing locking methods, or is bandwidth just used more efficiently as the number of threads increases? My understanding is that this would have minimal impact with a small number of threads, but that the performance delta as the thread count goes up can be huge.
Transactional memory is one big step away from intricate procedural programming for space- and logic-starved hardware, towards predicated programming for bandwidth-starved, logic-rich hardware. In pure software, transactions that edit common data structures can spend more time in the STM machinery than doing real work, which is where hardware support for the grunt work should help immensely. Whether it uses more or less bandwidth will depend on the implementation (software more than hardware, provided you have hardware support) and on the specific kind of work the software does (in particular, how likely are cache-line crossings on a regular basis?). It should use less, normalized to the same amount of results, but claiming it will across the board would, I think, be a bit naive.

If managing locks is eating a bunch of CPU time that could go towards doing real work, I would expect substantial gains. If tuning locks for performance carries too much risk under requirements-driven design changes over time (many a small business's dev team will be in this category), I would also expect substantial gains. Beyond that, however, it's another available tool, and whether it eats up more time/bandwidth/energy is all in how it's used. Shared-memory procedural languages (i.e., the common in-demand ones, like C++, C#, and Java) will still give programmers all the rope they need to hang themselves, IMO, as part of letting them make the hardware sing and dance.