# An instruction to speed up x87 floating point to integer conversion
# Five instructions to improve the efficiency of loading, moving and duplicating SIMD data, useful in complex arithmetic algorithms
# An instruction to avoid cache line splits when loading data, useful in certain video compression applications
# Four instructions to enable more efficient handling of arrays of structures. This is useful in 3D graphics, particularly when processing vertex buffers.
# Two instructions that help manage thread synchronization, which will in turn improve Hyper-Threading performance.