- Sep 23, 2011
- 2,023
- 275
- 126
+ some ISA'sImportant differences between S.I. and C.I. GPUs
Multi queue compute
Lets multiple user-level queues of compute workloads be bound to the device and processed simultaneous. Hardware supports up to eight compute pipelines with up to eight queues bound to each pipeline.
System unified addressing
Allows GPU access to process coherent address space.
Device unified addressing
Lets a kernel view LDS and video memory as a single addressable memory. It also adds shader instructions, which provide access to flat memory space.
Memory address watch
Lets a shader determine if a region of memory has been accessed.
Conditional debug
Adds the ability to execute or skip a section of code based on state bits under control of debugger software. This feature adds two bits of state to each wavefront; these bits are initialized by the state register values set by the debugger, and they can be used in conditional branch instructions to skip or execute debug-only code in the kernel.
Support for unaligned memory accesses
Detection and reporting of violations in memory accesses
seems that amd made more of changes in compute.... looks like thay can have up to 8 ACEs right now, and each ACEs can use 8 comand lines (was only 2 in 7000 series)
Sea Islands ISA Manual