3DVagabond
Lifer
- Aug 10, 2009
- 11,951
- 204
- 106
This is all guessing from my part :
I am wondering about how much clock cycles a given gcn instruction needs to complete when compared between the different gcn generations.
I am still trying to find some information about it, but it makes sense that AMD already optimized GCN as much as possible.
So, they cannot shave of much (if at all) clock cycles per instruction to increase IPC that way. Because usually for higher clocks, a longer pipeline is needed. I assume here that gcn is pipelined because that makes sense for a gpu which does the same iterative work over and over again. But then again, a new smaller process may allow for less stages for some instructions, so that these instructions complete faster with less clock cycles. This may limit of course the maximum clock speed. But if that is not necessary because the reachable clock speed plus IPC improvements are already enough.
Do you know if there is somewhere a guide about gcn instructionset ?
We could all compare the difference between 1.0,1.1 and 1.2 and make an educated guess how much gcn has improved and how 1.3 may have improved.
edit:
While looking, i found something right here from anandtech.![]()
http://www.anandtech.com/show/4455/amds-graphics-core-next-preview-amd-architects-for-compute/2
Don't get too hung up on the naming. AMD could have just as easily not called it GCN. Would that mean it's a more complete overhaul of the uarch? No. Just because nVidia uses different names of physicists between gens does that mean their uarch is a bigger update? Again, no. You know the saying, a rose by any other name?
