- Jan 11, 2011
- 1,982
- 102
- 106
First-time poster, long-time lurker here. Given all the recent bad news (and rumors) circulating around AMD, I thought it would be interesting to talk about what the performance of these future parts might be, given what little we know about them.
I've given Bulldozer a lot more thought than Llano, so I'll give that a go first. It goes without saying everything I'm about to say is a gross over-generalization/ballpark estimate.
First of all, if we normalize a Star's core IPC to 1, I think it would be safe to define Nehalem and SB IPC to ~ 1.4 and 1.6 respectively.
Now, John Fruehe has said that we can expect 50% more throughout from 33% more cores from Interlagos. He has also been quick to point out that you can't compare server and client workloads, a point which I will address shortly.
So before we continue, I'll make one huge heroic assumption: That Interlagos will operate at the same frequency as the MC chip John Fruehe was referring to. Assuming this is the case, I find the same ~ 12.5% increase in IPC that has been posted over and over again. John is correct though, on the client side there will likely be less threads, at higher clock speeds. If you recall, AMD has said that a single thread running on a module will incur a penalty of ~ 20% if another thread is running on that module. Backing this out, BD performance of a single-thread on a single module should be about ~ 1.4, or about identical to Nehalem.
So to recap:
Stars: 1
Nehalem: 1.4
Sandy Bridge: 1.6
Bulldozer (2-thread/Mod): 1.125
Bulldozer (1-thread/Mod): 1.4
If these happens to be the case, how do things look for Bulldozer? Using this (very) simple model of performance, we'd rate a Gulftown CPU @ 8.4, and a 4-module BD @ 9 for multithreaded code (and identical for single), which is consistent with the rumor that Bulldozer will be very close to Gulftown performance, and not completely blown away (but solidly losing) to a hypothetical 6-core Sandy Bridge processor.
Of course, this is completely ignoring the actual clocking of the processors, while assuming again that Interlagos operates at the same clock speed as MC. That being said, I don't think anything I've said sounds too outlandish, given that Nehalem has been out for a while, if AMD wasn't planning/able to hit Nehalem-level IPC (or able to clock significantly higher) Bulldozer for all intents and purposes a failed project.
All of that being said, I think the worst case scenario is that when the modules are fully loaded, Bulldozer will be able to deliver Stars-level performance. If this is the case, then the numbers change to:
Stars: 1
Nehalem: 1.4
Sandy Bridge: 1.6
Bulldozer (2-thread/Mod): 1
Bulldozer (1-thread/Mod): 1.25
In this case, Bulldozer is still 'close' to Gulftown for multi-threaded workloads, and could conceivable compete in single-threaded performance if AMD manages to push the clocks high enough (4.5-5ghz).
Unless Bulldozer somehow manages to deliver worse performance than current AMD processors, Bulldozer should be competitive enough to at least keep AMD going until they can get Enhanced Bulldozer out in 2012 (pretty quick, considering BD is coming out in Q2).
As for Llano, I think it is going to surprise everyone at how competitive it actually is. If it can hit mid 4ghz on turbo with one or two cores and the graphics performance is there, I think it'll be competitive with the new SB mobile-CPUs. Of course, nothing is stopping Intel from ramping up the clock speed and pulling the plug out from Llano, but given the model refresh-rate of laptops, this probably isn't too large an issue, and probably a huge reason why AMD has been as silent as possible as to the performance of this part.
So, thoughts? Think I'm right, wrong, crazy?
I've given Bulldozer a lot more thought than Llano, so I'll give that a go first. It goes without saying everything I'm about to say is a gross over-generalization/ballpark estimate.
First of all, if we normalize a Star's core IPC to 1, I think it would be safe to define Nehalem and SB IPC to ~ 1.4 and 1.6 respectively.
Now, John Fruehe has said that we can expect 50% more throughout from 33% more cores from Interlagos. He has also been quick to point out that you can't compare server and client workloads, a point which I will address shortly.
So before we continue, I'll make one huge heroic assumption: That Interlagos will operate at the same frequency as the MC chip John Fruehe was referring to. Assuming this is the case, I find the same ~ 12.5% increase in IPC that has been posted over and over again. John is correct though, on the client side there will likely be less threads, at higher clock speeds. If you recall, AMD has said that a single thread running on a module will incur a penalty of ~ 20% if another thread is running on that module. Backing this out, BD performance of a single-thread on a single module should be about ~ 1.4, or about identical to Nehalem.
So to recap:
Stars: 1
Nehalem: 1.4
Sandy Bridge: 1.6
Bulldozer (2-thread/Mod): 1.125
Bulldozer (1-thread/Mod): 1.4
If these happens to be the case, how do things look for Bulldozer? Using this (very) simple model of performance, we'd rate a Gulftown CPU @ 8.4, and a 4-module BD @ 9 for multithreaded code (and identical for single), which is consistent with the rumor that Bulldozer will be very close to Gulftown performance, and not completely blown away (but solidly losing) to a hypothetical 6-core Sandy Bridge processor.
Of course, this is completely ignoring the actual clocking of the processors, while assuming again that Interlagos operates at the same clock speed as MC. That being said, I don't think anything I've said sounds too outlandish, given that Nehalem has been out for a while, if AMD wasn't planning/able to hit Nehalem-level IPC (or able to clock significantly higher) Bulldozer for all intents and purposes a failed project.
All of that being said, I think the worst case scenario is that when the modules are fully loaded, Bulldozer will be able to deliver Stars-level performance. If this is the case, then the numbers change to:
Stars: 1
Nehalem: 1.4
Sandy Bridge: 1.6
Bulldozer (2-thread/Mod): 1
Bulldozer (1-thread/Mod): 1.25
In this case, Bulldozer is still 'close' to Gulftown for multi-threaded workloads, and could conceivable compete in single-threaded performance if AMD manages to push the clocks high enough (4.5-5ghz).
Unless Bulldozer somehow manages to deliver worse performance than current AMD processors, Bulldozer should be competitive enough to at least keep AMD going until they can get Enhanced Bulldozer out in 2012 (pretty quick, considering BD is coming out in Q2).
As for Llano, I think it is going to surprise everyone at how competitive it actually is. If it can hit mid 4ghz on turbo with one or two cores and the graphics performance is there, I think it'll be competitive with the new SB mobile-CPUs. Of course, nothing is stopping Intel from ramping up the clock speed and pulling the plug out from Llano, but given the model refresh-rate of laptops, this probably isn't too large an issue, and probably a huge reason why AMD has been as silent as possible as to the performance of this part.
So, thoughts? Think I'm right, wrong, crazy?