• We’re currently investigating an issue related to the forum theme and styling that is impacting page layout and visual formatting. The problem has been identified, and we are actively working on a resolution. There is no impact to user data or functionality, this is strictly a front-end display issue. We’ll post an update once the fix has been deployed. Thanks for your patience while we get this sorted.

Steamroller core

Olikan

Platinum Member
slides:

AMD-Hot_Chips_Symposium-Steamroller,C-Q-350378-13.png

AMD-Hot_Chips_Symposium-Steamroller,C-R-350379-13.png

AMD-Hot_Chips_Symposium-Steamroller,C-S-350380-13.png

AMD-Hot_Chips_Symposium-Steamroller,C-T-350381-13.png

sr_sl05.jpg


http://www.tomshardware.com/news/AMD-Steamroller-Piledriver-Kaveri-processors,17217.html
http://www.pcper.com/reviews/Processors/AMD-Unveils-Steamroller-Improvements
 
Last edited:
Steamroller: back to K8 level of performance? Lol. We can only hope.

Why are we seeing Steamroller slides when Vishera isnt even out yet? Are they trying to prepare us for a disappointment by hyping Steamroller?

Sorry I'm just jaded on AMD after they hyped BD up to much and it turned out to be the return of the Pentium 4.

EDIT: Its interesting that the slides seem to focus on increasing single threaded execution,particular integer execution, via faster instruction decoding and especially faster integer execution. Here is an idea AMD - if you didnt remove all of this great stuff from K10 to make BD, maybe you wouldnt need to go through the trouble of adding it back to Steamroller?
 
Last edited:
I hope it lives up to the hype. Competition is always welcome. Will be interesting to see how the APUs do.
 
"Major improvements in store handling" and "Dynamic resizing of L2 cache" are the two most interesting parts to me. Do you think they're abandoning "write-through" for the L1-D cache? Also I hope that the dynamic L2 resizing is going to be a performance enhancement option, and not just a power savings technique.
 
At a store near you in 2013-2015 or perhaps later judging by AMD's history.

Hopefully it's a late 2013 release at the latest.
 
Sorry if I missed in in the article, but does anyone know if this is an AM3+ part, or a new socket? I got a long useful life out of my AM2+ (still on it in fact), so I'm just curious.
 
Steamroller: back to K8 level of performance? Lol. We can only hope.
Be serious please. Bulldozer is faster than K8,it's around the level of first gen K10 core (65nm Barcelona). This core is ~10-15% faster than K8 in integer and 1.5-2x faster in SSE workloads. In FP stuff Bulldozer is just crushing K8 (not hard since K8 has 64bit FPU).

On topic of SR core,integer execution looks to be having the major perf. boost while FP coprocessor is sort of a letdown. It's the same old FP unit we have in BD and PD. AMD even states they cut it down further to save power and area (they apparently axed one MMX pipe). Unless they somehow found a way to make that same old FP unit perform 50% better (if it had a major design bug?) then they will stand no chance against Haswell. Haswell will add FMA execution support and then AMD will need 4x the "cores" of intel equivalent Haswell server chip to match its FP throughput . It's obvious why this is not realistic(16 modules SR server chip is not gonna happen).
 
Sorry if I missed in in the article, but does anyone know if this is an AM3+ part, or a new socket? I got a long useful life out of my AM2+ (still on it in fact), so I'm just curious.
Last I heard, Piledriver was the last AM3+ processor.
On topic of SR core,integer execution looks to be having the major perf. boost while FP coprocessor is sort of a letdown. It's the same old FP unit we have in BD and PD. AMD even states they cut it down further to save power and area (they apparently axed one MMX pipe). Unless they somehow found a way to make that same old FP unit perform 50% better (if it had a major design bug?) then they will stand no chance against Haswell. Haswell will add FMA execution support and then AMD will need 4x the "cores" of intel equivalent Haswell server chip to match its FP throughput . It's obvious why this is not realistic(16 modules SR server chip is not gonna happen).
It says the same throughput at lower power/area.

What should help improve overall performance is the cache/prediction/front end improvements.

FMA also is only a fraction of all FP instructions.
 
Going back to independent decoders for each core.

The past is the future, or something like that. Guess it invalidates the major design ideas behind bulldozer.
 
Unfortunately these are the changes that needed to be included with Piledriver in 2012, not Steamroller launching sometime in 2013.
 

Wait wait wait... I thought THIS was universally known/admitted to be the problem with Bulldozer - the fact that they moved away from hand-tuned/drawn logic and moved to automated crap which ended up costing them in terms of power, performance AND delays. 😕

(A quick google = first link found)
 
Actually I think steam roller will be released early. Hopefully vishera is cancelled all together and they release steam roller/frontloader in January. That would be nice.
 
Actually I think steam roller will be released early. Hopefully vishera is cancelled all together and they release steam roller/frontloader in January. That would be nice.

Is there a Front Loader?

I thought it went

Bulldozer
Steam Roller
Pile Driver
Excavator
 
^ The first thing that came to my mind when i read the article.

Does steamroller have AVX2?

when i read the article, it says that FPU would lose peak frequency...
yet the cpu is made to reach high performance :colbert:

AVX-2 ? probably not...way too soon.....maybe in excavator
 
Is there a Front Loader?

I thought it went

Bulldozer
Steam Roller
Pile Driver
Excavator

BD
PD
SR
EX

The frontloader comment refers to the frontloader tractor pictured on the steamroller slide. Alas, if CPU design doesn't work out for them it appears they don't have much of a future in construction either...
 
when i read the article, it says that FPU would lose peak frequency...
yet the cpu is made to reach high performance :colbert:

AVX-2 ? probably not...way too soon.....maybe in excavator

No the article doesn't state that. AT has a good coverage. FP unit is "streamlined" so that it's smaller and yet performs the same as old one. SO basically from execution POV they didn't expand it. Whether the rest of the core changes(which are significant) may help the FP execution ,remains to be seen. Clock frequency may suffer due to other factors (teh core is more complex now).
AVX2 is a possibility. They will give Jaguar full ISA support next year . I say full since Jaguar will probably launch around Haswell and AVX will still be the top level ISA one can support. AMD already supports FMA3 with PD so they need to add support for integer 256bit AVX instructions. Since the FP unit remains unchanged , the AVX2 execution will be done analog to how Bulldozer does FP AVX : across one module. So not a big deal I guess. Performance on the other hand may be a big deal since Haswell will probably eat it alive in AVX2 code 🙂.
 
Last edited:
I sure they fixed all their mistakes and what not in time. But Intel will always be the leader as far as CPU and only CPU , not the graphic card port...
 
What are you expecting out of it. They flopped once. I think they do good this time around, but Intel will always have better tech and what not in desktop field..
 
Back
Top