Speculation: Ryzen 4000 series/Zen 3

Yotsugi · Oct 7, 2019

Thunder 57 said:
We may/probably will see it at some point though.

No.

moinmoin · Oct 7, 2019

Yotsugi said:
No.

One day you'll manage to put even fewer words into a response.

Thunder 57 · Oct 7, 2019

Yotsugi said:
No.

LoL, care to add to that statement? I don't see it anytime soon, but to say it won't happen at all? That's premature I think.

amd6502 · Oct 7, 2019

Well probably lots of wishful thinking on my part since i want my next notebook to be ULP and have a threadripping mode too. You almost surely are correct on that and unfortunately i'ts going to be another year, end 2020 till we know for sure.

Right now I think essentially zero chance on SMT4 and guesstimate of very low chance (~3% or 1:30) only of 4-way multithreading .

Hopefully Nosta finds more interesting patents.

Richie Rich · Oct 8, 2019

Thunder 57 said:
Put a fork in it, SMT4 is dead. It was never a thing. There was never any evidence to suggest it.

I don't care about SMT4 itself, I want wide core with high IPC. Me as a customer I demand good products. And I cannot be happy that Apple mobile CPU A11 from 2017 is much stronger then Zen 3 will be in 2020. That's a shame for x86.

I'm pretty sure AMD wouldn't leak SMT4 feature on that Zen 2 presentation. Yeah, hope's still alive.
On the other side. Canceling ambitious projects might be the reason why Keller left AMD again in 2015. He left AMD in 1999 when they canceled his high performance Hammer CPU. History repeats? I wouldn't be surprised.

Kedas · Oct 8, 2019

1) TSMCs High Volume Production ready for 7nm+ !!

TSMC Achieves HVM On Its 7nm+ EUV Process - Next Generation CPUs And GPUs On 7nm EUV Are Now A Go

In a statement published yesterday, TSMC has confirmed that it has achieved High Volume Manufacturing (HVM) on its critical 7nm+ Extreme Ultra Violet (EUV) process. This is a landmark event because it marks the critical shift from a sub-EUV wavelength to EUV successfully completed by a foundry...

wccftech.com

2) AMDs Design of ZEN 3 is ready.

3) We know it's the same AM4 socket (Ryzen 5000, Zen 4 is DDR5)

So what are we waiting for?

Arzachel · Oct 8, 2019

Richie Rich said:
I don't care about SMT4 itself, I want wide core with high IPC. Me as a customer I demand good products. And I cannot be happy that Apple mobile CPU A11 from 2017 is much stronger then Zen 3 will be in 2020. That's a shame for x86.

How are people still parroting this meme

Yotsugi · Oct 8, 2019

Kedas said:
AMDs Design of ZEN 3 is ready.

Better, they even taped out some stuff.

Gideon · Oct 8, 2019

Arzachel said:
How are people still parroting this meme

Why is this a meme? I agree that apple cores are different, they sacrifice considerable density to achive what they do, etc ... but they are considerably wider and considerably faster in most general-purpose operations, even when you disregard full stack optimizations, etc.

E.g. if Zen4 ends up being 50% wider, then why is this a "meme"?

EDIT: I get if people don't take specint2006 and other synthetic workloads seriously, but there have been plenty of other cases. One software dev tried some single-threaded self-made workloads (impossible to "optimize" by vendor or full-stack advantages, apple supposedly has) on both Desktop Mac Pro and IPhone and the phone still had the same (or better) ST performance, despite running Ghz's slower. A12/A13 has the best IPC in the business, and this was also said by Anandtech's Andrei.

DrMrLordX · Oct 8, 2019

Richie Rich said:
I don't care about SMT4 itself, I want wide core with high IPC. Me as a customer I demand good products. And I cannot be happy that Apple mobile CPU A11 from 2017 is much stronger then Zen 3 will be in 2020. That's a shame for x86.

A11 isn't really stronger than Zen2. What makes you think it'll be stronger than Zen3? It might be stronger at some arbitrary low-power point but that's functionally meaningless. Anyway, SMT2 will do quite well on a wider core. Instead of 25-30% performance improvement, we might see more gains. 40% is not too much to hope for in applications that can't saturate the pipeline with just one thread per core.

Kedas said:
So what are we waiting for?

For Zen2 to go through its product cycle. July 2020 here we come!

DisEnchantment · Oct 8, 2019

New AMD Patent Application
Prefetch data from RAM into L3 to reduce latency. With those big L3s this could mean something.

20190294546 - PREFETCHER BASED SPECULATIVE DYNAMIC RANDOM-ACCESS MEMORY READ REQUEST TECHNIQUE

A method includes monitoring a request rate of speculative memory read requests from a penultimate-level cache to a main memory. The speculative memory read requests correspond to data read requests that missed in the penultimate-level cache. A hit rate of searches of a last-level cache for data requested by the data read requests is monitored. Core demand speculative memory read requests to the main memory are selectively enabled in parallel with searching of the last-level cache for data of a corresponding core demand data read request based on the request rate and the hit rate. Prefetch speculative memory read requests to the main memory are selectively enabled in parallel with searching of the last-level cache for data of a corresponding prefetch data read request based on the request rate and the hit rate.

Vattila · Oct 8, 2019

H T C said:
What if Zen 3 has an interposer? How would that change number of links?

It should have no effect. The interconnect between L3 slices is implemented on the CPU chiplet.

An interposer should allow more complex, wider, faster and more efficient interconnect between chiplets, though, since a silicon interposer allows much finer metal layers and much lower energy-per-bit.

Richie Rich · Oct 8, 2019

DrMrLordX said:
A11 isn't really stronger than Zen2.

A12 has 158% of Skylake IPC in SPECint. A11 is slower but not much because it has 6xALUs too. It is nice example that 6xALU core needs some evolution steps to get max performance (pick the lowest fruits).

DrMrLordX said:
we might see more gains. 40% is not too much to hope for

Good point. If Zen 2 SMT2 can gain +20% more performance this means average ALU loading is 80%. With 6xALU core you have base of 150%... so theoretically there might be +70% gain ( to Zen 2). However according to Zen 3 ST it would be around 40%. That's massive gain.

NTMBK · Oct 8, 2019

Vattila said:
It should have no effect. The interconnect between L3 slices is implemented on the CPU chiplet.

An interposer should allow more complex, wider, faster and more efficient interconnect between chiplets, though, since a silicon interposer allows much finer metal layers and much lower energy-per-bit.

Of course, an active interposer could move some logic off the compute die and into the interposer, opening up all sorts of options for topology.

Ajay · Oct 8, 2019

DisEnchantment said:
New AMD Patent Application
Prefetch data from RAM into L3 to reduce latency. With those big L3s this could mean something.

20190294546 - PREFETCHER BASED SPECULATIVE DYNAMIC RANDOM-ACCESS MEMORY READ REQUEST TECHNIQUE

A method includes monitoring a request rate of speculative memory read requests from a penultimate-level cache to a main memory. The speculative memory read requests correspond to data read requests that missed in the penultimate-level cache. A hit rate of searches of a last-level cache for data requested by the data read requests is monitored. Core demand speculative memory read requests to the main memory are selectively enabled in parallel with searching of the last-level cache for data of a corresponding core demand data read request based on the request rate and the hit rate. Prefetch speculative memory read requests to the main memory are selectively enabled in parallel with searching of the last-level cache for data of a corresponding prefetch data read request based on the request rate and the hit rate. View attachment 11717

Wait a minute here. How much memory bandwidth does Zen3 have in order to have significant enough read throughput to make speculative reads?!
Heh, and why does that patent show only 4 cores? I don't this this Patent is for Zen3. Seems like AMD is engaged in some patent bracketing or something.

Ajay · Oct 8, 2019

Yotsugi said:
Better, they even taped out some stuff.

What 'stuff'? Zen3 CCD? Zen3 IOD? Please elaborate, TIA.

soresu · Oct 8, 2019

Richie Rich said:
A12 has 158% of Skylake IPC in SPECint. A11 is slower but not much because it has 6xALUs too. It is nice example that 6xALU core needs some evolution steps to get max performance (pick the lowest fruits).

Ah, but at what clock freq does power consumption jump through the roof due to uArch mobile optimisations?

And we already know the intrinsic vector length limits of NEON SIMD are below that of AMD, let alone Intel with AVX512.

Of course this will change in the future with SVE2, but that is then, this is now.

There still seems to be a gulf between benchmarking the 2 platforms that respects all possible performance avenues, and vector/SIMD length is a big one in certain use cases.

soresu · Oct 8, 2019

NTMBK said:
Of course, an active interposer could move some logic off the compute die and into the interposer, opening up all sorts of options for topology.

I thought the whole point of the interposer was interconnect, surely integrating the IO would be the better use case then?

scannall · Oct 8, 2019

soresu said:
Ah, but at what clock freq does power consumption jump through the roof due to uArch mobile optimisations?

And we already know the intrinsic vector length limits of NEON SIMD are below that of AMD, let alone Intel with AVX512.

Of course this will change in the future with SVE2, but that is then, this is now.

There still seems to be a gulf between benchmarking the 2 platforms that respects all possible performance avenues, and vector/SIMD length is a big one in certain use cases.

iOS is OSX, with a touch interface. Should be pretty easy to compare.

soresu · Oct 8, 2019

scannall said:
iOS is OSX, with a touch interface. Should be pretty easy to compare.

Should be, and yet we still have these strangely limited benchmarks that miss a crucial area of modern CPU performance in the SIMD execution.

Dunno how we would go about comparing them though - perhaps dav1d would suffice to at least test an AVX2 cpu vs a NEON cpu, but dav1d lacks AVX512 code at present to compare further.

extide · Oct 8, 2019

Ajay said:
What 'stuff'? Zen3 CCD? Zen3 IOD? Please elaborate, TIA.

Frankly at this point they probably have full on engineering samples back in the labs. They taped out a while ago.

Gideon · Oct 9, 2019

soresu said:
Should be, and yet we still have these strangely limited benchmarks that miss a crucial area of modern CPU performance in the SIMD execution.

Dunno how we would go about comparing them though - perhaps dav1d would suffice to at least test an AVX2 cpu vs a NEON cpu, but dav1d lacks AVX512 code at present to compare further.

Geekbench has had AVX512 for ages, why not use it?

Yotsugi · Oct 9, 2019

Gideon said:
Geekbench has had AVX512 for ages, why not use it?

Nowhere near ubiquitous enough.

DrMrLordX · Oct 9, 2019

Richie Rich said:
A12 has 158% of Skylake IPC in SPECint.

Yeah, but . . .

soresu said:
Ah, but at what clock freq does power consumption jump through the roof due to uArch mobile optimisations?

. . . ah, you beat me to it. A11 (and A12) don't clock high enough to be 158% faster than Skylake, Zen2, or Zen3. Only the Apple design team really knows why someone, somewhere hasn't tried making a higher-clocked version of their cores to set the world on fire. They ARE impressive in what they've gained over the years. AMD should be taking notes. But let's be realistic here.

Bringing it back to Zen3 . . . yes, I think AMD could benefit from a wider core. And I'll repeat my point that SMT2 (not 4) will make it pretty easy for everyday users to exploit that wider core. I just don't think AMD needs to lose any sleep over the possibility that Zen3 might be slower than some Apple SoC at such a low clockspeed that nobody's really going to care about that comparison anyway. Zen3 might lock horns with an Axx variant in the notebook sector, eventually. But the software they run will be so different that it'll be hard to make reliable comparisons between the two.

Richie Rich · Oct 9, 2019

DrMrLordX said:
Apple SoC at such a low clockspeed that nobody's really going to care about that comparison anyway

Don't be influenced by desktop CPUs. Server 64c Epyc 7742 has a base frequency 2.25 GHz (may boost to 2.5 GHz within TDP). Apple A13 runs at 2.66 GHz too.... so for servers is freqency identical however performance is around +50% higher for fruit machine A13. Power consumption for A13 is around 4W, subtract consumption of GPU and idling/sleeping 5 more cores, it can be 3.5W x 64c = 224 W (Epyc has TDP 225W). Pretty comparable consumption with massive performance gain +50%.

6xALUs is killing feature. That is loud alarm for Intel and AMD and they should lose a sleep. So far they are lucky that this 6xALU beast is bounded in iPhone only thanks to Apple management. Steve Jobs was very challenging person and IMHO he would had a courage to change server business by server version of their 6xALU beast (Apple needs for their cloud service thousands servers too). And cloud service allows to keep HW in Apple's hands by selling service instead of HW.

Zen 3 with 6xALUs will be already 3 years behind Apple in CPU technology (A11 appeared in 2017). If Zen 3 won't be wide core, then it is tragedy for x86 and ARM with Cortex A78 will take server and laptop markets. Don't forget how ended up superior archs like IBM PowerPC, Itanium, Motorola 68000, DEC Alpha - all these were smashed by cheap, mass produced and thus faster evolving black horse called x86. Today the history repeats, just this time the black horse is ARM.

Speculation: Ryzen 4000 series/Zen 3

Golden Member

Diamond Member

Diamond Member

Senior member

Senior member

Senior member

Senior member

Golden Member

Platinum Member

Lifer

Golden Member

Senior member

Senior member

Lifer

Lifer

Lifer

Diamond Member

Diamond Member

Golden Member

Diamond Member

Senior member

Platinum Member

Golden Member

Lifer

Senior member