- Mar 3, 2017
- 1,777
- 6,791
- 136
Address him as "ES user" in your post and say please and maybe he will?Igor, assuming the ES user that provided the leaks earlier still has it, is there any way it would be possible to get him to run a locked 4 or 5 GHz ST run...
They claimed a 11-22% MT uplift in Blender. (11% for 9700X, 16% for 9600X, 17% for 9900X and 22% for 9950X). Blender has 23% IPC according to them. So for 17% IPC R23 you get like 8-16% uplift. 8% for 9700X, ~12% for 9600X and 9900X and 16% for 9950X.
And you'd have to recompile all your Linux distro from scratch with the proper Zen5 compiler flags. I have never done that as I never felt the need during my 30 years of using Linux because my new CPU was always performing better by default than the previous one.
If a CPU requires recompilation of existing applications to perform well, I call that a failure. [...]
TLDR: A modern successful CPU should not require recompilation of applications to perform well.
AMD Ryzen 9 7950X "Zen 4" Rocks On Intel's Clear Linux - Phoronix
www.phoronix.com
Even Zen 4 sees a performance improvement when running on an Intel optimized (and thus recompiled) distro.
View attachment 103989
Yeah but there will be developers who WILL optimize for Zen 5."Developers just need to optimize for it more!" is a bad excuse made by vendors of subpar microarchitectures for significantly longer than I've been alive.
Yeah but there will be developers who WILL optimize for Zen 5.
Think Unreal Engine.
Think AAA games.
Think Windows itself (once they release an optimized Visual Studio).
And obviously Linux developers will optimize too coz that's their passion.
And many, many others.
If you are competing in a tough market, you will do whatever is necessary to maintain your edge.
Because that's the quickest one I could find. My reasoning is that if an Intel optimized distro is giving Zen 4 an uplift, imagine what a distro focused on Zen 4/5 performance would do.Hardly a compelling argument for "recompiling to get acceptable perf is normal."
Because that's the quickest one I could find. My reasoning is that if an Intel optimized distro is giving Zen 4 an uplift, imagine what a distro focused on Zen 4/5 performance would do.
If I had the time and resources, I would find open source versions of software before Zen 4 announcement and one year after announcement and build them both with the latest compiler and maybe then we would have more data on what I'm claiming.
You are asking a code monkey who dropped out of the 42 main program coz I couldn't figure out in one month how to write my own secure malloc() function that passed strict Valgrind checksOkay. Elaborate. What kind of optimizations do you propose? Be specific.
You are asking a code monkey who dropped out of the 42 main program coz I couldn't figure out in one month how to write my own secure malloc() function that passed strict Valgrind checks
I have no clue TBH. But my brain says it's possible.
Here's one idea:What sorts of optimizations are you proposing?
With how starved the uncore / memory of Granite Ridge is i'm gonna go out on a limb and say 19% IPC for Turin (at 5600MT/s).What will be the rough average IPC uplift for Turin?
I predict ~17%
![]()
Here's one idea:
View attachment 103997
Suppose a developer has done extensive profiling for Zen 4 and made changes to his application so that when Zen 4 is detected, he uses specific hot functions that matter a lot to his application's core performance.
Now suppose he uses the same tool to profile Zen 5 and sees some big differences. Some of his assumptions about Zen 4 no longer hold true with Zen 5. So he creates specific functions for Zen 5 again to make sure that his application can get the most out of the new architecture.
This isn't something unheard of in the software world. Yes, most monkey programmers won't go to all this trouble. But Linux gurus, benchmark writers, game engine developers and authors of widely used software like 7-zip and WinRAR may do that. Can't say anything about the latter since it's closed source but maybe someone can look at 7-zip source and see if there are architecture specific optimizations in that.
Hackers love to hack. If someone like that thinks there is more performance to be squeezed out of a new architecture, you bet they will love to tackle that challenge. Because that's the real fun of hacking. The feeling of satisfaction when you crack a hard problem.
Don't forget John Carmack (his personal pet projects that he doesn't release to the world), Tim Sweeney, all the engine developers of AAA studios, Adobe and other workstation software developers.But with its weird front end and big FPU I do expect hand-tuned assembly could be much better. But no one outside of Oak Ridge and Lawrence Livermore will even consider that.
Good point. Yeah, twelve channels of DDR5-8800 could really make that baby rock!With how starved the uncore / memory of Granite Ridge is i'm gonna go out on a limb and say 19% IPC for Turin (at 5600MT/s).
It's already starved on Raphael and I can only see the gap widening between Desktop and Epyc.
I don't know but it is possible. Epic/Rad did write extremely optimized Zen 2 specific code for Unreal. They even had an article about it but I can't find it now.Don't forget John Carmack (his personal pet projects that he doesn't release to the world), Tim Sweeney, all the engine developers of AAA studios, Adobe and other workstation software developers.
Don't forget about Frostbite engine's former lead dev Johan Andersson who spearheaded the Mantle work with AMD all those years ago.Don't forget John Carmack (his personal pet projects that he doesn't release to the world), Tim Sweeney, all the engine developers of AAA studios, Adobe and other workstation software developers.
Twisting the knife, are you?And remember, you weren't just talking about optimized assembly - which is a thing but not necessarily a highly profitable one most of the time - but improvements just from recompiling with Zen5 specific compiler support.
Twisting the knife, are you?![]()
Clearly he’ll need to add some nested if statements and nested for loops to take advantage of the 2-ahead branch predictorOkay. Elaborate. What kind of optimizations do you propose? Be specific.
Thank you!Clearly he’ll need to add some nested if statements and nested for loops to take advantage of the 2-ahead branch predictor