FlameTail
Diamond Member
- Dec 15, 2021
- 4,384
- 2,761
- 106
I think the process I used is clearly described: I picked the 2023 top phone, Galaxy S23 Ultra (so X3) then picked S24 Ultra to derive the generational uplift between X4 and X925.You took the baseline as Cortex X3?
I think the process I used is clearly described: I picked the 2023 top phone, Galaxy S23 Ultra (so X3) then picked S24 Ultra to derive the generational uplift between X4 and X925.
That's still guessing, but that looks more or less what is expected.Yeah, I think that's reasonable, and it does line up (ish) with the apparent 15% iso-clock gain over X4, I think.
Sounds to me like ARM is really hungry.This man loves getting interviewed.
According to https://browser.geekbench.com/android-benchmarks
S24 Ultra vs S23 Ultra is +14%
So if the comparison really is against S23 Ultra, that'd be ~19.6% above S24 Ultra.
And GB ST score would be ~2600 (iPhone 14 level).
Thanks a lot!Conclusion
With slightly more accurate math, Bar 3 & 4 put X925 perf in GB6.2 perf quite high:
2,939 pts @ 3.8 GHz = 773 Pts / GHz for the Cortex-X925
3,110 pts @ 3.8 GHz = 818 Pts / GHz for the Cortex-X925 tweaked
From my previous chart,
M3 IPC = 774 Pts / GHz
M4 IPC = 844 Pts / GHz
this seems to show Arm has caught up in "IPC" vs Apple. Now the question is: how high will Arm customers go with frequency?
Yes, that was implied by my frequency question. I think freq will be limited by power in devices such as phones.Doesn't say anything about power either.
Well, they have. X925 would have the same IPC as A17 Pro. And a stone's throw away from M4 (if you exclude SME).Thanks a lot!
Though this kind of computations is prone to errors, this seems to show Arm has caught up in "IPC" vs Apple. Now the question is: how high will Arm customers go with frequency?
Give this man an award🏆Arm's GB6.2 1T IPC chart → the Cortex-X4 is likely Arm's baseline.
Step 1 - what is the X925 vs X4 GB6.2 uplift iso-frequency?
Using the handy Plot Digitizer website I've stumbled on, the GB6 IPC uplift is approximately +14.65% from the X4 → X925. This is the only time Arm states IPC vs X4.
View attachment 102206
Step 2 - so what are the actual scores?
Cortex X4 @ 3.390 GHz = 2287 Pts
Cortex X925 @ 3.390 GHz = 2622 Pts (+14.65% IPC claim via step 1)
Cortex-X925 @ 3.800 GHz = 2939 Pts (+12.10% claimed freq. target)
Step 3 - plug the data into the 2nd chart
View attachment 102204
We only "know" Bar 3 = Cortex-X925 @ 3.8 GHz = 2939 from Step 2. Thus, Plot Digitizer and a little arithmetic:
Bar 1 | 2023 Premium Android = 2286.87 (Bar 3 / 1.328166)
Bar 2 | 2023 Best-in-Class 3.8 GHz = 2890.91 (Bar 1 * 1.264413)
Bar 3 | Cortex-X925 @ 3.8 GHz = 2939
Bar 4 | Cortex-X925 @ 3.8 GHz + tweaks = 3110.14 (Bar 1 * 1.36)
Step 4 - which CPUs align with those scores?
Bar 1 estimate via Step 3 = 2286.87 (100%)
NBC | Cortex-X4 @ 3.390 GHz = 2287 (100.00%) ✅
NBC | Cortex-X3 @ 3.360 Hz = 2107 (92.13%)
It's the X4. Thus, Bar 1 | "2023 Premium Android" baseline at 1x is the Cortex-X4.
//
Bar 2 estimate via Step 3 = 2890.91 (100%)
NBC | Apple A17 Pro @ 3.780 GHz = 2930 (101.35%) ✅
NBC | Apple A16 @ 3.460 GHz = 2644 (91.46%)
By frequency alone, it's the A17 Pro. But the scores also nearly precisely match. Thus, Bar 2 | "Best-in-Class 3.8 GHz" is the A17 Pro.
So I believe @SarahKerrigan had it right from the get-go and the math lines up:
2023 Premium Android → TCS23 | Cortex-X4; Cortex-X4 phones launched in late 2023.
2023 Best in Class → A17 Pro launched in late 2023. Apple is the well-known "Best in Class" Arm uArch.
Conclusion
With slightly more accurate math, Bar 3 & 4 put X925 perf in GB6.2 perf quite high:
2,939 pts @ 3.8 GHz = 773 Pts / GHz for the Cortex-X925
3,110 pts @ 3.8 GHz = 818 Pts / GHz for the Cortex-X925 tweaked
From my previous chart,
M3 IPC = 774 Pts / GHz
M4 IPC = 844 Pts / GHz
Thanks a lot!
Though this kind of computations is prone to errors, this seems to show Arm has caught up in "IPC" vs Apple. Now the question is: how high will Arm customers go with frequency?
Well, they have. X925 would have the same IPC as A17 Pro. And a stone's throw away from M4 (if you exclude SME).
Give this man an award🏆
Doesn't say anything about power either.
Yes, that was implied by my frequency question. I think freq will be limited by power in devices such as phones.
We know there’s going to be some other [Windows AI PC] systems coming out in the upcoming years. So, while the first-generation systems are going to be interesting, the second generation systems are going to be even more [so]. And folks who bought the first ones are probably going to be a little bit green with envy when they see what the second ones look like.
The only thing that comes to mind is probably talking about DLSS from nVidia offerings allowing for better gaming performance with a well tuned superscaling/AA solution.This interview has one interesting disclosure: Arm has finally straightforwardly & officially confirmed a "second generation" (aka non-Qualcomm) Windows on Arm SoCs.
#3 seems like a dig at Qualcomm's Oryon, without acknowledging Oryon will have been shipping for at least 6+ months to years earlier. Maybe Arm has some frustration at MS/QC's alleged exclusivity agreement, but whatever Arm ships will go head-to-head with with laptops equipped with M4 (4C+6c), Oryon (12C), Lunar Lake (4C+4c), and Kraken Point (Zen5 4C+4C APU).
- Multiple (?) "second generation" Windows on Arm SoCs in development.
- Launches are in "upcoming years".
- Claims the "second generation" systems are more interesting.
Surely, Cristiano Amon, SoCs releasing in 2025 should be more interesting than SoCs released in 2024.
Uh, don't you mean TSMC is getting a major new customer?ARM is now getting a major new customer for their CPUs and GPUs.
GOOGLE PIXEL
The Tensor G5 will be fully designed in house by Google, and manufactured on TSMC 3nm.
Until now (Tensor G1-G4) were co-designed with Samsung LSI, and manufactured at Samsung Foundry.
(Curiously, Arm gave no mention of any X925 cores on Intel 18A. Coming later? Delayed? Never?)
Intel - “Intel Foundry is deeply engaged with Arm, and this announcement demonstrates progress on the Arm Client. We are collaborating closely on leading-edge technology nodes, including Intel 18A, to provide best-in-class power, performance, and area metrics and enable next-generation mobile SoC products based on Arm’s new Compute Subsystems (CSS) for Client.” – Suk Lee, Vice President and General Manager of the Ecosystem Technology Office, Intel Foundry Services
Samsung Foundry - “GenAI capabilities embedded in mobile devices are in major demand amongst customers, requiring top-tier silicon technologies integrated with leading computing solutions. Combining Arm’s Cortex-X925 CPU solutions with Samsung Foundry’s latest 3nm GAA process node delivers on this demand. Our long-standing partnership with Arm allowed for early and tight collaboration in the areas of DTCO and PPA maximization for an on-time silicon delivery that hit performance and efficiency demands.” – Jongwook Kye, Executive Vice President and Head of Foundry Design Platform Development, Samsung Electronics
TSMC - “The AI-optimized Arm CSS is a prime example of the collaboration between Arm and TSMC in helping designers achieve unprecedented levels of performance and power efficiency, pushing the boundaries of semiconductor innovation for AI. Together with Arm and our Open Innovation Platform® (OIP) ecosystem partners, we empower our customers to accelerate their AI innovation using the most advanced process technologies and design solutions.” – Dan Kochpatcharin, Head of the Ecosystem and Alliance Management Division, TSMC
A number of Arm Ltd CPUs suffer from errata whereby an MSR to the SSBS special-purpose register does not affect subsequent speculative instructions, permitting speculative store bypassing for a window of time. We worked around this for Cortex-X4 and Neoverse-V3, in commit:
Pretty sure V3/Poseidon is X4 based.Neoverse V3 (?? based): +12% "Integer performance" (is this SPEC rate or SPEC? Surely not 2006? Shouldn't be GB6. Nonsense to avoid naming it, when Arm knows it)
Not for a long while.When will ARM cores move to wider vector pipelines?
A64fx has a chungus 2*512b FMA setup.Except for Neoverse V1, every other ARM core has had 128b vectors in recent history.
It is, but apparently Fujitsu's followup chip Monaka will be targeting a wider use case than just HPC/supercomputer stuff.A64fx is about as niche as you can get these days...