Discussion AMD's Soundwave ARM APU: The Beginning of Transformation !!!

Page 27 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

LightningZ71

Platinum Member
Mar 10, 2017
2,691
3,390
136
Apple demonstrated a way, actually a few ways, to do both instruction and OS emulation/conversion on the fly with their switch to ARM and the Rosetta 2 implementation. It isn't as costly as it used to be, and, to be frank, CPUs have a lot of performance left on the table in most situations these days.
 

Tigerick

Senior member
Apr 1, 2022
942
857
106
RDNA4.5 iGPU = Subset of RDNA5 GPU

RDNA4.5.jpg

After done with XDNA3, let's move on to RDNA4.5 iGPU. Due to N3P node, RDNA 4.5/5 will get biggest upgrade in term of numbers of SP per CU. Each CU will get two times of SP, from 64 to 128. For example, STX with 16CU will have same amount of SP as upcoming MDS1 with 8CU. MDS1 might have same TF if the clock speeds have remained the same.

It is not in the table above but RDNA4 still remain 64 SP per CU. That's why RDNA4.5 is actually considered subset of upcoming RDNA5. By doubling SP per CU, the amount of CU will be cut by half, therefore each Radiance core will need to have more than 3-4 times powerful. I am putting question mark not because I have doubt, it is because I don't know final clock speed of APU. Clock speed will determine the final TF, that's why Soundwave only need 12CU(24CU) to get double performance of E2600. Even with 24CU, SWV still won't be faster than STX/MDS1 also due to clock speed because SWV is designed to be power efficient APU; see my thread in Custom APU.

In conclusion, Soundwave APU will get first mover of latest technology from AMD in 2026, namely RDNA4.5 and XDNA3, period. :cool:
 
Last edited:

Tigerick

Senior member
Apr 1, 2022
942
857
106
Wait... isn't Exynos 2600 and 1680 will get that GPU uArch too?
Exynos 2600, yes. My calculation is 5.7TF

RDNA4+.jpg

Exynos 1680 should feature Xclipse 550 GPU, RDNA3.5+ with 2CU, same family as E2500. Considered this as entry SoC.
 

DZero

Platinum Member
Jun 20, 2024
2,153
781
96
Exynos 2600, yes. My calculation is 5.7TF

View attachment 133883

Exynos 1680 should feature Xclipse 550 GPU, RDNA3.5+ with 2CU, same family as E2500. Considered this as entry SoC.
Interesting, let's see how develops. I expect
E2600 going 10% less than SD 8 Gen 5 Elite, but better than SD 8 Gen 5
E1680 going on par with olde Exynos 2200. And that could be interesting to watch.
 

Tigerick

Senior member
Apr 1, 2022
942
857
106

About times; Let's see tomorrow is what I expect Microsoft going to announce...

Tomorrow #MSIgnite kicks off the next wave of AI transformation.
⚡️
🌐
Join us live at 9AM PT as Microsoft leaders unveil what's next in AI, cloud, and security.

And AMD is joining as well:

 

Tigerick

Senior member
Apr 1, 2022
942
857
106

Geez, NV is sponsoring the official Microsoft Ignite backpack meanwhile AMD is sponsoring water bottle. Wonder what are NV and AMD cooking? :p
 
  • Haha
Reactions: lightmanek

Tigerick

Senior member
Apr 1, 2022
942
857
106
Wait... isn't Exynos 2600 and 1680 will get that GPU uArch too?
After some diggings, E1680's GPU might be using some variant of E2600's RDNA4.5 GPU cause the older model, E1580 comes with RDNA3 4CU which perform similar with E1680's 2CU. The most interesting question should be why is AMD licensing their latest Radeon IP in entry SoC from Samsung? :p

Exynos 1580Exynos 1680Exynos 2600SD 7+ G3SD 8s G4SD 8 G5SD 8 G6SD 8 Elite G4SD 8 Elite G5SD 8 Elite G6
End ProductTab S10 FEXiaomi Pad 7Xiaomi Pad 8Xiaomi Pad 9?Xiaomi Pad 8 ProXiaomi Pad 9 Pro?
Launch DateQ4 2024??Q1 2024Q2 2025Q4 2025Q4 2026Q4 2024Q3 2025Q3 2026
Node4LPP+SF2 ?SF2N4PN4PN3PN2N3EN3PN2
Die Size (mm2)89.54105.56103.55124.1126.2
ModemIntegrated 5348External 5410
CPU1+3+4 8-core1+4+3 8-core1+3+6 10-core1+4+3 8-core1+3+2+2 8-core2+6 8-core2+6 8-core2+6 8-core2+3+3 8-core
Prime CPU2.9GHz A7202.9GHz ?3.8GHz C1 Ultra2.8GHz Cortex-X43.21GHz Cortex-X43.8GHz Oryon4.32GHz Oryon4.61GHz Oryon
GPURDNA3 4CURDNA4.5 2CURDNA4.5 8CUA732 3SA825 2SA829 2SA830 3SA840 3S4S ?
FP321.33 TF~1.38 TF~4.1 TF1.46 TF2.36 TF2.5 TF3.38 TF3.69 TF
NPU14.7 TOPS?~100 TOPS~10 TOPS~16 TOPS~50 TOPS~80 TOPS

PS: Sammobile thought E1680 is a downgrade from E1580. Well, if you read my table, you should think otherwise. :cool:
 
Last edited:
  • Like
Reactions: bearmoo

DZero

Platinum Member
Jun 20, 2024
2,153
781
96
After some diggings, E1680's GPU might be using some variant of E2600's RDNA4.5 GPU cause the older model, E1580 comes with RDNA3 4CU which perform similar with E1680's 2CU. The most interesting question should be why is AMD licensing their latest Radeon IP in entry SoC from Samsung? :p

Exynos 1580Exynos 1680Exynos 2600
Launch DateQ4 2024??
Node4LPP+SF2 ?SF2
CPU1 + 3 + 4 8-core1 + 4 + 3 8-core1 + 3 + 6 10-core
Prime CPU2.9GHz A7202GHz ?3.8GHz C1 Ultra
GPURDNA3 4CURDNA4.5 2CURDNA4.5 8CU
NPU14.7 TOPS?~100 TOPS

PS: Sammobile thought E1680 is a downgrade from E1580. Well, if you read my table, you should think otherwise. :cool:
1680 uses 2 nm??? I tought that uses 4nm or even 3nm

Meanwhile, I don't be surprised if we see an Exynos 1630, to replace the 1330 with the RDNA3 with 2CU.
 

Doug S

Diamond Member
Feb 8, 2020
3,832
6,767
136
1680 uses 2 nm??? I tought that uses 4nm or even 3nm

Meanwhile, I don't be surprised if we see an Exynos 1630, to replace the 1330 with the RDNA3 with 2CU.

Does Samsung have a working process between 4LPP+ and SF2? AFAIK they never got their 3nm node to work at all.
 

regen1

Senior member
Aug 28, 2025
362
455
96
Does Samsung have a working process between 4LPP+ and SF2? AFAIK they never got their 3nm node to work at all.
Samsung's renaming/rebranding of some nodes has made it blurry.
They rebranded their upcoming SF3P to SF2.


As of now they had somewhat limited success on their 3nm node family with a cryptomining ASIC, W1000, Exynos 2500. Upcoming Exynos 2600 is supposedly on SF2(which is rebranded SF3P).
 

Doug S

Diamond Member
Feb 8, 2020
3,832
6,767
136

I remember reading somewhere that Exynos 2500 had been canceled and Samsung went all-Qualcomm for the S25 but I guess only the second part was true since it seems like they did find a few things to put the 2500 in.

Based on @regen1's post it sounds like SF2 is part of the same node family as SF3, so choosing it hoping to save money would be like choosing TSMC N5 to save money versus N4P - whatever higher per wafer cost you pay is canceled out by the improved density. Even more so if they've got SF2's yielding higher than SF3.
 

DZero

Platinum Member
Jun 20, 2024
2,153
781
96
I remember reading somewhere that Exynos 2500 had been canceled and Samsung went all-Qualcomm for the S25 but I guess only the second part was true since it seems like they did find a few things to put the 2500 in.

Based on @regen1's post it sounds like SF2 is part of the same node family as SF3, so choosing it hoping to save money would be like choosing TSMC N5 to save money versus N4P - whatever higher per wafer cost you pay is canceled out by the improved density. Even more so if they've got SF2's yielding higher than SF3.
Indeed, but it might be weird, I mean, I suspect that SF2 is around 50% yield, while SF3 might reached 60%. Let's see how ends that. I don't be surprised if even Samsung launches the Exynos 1630 or 1730 at 3nm with them.
 

Tigerick

Senior member
Apr 1, 2022
942
857
106
I have created two OS threads: WoA11 and Android PC and pinned them in the frontpage in case anyone wants to know more about upcoming OS.

TLDR: Both Android OS and WoA will be starting new race with ARM PC platforms. All OEMs are going to use whatever APU being made by Qualcomm, NV, AMD, Mediatek and Samsung?. What we are seeing right now are just the first wave of ARM APUs. Do you all still think Soundwave is just one-off project from AMD? :rolleyes:

Each platform has their own Pros and Cons. Who will win the war, nobody knows? But I do know who the loser would be: Intel. :p As Gemini said about Aluminium OS:
Google's new OS will treat ARM as the future and x86 as the legacy platform.
 
Last edited:

Tigerick

Senior member
Apr 1, 2022
942
857
106
View attachment 133103
Hoho, what is RDNA4-derived micro-architecture ???

Doesn't it refer to RDNA4+ or RDNA4.5 ???

Per agreement in 2019: Why does AMD license their latest RDNA IP to Samsung with Android support? Think guys.... :rolleyes:

I have calculated FP32 with above info. There are few ways to hit 6TF with 1.4GHz. One of them is through doubling of SP per CU. Let's see is it the RDNA4.5 improvement which we will see in Soundwave and E2600. As I said E2600 5G is little brother of Soundwave with half memory bandwidth. Do you still think Soundwave which is fabbed by N3P and 128-bit memory bus comes with tiny CPU and GPU cores? I have decoded the XDNA3 AIE, now it is RDNA4.5 turn...:cool:

PS: E2600 with 8CU's SNL scores hit 3100+, higher than A19 Pro. Go figures. :cool:
Xclipse 960 which is RDNA4-derived micro-architecture is being codenamed as AMD Juno with clockspeed reduced to 985MHz. Samsung has boosted the clockspeed of C1 Ultra to 3.8GHz to show better ST performance; still below D9500's 4.21GHz: 10-core vs 8-core. The reduction of GPU's clockspeed has decreased the FP32 from 5.7TF to 4TF. Still higher than 8 Elite Gen5's 3.7TF though.

Hoho, what is Juno's graphics IP fabbed by SF2? I bet the codename is not in the official roadmap of AMD. Let's see whether Juno is having same IP as upcoming Soundwave APU which is fabbed by N3P.

 
Last edited:
  • Like
Reactions: marees

Gideon

Platinum Member
Nov 27, 2007
2,044
5,103
136
Hoho, what is Juno's graphics IP fabbed by SF2? I bet the codename is not in the official roadmap of AMD. Let's see whether Juno is having same IP as upcoming Soundwave APU which is fabbed by N3P.

Kinda ironic if AMDs own Medusa APUs end up having older version of RDNA than Samsung phones (which seems to be the case if the rumor is true)
 
  • Like
Reactions: Tlh97 and marees

Tigerick

Senior member
Apr 1, 2022
942
857
106
Exynos 2600/2700 vs Soundwave APU: CPU Analysis

Dimensity 9500 5GExynos 2600 5GDimensity 9600 5GExynos 2700 5GSoundwaveM5X2 Elite X2E-80-100
Integrated 5G ModemYesExternal, 5410Yes??NoYes
NodeN3PSF2N2SF2PN3PN3PN3P
Memory64-bit LPDDR5x-1066764-bit LPDDR5x-1066796-bit LPDDR6-1066796-bit LPDDR6-10667128-bit LPDDR5x-9600128-bit LPDDR5x-9600128-bit LPDDR5x-9523
Memory BW85.33 GB/s85.33 GB/s114.1 GB/s114.1 GB/s153.6 GB/s153.6 GB/s152 GB/s
Prime Cores1 x C1-Ultra 4.21GHz1 x C1-Ultra 3.8GHzC2 UltraC2 Ultra4 x 4.6GHz6 x Oryon cores 4GHz
Single-Core 4.7GHz
Dual-Core 4.4GHz
Performance Cores3 x C1 Premium 3.5GHz6 x 3.04GHz6 x 3.4GHz
Pro1 Cores3 x C1 Pro 3.25GHz
Pro2 Cores4 x C1 Pro 2.7GHz6 x C1 Pro 2.75GHz
 
Last edited:

Tigerick

Senior member
Apr 1, 2022
942
857
106
Exynos 2600/2700 vs Soundwave APU: GPU Analysis

Exynos 2500 5GExynos 2600 5GExynos 2700 5GSoundwaveMedusa Point MDS1Gorgon Point
NodeSF3SF2SF2PN3PN3PN4P
Memory64-bit LPDDR5x-960064-bit LPDDR5x-1066796-bit LPDDR6-10667128-bit LPDDR5x-9600128-bit LPDDR5x-9600?128-bit LPDDR5x-8533
Memory BW76.8 GB/s85.33 GB/s114.4 GB/s153.6 GB/s153.6 GB/s136 GB/s
GPURDNA3+Juno RDNA 4.5 ?Juno RDNA 4.5 ?RDNA 4.5 ?RDNA 4.5 ?RDNA 3.5
WGP844648
CU168812816
SP Per CU6412812812812864
Total SP102410241024153610241024
GPU Frequency999 MHz999 MHz ???3100 MHz ?3100 MHz
FP32 (Dual Issue)4.1 TF4.1 TF~ 5 TF ?12.7 TF12.7 TF
 
Last edited:
  • Like
Reactions: lightmanek

Tigerick

Senior member
Apr 1, 2022
942
857
106
Thanks to a new architecture, the computing performance of the Exynos Xclipse 960 GPU is twice as high as that of its predecessor.⁴ This drives a ray tracing performance improvement of up to 50%.⁵
4,5: Based on internal testing in comparison to the Exynos 2500. Results may vary depending on operating conditions and other factors.

Hoho, let's decode the above statement regarding Exynos 2600's iGPU: Xclipse 960 aka AMD's Juno RDNA IP. It is based on new architecture; definitely refer to new generation of RDNA. Before I proceed, let's assume the story between Samsung and AMD: Samsung wants to showoff more selling features of X960 but AMD does not want to reveal too many details about new RDNA IP because it is the same IP would be used for all future APUs (N3P). AMD definitely have a final word on the statement; thus, the claims should be correct.

Check my table regarding the relationship between STX (N4P) and E2500 (SF3): the performance delta is caused by the clock speed; the features should be similar...Exynos 2500 is using RDNA3+/3.5 IP; thus it is safe to assume Xclipse 960 is using RDNA4+/4.5 IP. However RDNA4s (N4P) are designed for desktop dGPU; some leakers keep saying RDNA4 are not for mobile SoC, then what IP Samsung/AMD is using? :cool: Let's check the features below:

  • The computing performance of the Exynos Xclipse 960 GPU is twice as high as that of its predecessor. This is bold claim: Samsung claims computing performance is double than Xclipse 950. Hoho, even RDNA4's improvement is only 20% faster in rasterization's IPC. Then where is 100% performance claim from? Hoho, if you read my speculation about doubling SP per CU aka Compute Unit; then with same clock speed, I could claim double performance per CU as well. :cool: It is not full picture cause the number of CUs has been cut by half, that's why FP32 still remains the same.
  • If above feature is considered overselling, then the second statement is underselling: This drives a ray tracing performance improvement of up to 50%. With 8CU, X960 still manage to offer 50% better RT performance than X950's 16CU/RT. Hoho, anyone wants to estimate how powerful is each Radiance core in X960? :cool:

As I speculated before, Soundwave (N3P) is like bigger brother of E2600. E2600 is having 8CU with 4.1TF, then how many CUs would Soundwave having? Don't believe what AMD marketing people told you; I rather believe in MLID's story regarding Senior Management about SWV. And if anyone still believes in Medusa Point is having RDNA3.5+ GPU, hoho...:p
 

marees

Platinum Member
Apr 28, 2024
2,267
2,890
96
And if anyone still believes in Medusa Point is having RDNA3.5+ GPU, hoho...
AMD has a history of bundling old GPU tech in their APus. Witness vega gpu tech in AMD APus — long after AMD stopped supporting vega

Samsung could have pushed AMD for premium ip as their hardware could go into flagship devices but soundwave seems the complete opposite of flagship to me. Same is the case for medusa point APUs
 

Tigerick

Senior member
Apr 1, 2022
942
857
106
Exynos 2600/2700 vs SWV/SWV2? APU: NPU Analysis

NPU.jpg

AIE-XDNA Roadmap.jpg

Samsung Exynos 2600 has presented new generation of NPU with 32K (32,768) MAC units. It is the largest units atm. That's why Samsung is claiming "improves generative AI performance by 113% compared to its predecessor, E2500." That's mean E2600 has about 126 TOPS, biggest by far. By calculation, Samsung only needs to set the NPU speed @ 1.92GHz. That's the difference between phone SoC and mobile SoC. Due to low power, phone SoC will need to set the clock speed as low as possible. However, compared to A19Pro at 8192@2.17GHz, Samsung goes a bit extreme by employing 32,768@1.92GHz at the expense of die area. That's why E2600's NPU is about 3.5 times faster than A19 Pro.

@Fjodor2001 If you interested on AMD XDNA AIE, here is the updated version. Based on roadmap from FAD, XDNA4 should offer better scale and power efficiency. I am updating XDNA4 to employ 32,768 MAC units, that's mean next APUs (N2P?) with XDNA4 should offer 160 TOPS; double than SWV's 80 TOPS. FYI, MDS1 and Medusa Premium are both employing XDNA3 which could offer NPU TOPS in range of 100-110 depending on the clock speed. The formula is presented in the Excel sheet above if you are interested on the calculation of TOPS. :cool:
 
Last edited:
  • Like
Reactions: Fjodor2001