Discussion NV Re-Enter ARM PC market in 2025!

Tigerick

Senior member
Apr 1, 2022
719
674
106
Highly Confident:

When Reuter reported NV and AMD are going to launch ARM SoC for PC:
Nvidia dominates the market for artificial intelligence computing chips. Now it is coming after Intel’s longtime stronghold of personal computers. Nvidia has quietly begun designing central processing units (CPUs) that would run Microsoft’s Windows operating system and use technology from Arm Holdings, two people familiar with the matter told Reuters.
Nvidia and AMD could sell PC chips as soon as 2025

I contacted my source, turned up there might have some truths there. Apparently, MS Surface which presumably will upgrade Surface Pro X to Snapdragon X Elite this year. Next year 2025 and later, NV and AMD have won the contract for upcoming Surface X series. PS: And I believe here is the source of Reuter's leaks. If Microsoft going to launch new Surface in 2025, the specs must be better than upcoming Surface Pro X w/ X Elite.......or not? See below:

N200N1X ?Tegra 3MendocinoA16SD X EliteSD XKrackan PointSD 8 EliteD9400D9300+SD 7+ Gen3D8400SD 6 Gen 4Exynos 1580Exynos 1580
PlatformWindowsWoAWindows RTWindowsiPad OSWoAWoAWindowsAndroid TabletAndroid TabletAndroid TabletAndroid TabletAndroidAndroidAndroidAndroid Tablet
NodesIntel 740 nmN6N4PN4PN4PN4PN3EN3EN4PN4PN4PN4PSF4P
FanlessYesYesYesNoYesNoNoNoYesYesYesYesYesYesYes
CPU cores
Prime2 x A725 - 3.2GHz ?--2 - 3.46GHz12 x Oryon - 4.3GHz8 x Oryon - 3GHz4 x Zen5 - 5GHz2 x Oryon G2 - 4.3GHz1 x X925 - 3.62GHz4 x X4 - 3.25GHz1 x X4 - 2.8GHz1 x A725 - 3.25GHz1 x A720 - 2.3GHz1 x A720 - 2.9GHz
Performance2 x A7254 x A9: 1.5GHz4 x Zen2 4.3GHz3 - 2GHz--4 x Zen5c - 3.5GHz5 x Oryon G2 - 3.5GHz3 x X4 - 3.3GHz4 x A720 - 2GHz4 x A720 - 2.6GHz3 x A725 - 3GHz3 x A720 - 2.2GHz3 x A720 - 2.6GHz
Efficiency4 x ADL-N - 3.7GHz-1------4 x A720 - 2.4GHz-3 x A520 - 1.9GHz4 x A725 - 2.1GHz4 x A520 - 1.8GHz4 x A520 - 1.95GHz
GPU32 EU - 750MHz - 0.4TFBlackwell520 MHzRDNA2 Radeon 610M4 @ 1398MHz -
1.44 TF
3.8 TF1.7 TFRDNA3+ 8CU @ 3000MHz - 3.1TFAdreno 830 @ 1100MHz - 3.4TFG925 MC12@ 1612MHz - 5TFG720 MC12 @ 1300MHz - 4TFAdreno 732 @ 950MHz - 1.46TFG720 MC7 @ 1300MHz - 2.3TF2 @ 895MHz - 0.46TFXclipse 540 - RDNA3 4CU
Memory64-bit LPDDR5-480032-bit LPDDR2-106664-bit LPDDR5-550064-bit LPDDR5-6400128-bit LPDDR5x-8448128-bit LPDDR5x-8448128-bit LPDDR5x-800064-bit LPDDR5T-960064-bit LPDDR5X-1066764-bit LPDDR5T-960064-bit LPDDR5x-840064-bit LPDDR5x-853364-bit LPDDR5-640064-bit LPDDR5-6400
Memory BW4.3 GB/s50 GB/s135 GB/s135 GB/s50 GB/s50 GB/s
NPU-40 TOP ?NANA17 TOPS45 TOPS45 TOPS50 TOPS14.7 TOPS
CellularNANAOptional 5GOptional X65 5GOptional X65 5GNA5G5G5G5G5G5G5G
GeekBench 6 ST~790~1361~2500~2345~2100~2677~ 3100~ 2821~2239~1873~1639~1130~1360~1360
Surface Go 4Surface Go ?Surface RTRyzen 5 7520UiPad 11Surface Pro 11Ryzen AI 7 350Oppo Pad 4 Pro (China)vivo Pad5 Pro (China)iQOO Pad2 Pro (China)Xiaomi Pad 7 w/ Keyboard and PenRealme Neo7 SE (China)Realme Neo 7x (China)Samsung A56Samsung Galaxy Tab S10FE
SRP8 + 64: $5792 + 32: $499< $5006 + 128: $34916 + 512: $1399>= $600>= $80016 + 512: CNY4000 (~$600)16 + 512:16 + 512: CNY4099 (~$600)8 + 256: ~$40012+512: CNY1899 (~$270)12 + 512 : CNY1599 (~$220)8 + 128: $5008 + 128: $499

Qualcomm, AMD and NV are all creating their ARM SoC because they are coming after Intel's longtime stronghold of personal computers. As ARM's Cortex X series are getting faster each generation especially with the arrival of LPDDR6 memory, ARM PCs are getting strong enough to challenge x86's performance.
 
Last edited:
  • Wow
Reactions: Saylick

Tigerick

Senior member
Apr 1, 2022
719
674
106
@Kepler_L2

Steam Deck and upcoming Switch 2 are having same memory bandwidth, their raw performance (ie. excluding DLSS) should be similar. There is no way Switch 2's performance is up to XSS which has double memory bandwidth. MLID does not know what he is talking about. We are talking about rasterization performance, DLSS of course will make difference in FPS.

PS: XSS has 4TF while T239 should have around 1.88 TF (depends on final clock speed), slightly higher than Steam Deck's 1.6TF.
 
Last edited:

Saylick

Diamond Member
Sep 10, 2012
3,881
9,012
136
@Kepler_L2

Steam Deck and upcoming Switch 2 are having same memory bandwidth, their raw performance (ie. excluding DLSS) should be similar. There is no way Switch 2's performance is up to XSS which has double memory bandwidth. MLID does not know what he is talking about. We are talking about rasterization performance, DLSS of course will make difference in FPS.

PS: XSS has 4TF while T239 should have around 1.88 TF (depends on final clock speed), slightly higher than Steam Deck's 1.6TF.
One thing to note is that Ampere's SM isn't as performant on a per-TFLOP basis as RDNA 2, which is what the Steam Deck uses. When Nvidia transitioned from Turing to Ampere, TFLOPS per SM doubled but performance was only like 30% higher, not double. That 1.88 TF might behave closer to 1.25 TF in comparison to the Steam Deck.
 
  • Like
Reactions: Tlh97 and Tigerick

Tigerick

Senior member
Apr 1, 2022
719
674
106
One thing to note is that Ampere's SM isn't as performant on a per-TFLOP basis as RDNA 2, which is what the Steam Deck uses. When Nvidia transitioned from Turing to Ampere, TFLOPS per SM doubled but performance was only like 30% higher, not double. That 1.88 TF might behave closer to 1.25 TF in comparison to the Steam Deck.
NV got deal mostly because Samsung 8N is cheap :p
 

soresu

Diamond Member
Dec 19, 2014
3,703
3,037
136
One can also say they got the deal because Nintendo is cheap, too. ;)
To an extent, but mostly that they would rather risk most investment on the games rather than the hardware since their last truly big step forward console Gamecube was a bit of a flop for them.

I'd expect what follows Switch to be >=3x GPU tho, and likely >=2x CPU.
 
  • Like
Reactions: Tlh97 and Saylick

Ghostsonplanets

Senior member
Mar 1, 2024
773
1,227
96
Switch 2 at 1.88 TFLOPs would be 612MHz? That's the clock I would expect for Portable at most. Specially when compared to current Switch clocking at 768MHz Docked.

Also, it depends on their BC solution, but surely they would need to at least match the Docked clock of current Switch for BC games running on Switch 2 to not break, no? Assuming the same 768MHz Docked, Switch 2 would be around 2.35TFLOPs. I guess that is what can be expected.
 

soresu

Diamond Member
Dec 19, 2014
3,703
3,037
136
Switch 2 at 1.88 TFLOPs would be 612MHz? That's the clock I would expect for Portable at most. Specially when compared to current Switch clocking at 768MHz Docked.
Given clocks have more than doubled for APUs since the Switch SoC was originally in the SHIELD TV it's a toss up what would be viable today - largely depending on a combo of cooling (to prevent throttling), battery, ALU count and fab node.
 
  • Like
Reactions: Ghostsonplanets

soresu

Diamond Member
Dec 19, 2014
3,703
3,037
136
Ooof wow, taking 2015's Godaveri APU as a reference the clockspeed of APUs has almost quadrupled from its 720 Mhz to the 2,800 Mhz of Hawk Point.
 

Ghostsonplanets

Senior member
Mar 1, 2024
773
1,227
96
Given clocks have more than doubled for APUs since the Switch SoC was originally in the SHIELD TV it's a toss up what would be viable today - largely depending on a combo of cooling (to prevent throttling), battery, ALU count and fab node.
I expect Switch 2 to be lower clocked. Nintendo has a power budget for the entire device of around 5 - 10W Portable and 10 - 13W Docked. I don't think this will change with their next device. But 612MHz would certainly be low for the GPU Docked clocks in my opinion. I think in the upwards of 800 to 900MHz is something more likely to be their choice.

But as you said, it depends on multiple factors. Switch form-factor performance is largely held back due to battery and cooling solution into a small body. High Performance from higher power budgets like PC Handhelds isn't to be expected at all.
 

jpiniero

Lifer
Oct 1, 2010
16,098
6,561
136
Given clocks have more than doubled for APUs since the Switch SoC was originally in the SHIELD TV it's a toss up what would be viable today - largely depending on a combo of cooling (to prevent throttling), battery, ALU count and fab node.

Thing is, the Switch 2 is likely to be fabbed at Samsung 8 nm, ie: the same as Ampere. There's a chance it's fabbed at a better Samsung node... but I wouldn't bet on it.
 
  • Like
Reactions: soresu

soresu

Diamond Member
Dec 19, 2014
3,703
3,037
136
Thing is, the Switch 2 is likely to be fabbed at Samsung 8 nm, ie: the same as Ampere. There's a chance it's fabbed at a better Samsung node... but I wouldn't bet on it.
That does fit with Nintendo's habit of using older hw and on cheap nodes.
 

TESKATLIPOKA

Platinum Member
May 1, 2020
2,691
3,246
136
Steam Deck has up to 1.6GHz TFLOPs.
That means up to 1.5625GHz clockspeed for the 8CU IGP.

Switch has supposedly 6x more Cuda than the previous version. Link
That's 1536 cores, If It's Ampere based, It would mean 12SM.
For It to be on par with Steam Deck It would need only ~800MHz.
If It has only 8SM, then ~1.2GHz.

I personally think Nintendo will have to increase TDP to 15W, but It will be compensated by a bigger battery.
 
Last edited:

Ghostsonplanets

Senior member
Mar 1, 2024
773
1,227
96
Steam Deck has up to 1.6GHz TFLOPs.
That means up to 1.5625MHz clockspeed for the 8CU IGP.

Switch has supposedly 6x more Cuda than the previous version. Link
That's 1536 cores, If It's Ampere based, It would mean 12SM.
For It to be on par with Steam Deck It would need only ~800MHz.
If It has only 8SM, then ~1.2GHz.

I personally think Nintendo will have to increase TDP to 15W, but It will be compensated by a bigger battery.
Wouldn't 1536 CC@800MHz be 2.45TFLOPs though?

For 1536 CC/12 SMs to match SteamDeck 512 SC/ALUs / 8 CU @1.6GHz 1.6TFLOPs wouldn't it just need around 530MHz?

Of course just broadly speaking about the TFLOPs value. Not comparing real world performance nor specific Ampere and RDNA 2 layout or quirks.

As a thought experiment, if Nintendo just use the same clocks as Switch (Which doesn't make sense as it's a new SoC design with an entirely different manufacturing process), it would leave Switch 2 with:

Portable:
1536 CC@307.2 MHz: ~943 GFLOPs
1536 CC@375MHz: ~1152 TFLOPs
1536 CC@460MHz: ~ 1.41 TFLOPs

Docked:
1536 CC@768MHz: ~ 2.35 TFLOPs
 

Tigerick

Senior member
Apr 1, 2022
719
674
106
Charlie @ SemiAccurate reported that NV's ARM SoC is made by Intel 3 process for 2025 launching which aligned with my leak.

I am not sure IFS going to make both 64-bit and 128-bit version, I tend to believe only 64-bit version available. The performance wise of course is much faster than T239. GPU should have around 4TF depends on clock speed and CUDA cores.

And this SoC also likely to be used on NV's handheld console to fight with AMD's custom APU. Can't wait.... ;)

PS: Anyone interested on upcoming Surface X RT Edition? :p
 
Last edited:

SpudLobby

Golden Member
May 18, 2022
1,041
701
106
Charlie @ SemiAccurate reported that NV's ARM SoC is made by Intel 3 process for 2025 launching which aligned with my leak.

I am not sure IFS going to make both 64-bit and 128-bit version, I tend to believe only 64-bit version available. The performance wise of course is much faster than T239. GPU should have around 4TF depends on clock speed and CUDA cores.

And this SoC also likely to be used on NV's handheld console to fight with AMD's custom APU. Can't wait.... ;)

PS: Anyone interested on upcoming Surface X RT Edition? :p
There’s a 64 and 128-bit bus version?
 

FlameTail

Diamond Member
Dec 15, 2021
4,384
2,756
106
64-bit bus will kill any potential of the iGPU. Unless it's a very cheap SKU like the Snapdragon 7c series.
Snapdragon 7c series actually had a 32 bit bus iirc. Horrible.

Still, 64-bit + LPDDR6 is actually 102 GB/s, which is a decent amount of bandwidth.

That's the same amount as Apple M3, and more than the M1 (68 GB/s).
 

Ghostsonplanets

Senior member
Mar 1, 2024
773
1,227
96
Snapdragon 7c series actually had a 32 bit bus iirc. Horrible.

Still, 64-bit + LPDDR6 is actually 102 GB/s, which is a decent amount of bandwidth.

That's the same amount as Apple M3, and more than the M1 (68 GB/s).
Good call with the M1 comparison. Had forgotten that used LPDDR4x. So even a 64-bit bus will provide a good amount of bandwidth with LPDDR5/X/T. As long as Nvidia provides a reasonable cheap SoC, that's good for me.
Why is Nvidia partnering with Microsoft for the next Surface X?
WoA deal with QCOM expired and rumor says MS is pissed off about QCOM not fostering the WoA ecosystem despite having exclusive rights to it until now. So they're partnering with AMD for a custom Arm SoC for Surface and Nvidia is also developing an SoC to enter the WoA market. Not sure if they also have a deal with MS, but they might have.
 

FlameTail

Diamond Member
Dec 15, 2021
4,384
2,756
106
WoA deal with QCOM expired and rumor says MS is pissed off about QCOM not fostering the WoA ecosystem despite having exclusive rights to it until now.
Damn. Guess we should have seen this coming.

After more than 10 years of WoA's debut, it still has <0.1% marketshare.

However, the unfortunate thing is that they are doing it just at the time Qualcomm seems to be finally getting their # together, with their Nuvia acquisition bearing fruit and the debut of the new Snapdragon X series.

Even as a Qualcomm fan, I am not complaining. I welcome Nvidia as a competitor to Qualcomm in the WoA space. It will ensure that Qualcomm doesn't become complacent or stagnant.
So they're partnering with AMD for a custom Arm SoC for Surface and Nvidia is also developing an SoC to enter the WoA market. Not sure if they also have a deal with MS, but they might have.
Well, I can see how Microsoft would like to give Nvidia a leg up to enter the industry.