Discussion AMD’s Custom APU Discussion Thread

Page 2 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Tigerick

Senior member
Apr 1, 2022
782
750
106
AMD-RYZEN-Z2-SERIES-1200x659.jpg

Next Gen XBox Consoles and Handheld Preliminary Specs

ROG Xbox Ally / SteamDeck OLEDROG Xbox Ally XXbox Series XPS5 ProMagnus
Date2025202520202024
CodenameAerith PlusStrix PointCustom
ModelZ2AZ2 Extreme AI
N4PN3P + N3P ?
279 mm2144 + 264 = 408 mm2
CPU4 x Zen 24 x Zen 5 + 4 x Zen 5c8 x Zen 28 x Zen 2Zen 6
GPURDNA2 8CURDNA3.5 16CURDNA2 52CURDNA2 60CUNavi5 68CU
Memory128-bit LPDDR5-6400128-bit LPDDR5x-8000320-bit 10GB GDDR6 + 192-bit 6GB GDDR6256-bit 16GB GDDR6192-bit GDDR7
Memory Bandwidth102 GB/s120 GB/s560 GB/s576 GB/s864 GB/s
 
Last edited:

basix

Member
Oct 4, 2024
158
313
96
Or they simply had some space left, to put another core on the Die.

But 144m2 for the SoC part seems to be rather big. Are there additional memory PHY on the SoC Die? Or does the GPU Die really have a 384bit SI instead of 192bit?
 
  • Like
Reactions: Tlh97

MS_AT

Senior member
Jul 15, 2024
752
1,521
96
Probably 10C/20T in the console, with one extra Zen6c for binning
So 2 + 8 or 3+7? The latter looks even more weird (sorry if something is not a power of 2 it looks weird ;))

Still I would expect for 12 cores a mix of 8 + 4 dense with 4 reserved for the OS/background stuff, and 8 perf fully available for games. Now the devs will need to make core aware thread placement.
 
  • Like
Reactions: Tlh97 and marees

Kepler_L2

Senior member
Sep 6, 2020
916
3,749
136
Or they simply had some space left, to put another core on the Die.

But 144m2 for the SoC part seems to be rather big. Are there additional memory PHY on the SoC Die? Or does the GPU Die really have a 384bit SI instead of 192bit?
It looks like the GPU die is just Shader Engines + GPU Front-end, Memory controllers + MALL? And SoC is everything else (CPU, Display, Media, PCIe, NPU?)
 
  • Like
Reactions: Tlh97 and marees

ToTTenTranz

Senior member
Feb 4, 2021
473
864
136
It looks like the GPU die is just Shader Engines + GPU Front-end, Memory controllers + MALL? And SoC is everything else (CPU, Display, Media, PCIe, NPU?)

Does it make sense to put the NPU far from the GPU cores and caches? That way they can't use the NPU to offload FSR4 or other ML upscalers in games, for example.
 

marees

Golden Member
Apr 28, 2024
1,277
1,831
96
The speculation is that performance will be between 5080 and 4090/5080 super

Assuming this releases next november what would be the street price of the 5080 super at that point in time ? I don't see the console selling below that

As a sop to gamers, Microsoft could partner with AMD for an RDNA 5 based z3 extreme ? for handhelds by various 3rd party partners

Not sure if microsoft will also partner with AMD for a medusa halo like APU
 

basix

Member
Oct 4, 2024
158
313
96
They aren't going to run upscalers on the NPU.
Wouldn't that be a really nice use case on APUs? XDNA2 does support FP8 (50 TFLOPS) und is much more powerful than the iGPU in this regard. It would give AMDs iGPUs a quite relevant competitive edge.

When I look at FSR4 Redstone and its upcoming features (Neural Radiance Caching, Ray Regeneration, ML based frame generation, enhanced ML based super resolution), all could potentially benefit from NPU acceleration. Even without RT, only FG and SR would already be worth it.
So instead of having a mediocre gaming experience or using precious Die area for a bigger GPU, leverage the NPU to amortize its cost. At least for gaming use cases. Currently, the NPU is dark silicon for most of the time. If APU gaming gets into the mix, only few people would say no to that and the added cost for the NPU is better leveraged.
 

Kepler_L2

Senior member
Sep 6, 2020
916
3,749
136
Wouldn't that be a really nice use case on APUs? XDNA2 does support FP8 (50 TFLOPS) und is much more powerful than the iGPU in this regard. It would give AMDs iGPUs a quite relevant competitive edge.

When I look at FSR4 Redstone and its upcoming features (Neural Radiance Caching, Ray Regeneration, ML based frame generation, enhanced ML based super resolution), all could potentially benefit from NPU acceleration. Even without RT, only FG and SR would already be worth it.
So instead of having a mediocre gaming experience or using precious Die area for a bigger GPU, leverage the NPU to amortize its cost. At least for gaming use cases. Currently, the NPU is dark silicon for most of the time. If APU gaming gets into the mix, only few people would say no to that and the added cost for the NPU is better leveraged.
It's a lot of work to port GPU code to NPU, it isn't really that fast and upscaling on the NPU introduces an additional frame of input lag.
 
  • Like
Reactions: marees

basix

Member
Oct 4, 2024
158
313
96
I just talk about the matrix accelerated part of the algorithm. Everything else would stay on the GPU. That would not add a single frame latency.
Just look at the NPU as "remote matrix core / accelerator" but as it is on Die, not that remote as if PCIe or something else would be in-between.

Sure, it would be effort but it should be doable nevertheless. Prequisite is that matrix operations are cleanly separated from the rest of the algorithm. If it is intertwined with many other operations, it would get a mess.
We should be able to evaluate that when AMD releases the FSR4 code on GPUOpen.
 

Kepler_L2

Senior member
Sep 6, 2020
916
3,749
136
I just talk about the matrix accelerated part of the algorithm. Everything else would stay on the GPU. That would not add a single frame latency.
Just look at the NPU as "remote matrix core / accelerator" but as it is on Die, not that remote as if PCIe or something else would be in-between.

Sure, it would be effort but it should be doable nevertheless. Prequisite is that matrix operations are cleanly separated from the rest of the algorithm. If it is intertwined with many other operations, it would get a mess.
We should be able to evaluate that when AMD releases the FSR4 code on GPUOpen.
There is an extra frame of latency, just look at the Windows Upscaler using the X Elite NPU.
 

basix

Member
Oct 4, 2024
158
313
96
As far as I know, that one functions completely differently compared to FSR or DLSS.

Edit:
Here is a presentation from ARM at Siggraph 2024, where they describe different ML approaches to upsampling. What they came up with was basically a super-charged FSR2 ("v3 - parameter prediction"):
- "Regular" Upsampling algorithm like FSR2+ or DLSS2+ (FSR2 was the basis in the ARM presentation)
- Filter weight / parameter prediction with a DNN showed best results ("v3" in the presentation; Microsoft's Auto SR is more of an image prediction like "v1" in the presentation). Pretty much what AMD and Nvidia are doing with FSR4 and DLSS2+

Now, if the DNN has clean interfaces (pre-compute/post-compute in the presentation get executed on the GPU, DNN in-between on an "accelerator"), it might get executed on any DNN accelerator HW. Might be matrix cores in the GPU or offloaded to an NPU.
I do not see any fundamental obstacle to that. Sure - datapaths, caches etc. need to be supported and any kind of API to do that on a chip. But as it is possible to move data from CPU to GPU and vice versa, that would also be doable looking at GPU and NPU. Also, no added frames of latency.

In the presentation it is not clear, what ARM has in mind (NPU or GPU acceleration). They just call it "𝐴𝑐𝑐𝑒𝑙𝑒𝑟𝑎𝑡𝑜𝑟𝑇𝑂𝑃𝑠"
 
Last edited:

marees

Golden Member
Apr 28, 2024
1,277
1,831
96
More code names drop

  1. orion (PS6 ?)
  2. Robin (PS5 handheld ?)
  3. Robin plus (PS5 handheld revision ?)
  4. Canis (Microsoft handheld?)


CORRECTIONS:

  1. orion (PS6)
  2. Robin (PS5 blockchain mining version)
  3. Robin plus (PS5 blockchain mining version)
  4. Canis (PS6 handheld)


 
Last edited:

soresu

Diamond Member
Dec 19, 2014
3,910
3,338
136
Seems very odd for PS6 and PS6 handheld to have opposing codenames tho.

It would have made a lot more sense for PS6 to be Magnus or Majoris rather than Xbox Next.
 

soresu

Diamond Member
Dec 19, 2014
3,910
3,338
136
From a funny situation to see the unveiling of the return of the handheld consoles
I mean, they never left and on Nintendo side have usually been the best selling consoles in any generation.

Given Nintendo are being increasingly full of themselves that may not be the case for Switch 2 this time around.

Sony have a real chance to make inroads this time if they don't mess it up.

If they made a 'lite' version of PSVR to go with it for a HMD/NED in place of TV tethering then it could be just what the doctor ordered.

Something along the lines of Bigscreen Beyond in size/weight would be perfect, with the console acting as the compute/battery/IO 'puck' equivalent to Apple Vision Pro.

Doesn't even need to do full tracked VR, just to display a beeg screen with some visual passthrough.