Discussion AMD SoC Halo series GPU discussion

Page 12 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Joe NYC

Diamond Member
Jun 26, 2021
3,240
4,738
136
Or you can just quadruple the MALL with SoIC magicwaffen.

If Medusa Halo stays with 256 bit LPDDR5, and assuming GPU will be more capable, the only way to avoid the GPU being bandwidth bottlenecked all the time would be to have massive MALL / Infinity cache.

If the both top and bottom die were to be ~250 mm2 die size, than MALL could be 256 MB and leave a plenty of room on the bottom die for other things.
 
  • Like
Reactions: lightmanek

leoneazzurro

Golden Member
Jul 26, 2016
1,113
1,863
136
More than a bottleneck, it is probably a case of heavily diminishing returns for a noticeable increase in costs. I remember a graph about the hit rate of the cache against the size , which returned the bandwidth multiplier factor, and while the hit rate always increase with size, the increase went rapidly down after a certain amount of cache. 256 Mbytes was never used by AMD, even in the N21 case (128). N31 used 96 Mbytes for a 384 bit bus - and both these GPUs are classes above what Halo GPU is. 7800XT uses 64 Mbytes. I don't see Halo using more than 32 Mbytes for the reasons above.
 

Joe NYC

Diamond Member
Jun 26, 2021
3,240
4,738
136
More than a bottleneck, it is probably a case of heavily diminishing returns for a noticeable increase in costs. I remember a graph about the hit rate of the cache against the size , which returned the bandwidth multiplier factor, and while the hit rate always increase with size, the increase went rapidly down after a certain amount of cache. 256 Mbytes was never used by AMD, even in the N21 case (128). N31 used 96 Mbytes for a 384 bit bus - and both these GPUs are classes above what Halo GPU is. 7800XT uses 64 Mbytes. I don't see Halo using more than 32 Mbytes for the reasons above.

The cache hit rates reached diminishing returns in that chart, depending on resolution. and the hit rate was only in 50-60% range mostly, in 1440p and 40-50% in 4k.

It would be interesting to see the effect on actual FPS in a system that has a balanced bandwidth to GPU performance and then in one that is severely bandwidth starved.

1739870398580.png
 

leoneazzurro

Golden Member
Jul 26, 2016
1,113
1,863
136
I think after that graph AMD modified the caches in N3x and following, because with RDNA3 the cache size went down while they kept the efficiency at the same level. IIRC I did never see a similar graph for the RDNA3 architecture. Also, Strix Halo is not made for gaming at 4K for sure.
 

coercitiv

Diamond Member
Jan 24, 2014
7,225
16,982
136
I think after that graph AMD modified the caches in N3x and following, because with RDNA3 the cache size went down while they kept the efficiency at the same level. IIRC I did never see a similar graph for the RDNA3 architecture. Also, Strix Halo is not made for gaming at 4K for sure.
RDNA3 went from 256 bit max to 384 bit max, memory clocks went up as well.
 
  • Like
Reactions: Joe NYC

CouncilorIrissa

Senior member
Jul 28, 2023
662
2,566
96
Also we can see how memory bandwidth affects rendering. If my hunch is right Strix Halo should have higher single thread than Strix Point.
Not by much. Read bandwidth remains the same, only write bandwidth from IOD -> CCD has changed from 16B/clk to 32B.
 

Hitman928

Diamond Member
Apr 15, 2012
6,642
12,245
136
@igor_kavinski Would like to see long idle tests to see if the rumored LPE cores are a thing (haven't seen any official material on it so it seems like it isn't true). The heavy load test is also very interesting as the M4 is supposed to have a significantly lower power limit and has a bigger battery, but is still showing less battery life. Probably the lowered scores for the AMD systems on battery is because they are reducing the power limit to have better battery life when under load whereas the Macbook doesn't lower the power limit (or not nearly as much) so performance doesn't drop significantly on battery, but then battery life takes a much bigger hit when under heavy load.
 

branch_suggestion

Senior member
Aug 4, 2023
711
1,537
96
@igor_kavinski Would like to see long idle tests to see if the rumored LPE cores are a thing (haven't seen any official material on it so it seems like it isn't true). The heavy load test is also very interesting as the M4 is supposed to have a significantly lower power limit and has a bigger battery, but is still showing less battery life. Probably the lowered scores for the AMD systems on battery is because they are reducing the power limit to have better battery life when under load whereas the Macbook doesn't lower the power limit (or not nearly as much) so performance doesn't drop significantly on battery, but then battery life takes a much bigger hit when under heavy load.
Well the SoC tile is N4P so you can probably figure out why it isn't there.
Plans changed or the N3E info was complete bogus.
It does just fine vs M4 Pro when you consider the node disadvantage and the chassis being not ideal.
 

Hitman928

Diamond Member
Apr 15, 2012
6,642
12,245
136
Well the SoC tile is N4P so you can probably figure out why it isn't there.
Plans changed or the N3E info was complete bogus.
It does just fine vs M4 Pro when you consider the node disadvantage and the chassis being not ideal.

Right, seems like the leaks were wrong or, at the least, very outdated info.