- Mar 3, 2017
- 1,777
- 6,789
- 136
Very.How good is the memory controller on Strix Halo
No.As far I know it’s not very good and AMD had trouble with this.
It's stuck because board requirements.There is a reason why it’s stuck to 8000MHz as well. The latency issue is probably exacerbated due to this
This is a fatal flaw with LPDDR5X. Sure, when on battery, it should run as-is but when on AC power, it should switch to low latency power burning mode. Always conserving energy was just a bad engineering decision. It badly needs a turbo mode.
It's the only review that mentions throttling, where others usually say positive things about the thermal solution. At least from those I read.Kinda weird, my mini pc with the same cooler that most of the other ones have never throttles. It looked like frameworks heatsink was going to be better than that cooler that's making the rounds in the other designs.
Have you looked at cpu load vs time plots? It is possible that x3D is able to pull ahead on critical path thanks to higher boost, where the power limit is not a problem. I never compiled unreal engine so have no intuition about its cpu load vs time profile, so might be I am completely off the mark.Edit: The workload was compiling unreal engine. It doesn't benefit very much, maybe 3.25% from V$ on the 9950X3D. Nothing significant. Other related workloads varied back and forth a bit, but in my line of work there was nowhere I found a compelling performance or efficiency advantage for Strix Halo
AMD slides cite some improvement within CCX if I gets the slide right https://chipsandcheese.com/?attachment_id=3194647 cycles (i.e. it's the same).
This review is excellent too
it's very interesting bc it's basically the exact same CCD... Halo only has better IOD, low power interconnect and lpddr5x 8000
how is halo really achieving even up to x2 less power for same perf?
also both 10% faster and 20% slower in some scenarios? the slower is bc memory not designed for high perf?
basically it shows that zen 5 full power is still bottlenecked/untapped and minimal improvements dramatically increase performance
It is delusional to think that laptops aren't used plugged in.Yes what a terrible engineering decision to make LOW POWER DDR conserve power! They should have anticipated people would inappropriately use it in situations for which it isn't intended, and designed it so it is able to discard its main reason for existence.![]()
be so frbasically it shows that zen 5 full power is still bottlenecked/untapped and minimal improvements dramatically increase performance
ok. the issue is most likely LPDDR5X then
it's very interesting bc it's basically the exact same CCD... Halo only has better IOD, low power interconnect and lpddr5x 8000
how is halo really achieving even up to x2 less power for same perf?
also both 10% faster and 20% slower in some scenarios? the slower is bc memory not designed for high perf?
basically it shows that zen 5 full power is still bottlenecked/untapped and minimal improvements dramatically increase performance
Its not the same CCD. AMD confirmed that somewhere (I dont have a link at the moment)-- it was produced on a low power track and uses low power transistors-- this is why it maxes out at 5.1 GHz.it's very interesting bc it's basically the exact same CCD... Halo only has better IOD, low power interconnect and lpddr5x 8000
how is halo really achieving even up to x2 less power for same perf?
also both 10% faster and 20% slower in some scenarios? the slower is bc memory not designed for high perf?
basically it shows that zen 5 full power is still bottlenecked/untapped and minimal improvements dramatically increase performance
It is a high bandwidth interface. But that had low-power states that could only take it so far. And you had retraining and latency implications every time the chip went down and came back up and so on. So for an always-on kind of a desktop kind of machine, that seemed like the best interconnect to connect that as we try to build this into an APU. The first thing we had to do was to change the interconnect between the two dies. And so the CCD that you see here, the core die that you see here, has a different item. That's the first change.
Its not the same CCD. AMD confirmed that somewhere (I dont have a link at the moment)-- it was produced on a low power track and uses low power transistors-- this is why it maxes out at 5.1 GHz.
Its 100% load the entire duration for the specific example case I've been using. The frequencies I stated are for effective frequency, as well.Have you looked at cpu load vs time plots? It is possible that x3D is able to pull ahead on critical path thanks to higher boost, where the power limit is not a problem. I never compiled unreal engine so have no intuition about its cpu load vs time profile, so might be I am completely off the mark.
CPU/Config | Time | Effective Clock | System Wall Draw | Watts per core |
Strix Halo | 2414.50s | 4.6GHz | 190W | 6.5W |
9950X3D | 1742.13s | 5.2GHz | ???W | ~10W |
9950X3D Eco Mode | 2366.24s | 3.8GHz | 190W | 3.1W |
How are you determining those per core readings? hwinfo?Its 100% load the entire duration for the specific example case I've been using. The frequencies I stated are for effective frequency, as well.
Granite ridge outperforms at less than half the power per core and lower clocks. Remember, I'm saying that in eco mode where its limited to 88W PPT it still performs better, while the desktop as a whole draws the same wall power as strix halo.
I expected if it was busy waiting on memory, the effective clock and power draw would reduce to reflect that. But I might be wrong on that. Every reading I can take indicates the CPU is completely maxed out and working hard.
Edit:
CPU/Config Time Effective Clock System Wall Draw Watts per core Strix Halo 2414.50s 4.6GHz 190W 6.5W 9950X3D 1742.13s 5.2GHz ???W ~10W 9950X3D Eco Mode 2366.24s 3.8GHz 190W 3.1W
CPU/Config Time Effective Clock System Wall Draw Watts per core Strix Halo 2414.50s 4.6GHz 190W 6.5W 9950X3D 1742.13s 5.2GHz ???W ~10W 9950X3D Eco Mode 2366.24s 3.8GHz 190W 3.1W
Yeah. They might not be completely accurate but at 120W PPT, strix halo has significantly more power to play with per core than GNR at 88W PPT thanks to the big IOD overhead and lower power level.How are you determining those per core readings? hwinfo?
All the canned benchmarks I ran showed exactly what people are saying - Nearly desktop performance for a fraction of the power.However the 9700X did poorly against the Max+ 395 in Cinebench/Blender.
Sensor readings being wrong doesn't change the outcome - It uses the same power to do the work more slowly. I am measuring wall draw with a kill-a-watt, so it's not like I'm relying on sensor readings to make this claim.Since it's 100% load, and values between 9950X3D runs scale nicely with the clock (~35% faster build with ~35% higher clock) further supporting your claim about the 100% load nature, the only thing that comes to my mind is that either there is something wrong with sensors on that Halo machine or your workload is really sensitive to latency, as when going over various benchmarks there was little benefit generally to lower latency memory during code compilation. Personally I never tested for latency influence as usually I have access to setups that have the same memory config. Nevertheless, interesting observation.
Well, you measure power supply efficiency this way tooSensor readings being wrong doesn't change the outcome - It uses the same power to do the work more slowly. I am measuring wall draw with a kill-a-watt, so it's not like I'm relying on sensor readings to make this claim.
It's the Nimo Mini PC Pro. It has a GameMax 12VO Flex ATX unit that appears to be a modified version of their Flex ATX gold rated 350W, and back-of-the-napkin math comparing it to the mini pc's using a power brick, which has a datasheet-provided efficiency figure, it looks to be 90%+ efficient, like GameMax claims.Well, you measure power supply efficiency this way too
Anyway, I am not arguing with your results. I just wonder what is the reason the performance is worse than expected given apparently higher clocks. If the actual clocks were similar to power limited 9950x3d it would make more sense, that is why I suggested the readings might be off. And which of the miniPCs you have?
It is delusional to think that laptops aren't used plugged in.
You don't need an engineering degree to realize that. Unfortunately, the problem is that meetings are often just nothing more than a bunch of scared people nodding heads. If someone disagrees, people either laugh at the person or use other psychological tricks to undermine the person's idea. If a meeting began with the reassurance that every idea, no matter how stupid sounding, would be welcome and there is a guarantee that no one will be ridiculed for speaking their mind, maybe we wouldn't need to keep revising standards with incremental improvements.
Couldn't be more true, and all of the hype surrounding Strix Halo seems to be completely ignoring the fact there are substantial tradeoffs compared to a normal desktop.Welcome to the real world, where there are tradeoffs!
Couldn't be more true, and all of the hype surrounding Strix Halo seems to be completely ignoring the fact there are substantial tradeoffs compared to a normal desktop.
I was definitely buying into the hype until reality slapped me in the face with the first real workload I tested on it.