CatMerc
Golden Member
Right, only looked at one result.Theoretical is just under 47GB/s and his result with the cores at 3.9GHz gets to almost 49GB/s.
With Ryzen he should be getting 44-45GB/s with the DRAM at 2933.
Didn't see the one with 49GB/s.
Right, only looked at one result.Theoretical is just under 47GB/s and his result with the cores at 3.9GHz gets to almost 49GB/s.
With Ryzen he should be getting 44-45GB/s with the DRAM at 2933.
HPET is enabled, absolutely. I will have to retest, though, as I did not realize it at the time - so thanks for catching that 😛
I think the memory could have been running at 3200, despite what the screenshot says. Others on the forums have stated seeing similar behavior with the 0083 BIOS. I ran a couple other simple benchmarks and they were entirely consistent (2300 in CPU-z ST, for example), but I pushed overclocking too far with Ryzen Master and had to clear the CMOS.
The fact that memory bandwidth scales with CPU clocks in the 2933MHz results is way weird. At stock you get very low BW and for the 2 overclocks, the BW is too high. 3200MHz wouldn't explain this so it has to be something else.
EDIT: Any changes in 4k writes perf with the new AGESA? Would be nice to see some fixes in the SSD area too.
I've got a question regarding the PCIe implementation...
I've been looking at the x370/b350 motherboards and I understand that the 'higher end' boards have two x16 slots that run x8/x8 when both are populated. Both of those pipe straight into the CPU from what I can gather.
What I don't understand though is why all of the boards with legacy PCI (as well as some others) have a single x16 to the CPU, and then for the second x16 it'll max out at x4 but drop to x1 if the PCIe x1 slot(s) are occupied. It just seems like the two should be ultimately unrelated to each other since the legacy PCI slots by definition hang off of the chipset, not the CPU. Is this just a coincidence or?
PS: I have a legacy PCI card that I want to keep one way or the other (because if I don't I'm likely looking at a $800-$1,500 additional cost for pro-audio), and all the PCI cards have this odd 'coincidence'. Unfortunately I'd also like the option of running two GPUs where one runs video editing processing and the other powers monitors, i.e. x8/x8 off of the CPU.... Not finding mobos for that combination yet... And this is what I have read and thought I understood:
![]()
Yeah, that's what I figured more or less. I just find it curious that there aren't any boards with legacy PCI and PCI-e 3.0 x8/x8.... Oh well, I may have to go with an adapter board instead.
Actually, I've seen this with every scaling test I've done - Ryzen memory performance goes up with CPU clocks. This isn't really unexpected when you consider all of the logic and cache running at core clocks. The DF interface, for example, seems to run at the L3 speed, whereas the DF itself runs at memory clocks.
Running at DDR4-3200 would explain it pretty well, I think. But a funky PLL might do so as well.
More testing to do. I've mostly focused on NVMe benchmarks. 4k-64Thrd results seem low, but everything else is higher than on Intel. Real world testing (copy tests, for example) show higher performance than Intel and latency is lower.
Most of my SSDs are well traveled, so they don't all have their full new write speeds in any event... which can be a major factor to consider. My 850 Evo, however, still does - and performs rather similar on Ryzen as it did with my Sandy Bridge, with an edge for Ryzen overall - especially in 4K-64Thrd write results, which are doubled.
I'll try to do it in a couple of hours, I'll be natively running Linux Mint 18.1 while trying to figure out what I screwed up in my CCX benchmark utility - it will be a nice reprieve 😛
UPDATE:
Results
Ryzen 7 1700X @ 3.9GHz, 16GB (2x8GB) DDR4-2933 16-16-16-38 1T.
Linux Mint 18.1, kernel 4.4.0-53-generic
Your results with the DRAM at 2666MHz do not scale and not quite sure it would make any sense for memory BW to scale with core clocks. Some impact on latency maybe but not on BW. Not aware of any evidence to back your claim either, quite the opposite.
The scaling is better than linear in your 2933 results. 42GB/s at 3.5GHz, 47GB/s at 3.8GHz and 49 GB/s at 3.9GHz.
On SSDs i was just asking if the new AGESA changed anything.
Bandwidth is a function of not just memory speed and the IMC, but of the CPU being able to process the data. I had higher bandwidth with core overclocks on Sandy Bridge, but that was rather minimal by comparison... but Sandy Bridge didn't rely on the same type of data mesh operating in multiple clock domains (IMC -> DF -> CCX fabric -> CORE).
The Ryzen 5 1400 results with DDR4-2400 CL14 on another computer entirely show scaling as well, but not as extreme.
Also, be careful to ensure you are looking at the same core counts in my results.
DDR4-2667 16-16-16-38 CR1
8x 3.0Ghz: 31501
8x 3.5Ghz: 36055
8x 3.9Ghz: 40639
It's important, too, to note that benchmarks that are sensitive to memory bandwidth also improve slightly more than just the CPU clocks would suggest - so it is a real effect.
Sorry but you are choosing the results that fit your theory, a theory that doesn't make sense and is not supported by any 3rd party data.
You make 2 very risky assumptions
- the DDR was at 3200 not at 2933 as displayed by AIDA
- the memory BW has extreme scaling with core clocks
Why do that before making sure it's not the obvious suspect, software.
I do get the core saturation point and could be interesting to look at that.
Maybe try Geekbench 3 , the memory tests?
I'm not making any assumptions, you are comparing results without taking into consideration the control results - they all show the same pattern of increased bandwidth with core clocks. And it's not just the one test - it's everything, including Geekbench 3 - though I'm not yet back to the DDR4-2933 testing.
Very few people do as much testing as I am doing or as many controls as I do.
Does this mean that there is still native PCI available on AMD chipsets? I assumed they had to use PCIe-PCI bridges just like every current Intel board?PCI will be coming from the PCH.
The results you have shared don't quite support the claim and no 3rd party data supports the scaling claim. The extreme scaling is rather absurd too.
Based on these results he core is borderline saturated at 3.5GHz-2666 and 3.8-3.9GHz assumed 3200MHz with something else providing more than 10% scaling for memory BW
3.5-2660=40.6
3.5-3200?=41.7
3.8-3200?=46.5
3.9-3200?=49
Less than 10% increase in core clocks between the 2 saturation points and 20% gains in read (real and theoretical).
That's too much and on top of that you have 2933MHz reported instead of 3200 and maybe a flaw in the benchmark that saturates the core far too early with w/e workload is used.
This seems to be a bug with AIDA. I didn't recognize how large the climbing was, despite writing the numbers, LOL! Guess that can happen when running 1,000 different numbers through your head.
Geekbench 3 shows 2% scaling on the memory subtests with AGESA 1.0.0.4 using the 8C/16T configuration and the Ryzen 5 1400. However, the former AGESA showed 14%... between 3Ghz and 3.4Ghz with the 4C/4T configuration, which lines up pretty well with AIDA results.
That is far too much for CPU clocks to explain, as you said. No idea why, but my brain was thinking we were talking about 10% total, not 30%.
GB ST Memory, 4C / 4T, DDR4-2667 CL15, ASRock AB350 BIOS 1.41 Windows 7 (Windows 10)
3.0Ghz: 3500 (3635)
3.4GHz: 3997 (4225)
3.8GHz: 3953 (4315)
GB ST Memory, 8C/16T, DDR4-2667 CL14, C6H BIOS 0083 Windows 10
3.0Ghz: 3608
3.4Ghz: 3654
3.8GHz: 3688
Only 2% total (though I didn't restart with these new tests, contrary to the former tests). HPET always active.
I will play around again with DDR4-2933 tomorrow and test higher speeds on the Ryzen 5 1400 as well. Something weird is going on, that's for sure.
Oh well, I may have to go with an adapter board instead.
Does this mean that there is still native PCI available on AMD chipsets? I assumed they had to use PCIe-PCI bridges just like every current Intel board?
A big thanks for the incredible amount of effort you put into this work.http://zen.looncraz.net/ now online.
I don't have time for the next few weeks, maybe as much as a month, to finish more than what has been done.
At that point in time I should also have more Intel results as I will be upgrading an Ivy Bridge Xeon system to Ryzen and will have the parts on consignment for a short while (long enough to run a series of benchmarks at 3Ghz with and without Hyper-threading).
Thanks. I just had a case where I closed and reopened the browser and found the main text content empty (Looncache enabled). That was the "X Threads" page. I then clicked on the "1 Thread" page and it was empty, too. Only once I clicked the "Home" page I could access the other pages again.http://zen.looncraz.net/ now online.
man.... you are a big balancing force to juanrga in the universehttp://zen.looncraz.net/ now online.
I don't have time for the next few weeks, maybe as much as a month, to finish more than what has been done.
At that point in time I should also have more Intel results as I will be upgrading an Ivy Bridge Xeon system to Ryzen and will have the parts on consignment for a short while (long enough to run a series of benchmarks at 3Ghz with and without Hyper-threading).