DX12 Multi-GPU is a challenge


Headfoot

Diamond Member
Feb 28, 2008
4,444
641
126
Ultimate perceived performance is not defined by frame rates. Frame rates are an objectively measurable stand-in for performance. Frame rate measures throughput, not ultimate performance; frame time measures latency. As we recently found out, the two don't always line up, and both are useful data. Higher-throughput GPUs usually have lower latency, but not always.
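
To put rough numbers on the throughput-vs-latency distinction (a toy illustration of mine, not data from any review):

```cpp
// Why average fps (throughput) can hide a latency problem: nine smooth
// frames plus one 50 ms hitch still averages out to a healthy-looking fps.
#include <algorithm>
#include <cstdio>
#include <vector>

int main() {
    // Hypothetical per-frame render times in milliseconds.
    std::vector<double> frame_ms = {10, 10, 10, 10, 10, 10, 10, 10, 10, 50};

    double total = 0.0;
    for (double t : frame_ms) total += t;

    double avg_fps  = 1000.0 * frame_ms.size() / total;                    // throughput
    double worst_ms = *std::max_element(frame_ms.begin(), frame_ms.end()); // latency

    std::printf("average fps : %.1f\n", avg_fps);     // ~71 fps looks fine...
    std::printf("worst frame : %.1f ms\n", worst_ms); // ...but one frame took 50 ms (20 fps)
}
```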

Performance is how good the graphics look subjectively to the player. A G-Sync/FreeSync monitor plus a lower-throughput GPU may deliver better subjective performance than a faster-throughput GPU on a fixed-Hz monitor.

SFR is exactly the same way. Multi-GPU methods that increase throughput usually decrease latency too, but SFR decreases latency more while increasing throughput less. It may deliver less throughput, yet better perceived performance, which is ultimately what we're trying to quantify: the overall quality of the gaming experience. Don't confuse what's subjectively better (visual performance) with the best objectively measurable stand-in we use to quantify it (frames per second).

TL;DR: In graphics if it looks better, it is better.
 

thesmokingman

Platinum Member
May 6, 2010
2,302
231
106
Ultimate perceived performance is not defined by frame rates. Frame rates are an objectively measurable stand-in for performance. Frame rate measures throughput, not ultimate performance.

Performance is how good the graphics look subjectively to the player. A G-Sync/FreeSync monitor plus a lower-throughput GPU may deliver better subjective performance than a faster-throughput GPU on a fixed-Hz monitor.

SFR is exactly the same way. It may deliver less throughput, yet better perceived performance, which is ultimately what we're trying to quantify: the overall quality of the gaming experience.


I'd also add that not all games lend themselves to high fps, especially non-twitch games like RPGs. This is especially relevant in large games at high resolutions.
 

Headfoot

Diamond Member
Feb 28, 2008
4,444
641
126
DX12 multi-GPU is explicit and coded by the programmer.

It may be that AMD and nVidia can provide a fall-back after-the-fact SLI and CF mode as well.

I don't think the two are mutually exclusive.
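
Roughly, the "standard checks" for GPUs in the system look like this in DX12 (a minimal sketch of mine, error handling mostly omitted; not code from any shipping engine):

```cpp
// Enumerate adapters with DXGI, create a DX12 device on each hardware adapter,
// and ask the device how many GPU nodes (physical GPUs behind one device) it has.
#include <cstdio>
#include <d3d12.h>
#include <dxgi1_4.h>
#include <wrl/client.h>

using Microsoft::WRL::ComPtr;

int main() {
    ComPtr<IDXGIFactory4> factory;
    if (FAILED(CreateDXGIFactory1(IID_PPV_ARGS(&factory)))) return 1;

    ComPtr<IDXGIAdapter1> adapter;
    for (UINT i = 0; factory->EnumAdapters1(i, &adapter) != DXGI_ERROR_NOT_FOUND; ++i) {
        DXGI_ADAPTER_DESC1 desc;
        adapter->GetDesc1(&desc);
        if (desc.Flags & DXGI_ADAPTER_FLAG_SOFTWARE) continue; // skip the WARP software adapter

        ComPtr<ID3D12Device> device;
        if (SUCCEEDED(D3D12CreateDevice(adapter.Get(), D3D_FEATURE_LEVEL_11_0,
                                        IID_PPV_ARGS(&device)))) {
            // GetNodeCount() > 1 means this one device spans multiple physical
            // GPUs (linked-adapter mode, i.e. an SLI/CF-style configuration).
            std::printf("adapter %u: %ls, nodes = %u\n",
                        i, desc.Description, device->GetNodeCount());
        }
    }
    return 0;
}
```

Separate dGPU + iGPU setups show up here as separate adapters instead, each with its own device.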
 

Headfoot

Diamond Member
Feb 28, 2008
4,444
641
126
I'll also add that SFR is the broadest possible moniker.

I'm sure we'll see an absolute plethora of SFR methods. Just as there are many different anti-aliasing methods, there will be many different SFR methods, some better than others.
 

tential

Diamond Member
May 13, 2008
7,348
642
121
I'd also add that not all games lend themselves to high fps, especially non-twitch games like RPGs. This is especially relevant in large games at high resolutions.
Then in these games I'm even happier to get a boost from 40 fps minimums to 60 fps minimums than to go from 90 fps average to 180 fps average with CF/SLI.
 

ShintaiDK

Lifer
Apr 22, 2012
20,378
145
106
One thing is the optimization for different architectures, but will it be easier to make code that splits the workload between GPUs independently of the hardware? (Maybe not perfect scaling, but code that crudely splits the workload and gives a minimum of, say, 50% scaling, and 80%+ when optimized.)

Have we seen any multi-GPU setups with different vendors where it wasn't something entirely different from SLI/CF? For example, nVidia with an Intel iGPU doing post-processing?
 

tential

Diamond Member
May 13, 2008
7,348
642
121
I'm so glad I'm not in the mGPU camp anymore. Some devs can't even get v-sync right (really, v-sync on caps me to 30 FPS?).

Leaving it in their hands to create working profiles for x-product lines...naaaaah. Single card for me.

Good luck to you mGPU users.
I'm about to be in this camp as soon as my monitor is available stateside. I see it selling in Korea for 1,500 USD.

Tbh, if a game doesn't support CF/SLI, I won't purchase it until a single-GPU setup can run it at acceptable settings.

Really, games don't exist to me anymore until I can run them at 1800p+ resolution with acceptable cost and framerates; otherwise I'll just wait for a GPU that can do it. The Witcher 3 will probably take an Arctic Islands dual-GPU card, and if that can't handle it then 2017 is when I'll play it. I've got so many games to play that I couldn't care less whether CF/SLI is supported; it just means I'll be waiting. Between the Zelda games on the Dolphin emulator and the Wii U exclusives alone, I'm occupied until mid-2016.
 

biostud

Lifer
Feb 27, 2003
19,744
6,826
136
Have we seen any multi-GPU setups with different vendors where it wasn't something entirely different from SLI/CF? For example, nVidia with an Intel iGPU doing post-processing?

I was more wondering whether the developers' "crude" code for splitting the work between multiple similar graphics cards (normal SLI/CF) would be the same for nVidia and AMD. So if they made an SFR code path, it would work with both AMD and nVidia GPUs right from the start.

Then an optimized code path could be added for specific vendors or for iGPU+dGPU setups, where different GPUs handle different parts of the rendering/compute pipeline. This would of course require a lot more work.
 

Headfoot

Diamond Member
Feb 28, 2008
4,444
641
126
I was more wondering whether the developers' "crude" code for splitting the work between multiple similar graphics cards (normal SLI/CF) would be the same for nVidia and AMD. So if they made an SFR code path, it would work with both AMD and nVidia GPUs right from the start.

Then an optimized code path could be added for specific vendors or for iGPU+dGPU setups, where different GPUs handle different parts of the rendering/compute pipeline. This would of course require a lot more work.

If it's in the game engine code, it should be able to target whatever DX12-compliant GPU is present. The reason AMD and nVidia cards don't work together is mostly political, not technical. In a kumbaya kind of world they could share code and have their drivers work together, or even ship a single unified driver, but that's not how the real world works, of course. Moving multi-GPU to the game developer puts them in a more neutral position, where they can do these things if they think it's worth the time and effort.

Crude AFR-style code would certainly be possible. At the point in the program where you start a frame, you could just round-robin each frame between the GPUs without any additional intelligence, just like CF/SLI except at the engine level instead of the driver level. And I'm sure we'll see a lot of implementations that work this way.
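
Something like this minimal sketch (my own illustration, assuming a linked-node DX12 device; no synchronization or present handling shown):

```cpp
// Engine-level round-robin AFR: each new frame is handed to the next GPU node,
// using one direct command queue per node selected via the NodeMask.
#include <cstdint>
#include <vector>
#include <d3d12.h>
#include <wrl/client.h>

using Microsoft::WRL::ComPtr;

class RoundRobinAfr {
public:
    explicit RoundRobinAfr(ID3D12Device* device)
        : nodeCount_(device->GetNodeCount()) {
        // One queue per GPU node; frame N is recorded and executed on node N % nodeCount_.
        for (UINT node = 0; node < nodeCount_; ++node) {
            D3D12_COMMAND_QUEUE_DESC desc = {};
            desc.Type = D3D12_COMMAND_LIST_TYPE_DIRECT;
            desc.NodeMask = 1u << node;              // bit mask picks the physical GPU
            ComPtr<ID3D12CommandQueue> queue;
            device->CreateCommandQueue(&desc, IID_PPV_ARGS(&queue));
            queues_.push_back(queue);
        }
    }

    // Call once per frame: returns the queue (and node mask) this frame should use.
    ID3D12CommandQueue* BeginFrame(UINT* nodeMaskOut) {
        UINT node = static_cast<UINT>(frameIndex_++ % nodeCount_);
        *nodeMaskOut = 1u << node;
        return queues_[node].Get();
    }

private:
    UINT nodeCount_;
    uint64_t frameIndex_ = 0;
    std::vector<ComPtr<ID3D12CommandQueue>> queues_;
};
```

The real work is everything this leaves out: per-node allocators and render targets, and the fences that keep the nodes from trampling each other.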

Very exciting stuff. Most developers won't do anything fancy, just like today, but a few of the big-name ones will do really cool things.
 

dogen1

Senior member
Oct 14, 2014
739
40
91
I'm so glad I'm not in the mGPU camp anymore. Some devs can't even get v-sync right (really, v-sync on caps me to 30 FPS?).

Leaving it in their hands to create working profiles for x-product lines...naaaaah. Single card for me.

Good luck to you mGPU users.

SLI/CrossFire support is handled by AMD and nVidia, not the game developers.
 

3DVagabond

Lifer
Aug 10, 2009
11,951
204
106
I'm so glad I'm not in the mGPU camp anymore. Some devs can't even get v-sync right (really, v-sync on caps me to 30 FPS?).

Leaving it in their hands to create working profiles for x-product lines...naaaaah. Single card for me.

Good luck to you mGPU users.

According to Stardock, you just have to make multi-GPU work; it's not brand-specific anymore. We'll see, I guess. But it's not supposed to work the way you describe.
 

stuff_me_good

Senior member
Nov 2, 2013
206
35
91
What happened to the magical SLI bridge soldered to the motherboard that we had news about many years ago? I only remember it making SFR multi-GPU rendering scale over 90% or something like that, by rendering frames like tiled resources.

Does anyone remember?
 

belmonkey

Junior Member
Sep 30, 2015
2
0
0
In the case of multi-GPU being a dGPU + iGPU, there's probably going to be a limit to how much the iGPU is allowed to do, right? If the iGPU just handles something like post-processing, would something much stronger than a typical Intel HD iGPU make much of a difference? I'm really wondering if one of those cheap Kaveri APUs like the A8-7600, with 384 shaders and 8 ACEs, is going to do much in such a scenario.
 

ShintaiDK

Lifer
Apr 22, 2012
20,378
145
106
In the case of multi-GPU being a dGPU + iGPU, there's probably going to be a limit to how much the iGPU is allowed to do, right? If the iGPU just handles something like post-processing, would something much stronger than a typical Intel HD iGPU make much of a difference? I'm really wondering if one of those cheap Kaveri APUs like the A8-7600, with 384 shaders and 8 ACEs, is going to do much in such a scenario.

Note the frame lag as well.

[Image: microsoft-dx12-build15-ue4frame.png]
 

belmonkey

Junior Member
Sep 30, 2015
2
0
0
Note the frame lag as well.

[Image: microsoft-dx12-build15-ue4frame.png]

Which part am I supposed to be looking at? The way it's labeled, it looks like the iGPU is a frame behind, or also like the iGPU has downtime after its frame while the main GPU is still working on its frame. I don't know much about this stuff.
 

tential

Diamond Member
May 13, 2008
7,348
642
121
What happened to the magical SLI bridge soldered to the motherboard that we had news about many years ago? I only remember it making SFR multi-GPU rendering scale over 90% or something like that, by rendering frames like tiled resources.

Does anyone remember?
Lucid Hydra, or whatever; it's already been mentioned in the thread. As with every single mGPU dream, it died.
 

ShintaiDK

Lifer
Apr 22, 2012
20,378
145
106
Which part am I supposed to be looking at? The way it's labeled, it looks like the iGPU is a frame behind, or also like the iGPU has downtime after its frame while the main GPU is still working on its frame. I don't know much about this stuff.

There is a slight benefit in raw FPS numbers in this case (~10%). But you also get input lag that is almost a frame behind (+20ms or so). That may make it unattractive for many FPS games.
 

Noctifer616

Senior member
Nov 5, 2013
380
0
76
There is a slight benefit in raw FPS numbers in this case (~10%). But you also get input lag that is almost a frame behind (+20ms or so). That may make it unattractive for many FPS games.

If it's a single frame then the latency depends on the FPS. At 60 FPS it's 16.6 ms; at higher FPS it would be lower.
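
To spell the arithmetic out (my own quick illustration):

```cpp
// One extra frame in flight adds roughly one frame time of latency,
// so the penalty shrinks as the frame rate rises.
#include <cstdio>

int main() {
    const double fps_values[] = {30.0, 60.0, 120.0};
    for (double fps : fps_values) {
        double added_ms = 1000.0 / fps;  // milliseconds per frame
        std::printf("%6.0f fps -> ~%.1f ms of added latency\n", fps, added_ms);
    }
}
```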
 

ShintaiDK

Lifer
Apr 22, 2012
20,378
145
106
If it's a single frame then the latency depends on the FPS. At 60 FPS it's 16.6 ms; at higher FPS it would be lower.

Don't confuse it with screen FPS. This is rendering latency: the time from the start of rendering until the image reaches the screen is the added latency.

Essentially no different than playing on a 10 ms input-lag screen vs. a 30 ms input-lag screen.
 

biostud

Lifer
Feb 27, 2003
19,744
6,826
136
From nvidia:
https://developer.nvidia.com/dx12-dos-and-donts

Multi GPU

Do's:
- Use the DX12 standard checks to find out how many GPUs are in your system
  - No need to use vendor-specific APIs anymore
  - Make sure to check the CROSS_NODE_SHARING tier
- Take full control over which surface syncs need to happen and which don't
- Make full use of the explicit control over resources
  - Create resources that need to be synchronized on each node
    - Use the proper CreationNodeMask
    - Make them visible on other nodes that need access
    - Copy them to the current node when needed
- Minimize the number of necessary syncs
- If the device supports tier 2 cross-node sharing
  - Check to see if RTVs, DSVs and UAVs work as fast as expected
  - Always compare performance to a tier 1 type implementation

Don'ts:
- Don't try to benefit from implicit MGPU scaling
- Don't rely on any surface syncs to be done automatically (implicitly behind your back)
  - You should take full control over what syncs happen if you need them
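
To make the CreationNodeMask/visibility advice concrete, here's a rough sketch (my own, not NVIDIA sample code) that assumes a two-node linked adapter, with error handling omitted:

```cpp
// Create a texture whose memory lives on GPU node 0 (CreationNodeMask = 0x1)
// but which GPU node 1 is also allowed to access (VisibleNodeMask = 0x3).
#include <d3d12.h>
#include <wrl/client.h>

using Microsoft::WRL::ComPtr;

ComPtr<ID3D12Resource> CreateCrossNodeTexture(ID3D12Device* device,
                                              UINT width, UINT height) {
    D3D12_HEAP_PROPERTIES heapProps = {};
    heapProps.Type             = D3D12_HEAP_TYPE_DEFAULT;
    heapProps.CreationNodeMask = 0x1;  // allocate on node 0
    heapProps.VisibleNodeMask  = 0x3;  // nodes 0 and 1 may access it

    D3D12_RESOURCE_DESC desc = {};
    desc.Dimension        = D3D12_RESOURCE_DIMENSION_TEXTURE2D;
    desc.Width            = width;
    desc.Height           = height;
    desc.DepthOrArraySize = 1;
    desc.MipLevels        = 1;
    desc.Format           = DXGI_FORMAT_R8G8B8A8_UNORM;
    desc.SampleDesc.Count = 1;

    ComPtr<ID3D12Resource> texture;
    device->CreateCommittedResource(&heapProps, D3D12_HEAP_FLAG_NONE, &desc,
                                    D3D12_RESOURCE_STATE_COPY_DEST, nullptr,
                                    IID_PPV_ARGS(&texture));
    return texture;
}
```

Whether reading across nodes like this is fast enough, or whether you should copy to a node-local resource instead, is exactly the tier 1 vs. tier 2 comparison the list above recommends measuring.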
 

thesmokingman

Platinum Member
May 6, 2010
2,302
231
106
^^Exactly. That poster comes across like a contrarian, arguing just to argue. AFR or SFR, it doesn't really matter. What matters is that developers now have another way to run SLI/CFX, and they can choose the method that best suits their needs.