Leo DirectX forward plus rendering lighting

piesquared · Aug 14, 2012

Good video of one of the cutting edge tech innovations inside AMD's GCN architecture.

http://www.youtube.com/watch?v=s2y7e3Zm1xc&feature=youtu.be

BFG10K · Aug 15, 2012

This needs more fanfare. It allows the performance benefit of deferred rendering while retaining full compatibility with hardware MSAA. If game engines used this, we wouldnt need post-filtering AA.

thilanliyan · Aug 15, 2012

Would everything work properly with nvidia cards?

If some of these features only work on AMD cards,I think devs would be less likely to use it.

Dark Shroud · Aug 15, 2012

thilanliyan said:
Would everything work properly with nvidia cards?

If some of these features only work on AMD cards,I think devs would be less likely to use it.

The main part uses the DX11 API for Direct Compute so that should work on Nvidia hardware. How well it work on the GTX 600 series is a different matter.

Arkadrel · Aug 15, 2012

From Revdarian @hardforums:

Well, there are two main methods to render 3d games. Forward Rendering was the original, old school one; Deferred Rendering was the new kid on the block. Each one had it's pros and cons, and the main pro of the deferred rendering was that it had a much lesser performance hit when dealing with multiple light sources and was a relatively simple method; because of that Deferred Rendering became eventually the "go to" method for most important rendering engines.

The problem is that D.R. brings a list of cons in the form of heavier performance hit when handling multiple materials, and because it usually discards the geometry data it can't really apply proper MultiSampling antialiasing. The solution for 1) chosen by most devs was "ok, we won't use multiple materials! i mean, texturing alone is good enough, right?" the solution for 2) was "we will create a special buffer, let's call it G buffer, and we will store geometry data and some other useful things there!". Problem is that the G buffer eats memory like nothing, and if you try to use different materials, each material makes the G buffer even bigger (on top of the performance hit).

At AMD this D.R. became a royal pain in the ass, since their gpu designs were made with proper MSAA in mind, and thus this workaround has made them take significantly bigger performance hits than on Nvidia's hardware, and thus they have been hard at work until they got this solution.

The solution they found is to run a proper compute shader to apply the lighting to the Forward Rendered image, instead of the usual way of "Render everything 1 time for each light source in the scene!" This way they save a great deal of passes, they save on memory by not needing the G buffer (the geometry is always present on a Forward Renderer, instead of discarded), they can use the proper MSAA included on their original gpu design, and multiple materials can be used without the big performance & memory hit of the DR, all it takes is compute time for the new shader.

So, this demo to a game dev should be interesting in the form of the performance achieved with a FR engine while dealing with multiple light sources, the lack of a noticeable performance and memory useage hit due to the multiple materials & the use of "proper" MSAA.

----------------Deferred Shadeing (http://en.wikipedia.org/wiki/Deferred_shading):
Pro's:
1) the decoupling of scene geometry from lighting. Only one geometry pass is required and each light is only computed for those pixels that it actually affects. This gives the ability to render many lights in scene without significant performance-hit.

con's:
1) inability to handle transparency within the algorithm, although this problem is a generic one in Z-buffered scenes and it tends to be handled by delaying and sorting the rendering of transparent portions of the scene. "Depth peeling" can be used to achieve order-independent transparency in deferred rendering, but at the cost of additional batches and g-buffer size. Modern hardware, supporting DirectX 10 and later, is often capable of performing batches fast enough to maintain interactive frame rates.

2) difficulty with using multiple materials. It's possible to use many different materials, but it requires more data to be stored in the G-buffer, which is already quite large and eats up a large amount of the memory bandwidth.

3) due to separating the lighting stage from the geometric stage, hardware anti-aliasing does not produce correct results any more. One of the usual techniques to overcome this limitation is using edge detection on the final image and then applying blur over the edges, however recently more advanced post-process edge-smoothing techniques have been developed, such as MLAA, FXAA, SRAA, DLAA, post MSAA.

It might be time for a comeback for Forwards rendering.
If it means much higher performance, while requireing less memory bandwidth than Deferred shadeing does.

Memory bandwidth / Memory size is always going to be a issue, and its only going to grow.
Its easier to throw "compute shader" (gpgpu) at a problem, than it is to just magically give extra memory bandwidth / more memory.

This is esp true for mobile devices, like laptops that still come with only 1x DDR3 stick of memory and make use of IGPs.
Since people are preaching about "tablets/nettops/smart phones" being the future, it seems like its time for a Forwards rendering comeback (maybe?).

BFG10K · Aug 15, 2012

thilanliyan said:
Would everything work properly with nvidia cards?

Yes, absolutely, there's nothing AMD specific about it.

It should work on any DX11 compliant part, and it should be portable to OpenGL/OpenCL as well.

Red Hawk · Aug 15, 2012

Great demo, very informative. I hope these techniques catch on, but I fear they won't until the next console generation. Deferred rendering is the king of console graphics engines right now, and rebuilding a game's rendering engine from deferred to forward+ just for a PC port would take a lot of development resources. We might get it from a PC-exclusive strategy game or an AMD Gaming Evolved title that AMD really, really pushes for.

I do wonder though if any games already have a form of this rendering tech. Total War: Shogun 2 only supports MSAA in its DirectX 11 renderer, and it's both a Gaming Evolved title and a PC exclusive strategy game. Perhaps AMD helped to implement an early form of this tech there.

piesquared · Aug 15, 2012

Red Hawk said:
Great demo, very informative. I hope these techniques catch on, but I fear they won't until the next console generation. Deferred rendering is the king of console graphics engines right now, and rebuilding a game's rendering engine from deferred to forward+ just for a PC port would take a lot of development resources. We might get it from a PC-exclusive strategy game or an AMD Gaming Evolved title that AMD really, really pushes for.

I do wonder though if any games already have a form of this rendering tech. Total War: Shogun 2 only supports MSAA in its DirectX 11 renderer, and it's both a Gaming Evolved title and a PC exclusive strategy game. Perhaps AMD helped to implement an early form of this tech there.

There are a few gaming evolved titles out that use forward+ rendering techniques like Dirt Showdown, Sniper Elite. There is a list on AMD's website.

Red Hawk · Aug 15, 2012

piesquared said:
There are a few gaming evolved titles out that use forward+ rendering techniques like Dirt Showdown, Sniper Elite. There is a list on AMD's website.

Where? I can't find it on the website.

SomeoneSimple · Aug 15, 2012

piesquared said:
There are a few gaming evolved titles out that use forward+ rendering techniques like Dirt Showdown, Sniper Elite. There is a list on AMD's website.

Sniper Elite V2 uses Unreal Engine 3, that's about as deferred as they come.

You're correct on Dirt Showdown though.

piesquared · Aug 15, 2012

SomeoneSimple said:
Sniper Elite V2 uses Unreal Engine 3, that's about as deferred as they come.

You're correct on Dirt Showdown though.

Ah yes, thanks. Here's another blog with some more detail on forward+ in Showdown.

http://blogs.amd.com/play/2012/06/20/dirt-showdown-on-gcn/

PrincessFrosty · Aug 15, 2012

Just seeing if I have this right, with the increased capacity for additional lighting, one possible real world use is to sample multiple points in the scene where a spotlight is landing and then using the colour there to create new light sources back into the scene creating the subtle hue effect.

That's actually really cool, hopefully we'll see some usable tech demos of this that work on both Nvidia and AMD cards, I'd love to see this in real time.

*edit*

Oh apparently v1.1 works on Nvidia hardware...downloading now and will test tonight.

http://developer.amd.com/samples/demos/pages/AMDRadeonHD7900SeriesGraphicsReal-TimeDemos.aspx

djsb · Aug 15, 2012

It works on any DX11 capable cards. I remember it looking good (but predictably running like poo) on the Radeon 6850 I had in my machine a few months ago. I admit I'm slightly disappointed at the educational mode in 1.1. I thought it would be something where you could play with dynamic lights rather than it just being a slideshow.

thilanliyan · Aug 15, 2012

Dark Shroud said:
The main part uses the DX11 API for Direct Compute so that should work on Nvidia hardware. How well it work on the GTX 600 series is a different matter.

That's good. Hopefully more devs start using it.

HOWEVER, if it is even a bit slower on nVidia cards, they will not be behind it I'm guessing, making it less likely to take off.

Red Hawk · Aug 15, 2012

thilanliyan said:
That's good. Hopefully more devs start using it.

HOWEVER, if it is even a bit slower on nVidia cards, they will not be behind it I'm guessing, making it less likely to take off.

Ah come on now, vendor-agnostic technology is good for everyone. *glares at PhysX :colbert:

*

Pottuvoi · Aug 15, 2012

There has been advancement on Forward+ techniques after the Leo demo was released.
One of the big advances is the tiling of light sources, new methods tile in depth as well as X&Y dimensions.
http://www.cse.chalmers.se/~olaolss/main_frame.php?contents=publication&id=clustered_shading

This allows better utilization of power in hard cases. (when tile has multiple depth ranges, this is easily visible in the Leo demonstration. (edge tiles have huge amounts of lights.)

One thing that forward+ tech makes very easy when compared to deferred renderer is lighting of transparent surfaces.
http://www.cse.chalmers.se/~olaolss/main_frame.php?contents=publication&id=tiled_clustered_forward_talk

Other Forward+ links
http://aras-p.info/blog/2012/03/27/tiled-forward-shading-links/

SomeoneSimple said:
Sniper Elite V2 uses Unreal Engine 3, that's about as deferred as they come.

UE3 is forward renderer with deferred shadows and post processing.

SirPauly · Aug 15, 2012

piesquared said:
There are a few gaming evolved titles out that use forward+ rendering techniques like Dirt Showdown,.

Indeed and very welcomed --- it's good to see strong competition!

SomeoneSimple · Aug 16, 2012

Pottuvoi said:
UE3 is forward renderer with deferred shadows and post processing.

Ah, I see. Later revisions of UE3 do support deferred shading though.

Pottuvoi · Aug 16, 2012

SomeoneSimple said:
Ah, I see. Later revisions of UE3 do support deferred shading though.

Yes, as you said it's late addition and I'm not sure if any game actually uses it yet.

Phanuel · Aug 16, 2012

Red Hawk said:
Ah come on now, vendor-agnostic technology is good for everyone. *glares at PhysX *

And this is exactly what I was thinking of, and why I hate PhysX so much and hope it remains marginalized.

Olikan · Aug 16, 2012

Phanuel said:
And this is exactly what I was thinking of, and why I hate PhysX so much and hope it remains marginalized.

Forward+ is not locked like PhysX...it's just kepler that sucks at GPGPU...

i won't be surprised if fermi goes well here...or the future maxwell

it's pretty much the same thing with tesselation last year

piesquared · Aug 16, 2012

There's more examples in Sleeping Dogs of how AMD and game developers are using GCN's compute power to produce cutting edge effects. The lack of compute power in Kepler is taking it's toll in a big way on the architecture.

http://blogs.amd.com/play/2012/08/16/sleeping-dogs-gaming-evolved-and-you/

Kepler being better at gaming is such a myth it's laughable. It may produce a couple extra percent in a select few games, but it loses more than it wins, and by a much wider margins. Not sure if it's just NV's bad drivers, but then NV's drivers are unfallable i've heard so it must be the architecture. Realistically, it's a combination of both but much more problematic on the architecture side. Kepler just doesn't have the grunt to produce these leading edge effects. Now that developers are starting to exploit all the amazing cababilities inside GCN, Kepler will fall further and further behind. There is simply no choice, Kepler doesn't have close to the compute power packed inside Tahiti and the rest of the GCN family. It's quite a chip, reviewers don't give near enough credit to what AMD has been doing in advancing gaming.

To me, Kepler seems like a chip with yesterdays features at todays performance. While GCN is a chip with today and tomorrow's features at the same or better performance level than the competition.

Grooveriding · Aug 16, 2012

I just picked up Sleeping Dogs and it is kicking my setup's ass. Trying to max it out gives me 20FPS...

Silverforce11 · Aug 17, 2012

Wow i never heard of this game til now.. SquareEnix making an adult game? Nice, i'm in.

RussianSensation · Aug 17, 2012

Silverforce11 said:
Wow i never heard of this game til now.. SquareEnix making an adult game? Nice, i'm in.

The voice over cast is stacked. Kelly Hu sexy time.

Leo DirectX forward plus rendering lighting

Golden Member

Lifer

Lifer

Golden Member

Diamond Member

Lifer

Diamond Member

Golden Member

Diamond Member

Member

Golden Member

Platinum Member

Member

Lifer

Diamond Member

Senior member

Diamond Member

Member

Senior member

Platinum Member

Platinum Member

Golden Member

Diamond Member

Lifer

Elite Member