
Discussion General GPU µArch Research Thread

Another paper I found on out-of-order execution on GPUs:

TURBULENCE: Complexity-Effective Out-of-Order Execution on GPU With Distance-Based ISA


 

Hardware Acceleration of Neural Graphics



Proposes a neural graphics ASIC engine called the NGPC, or Neural Graphics Processing Cluster. It consists of Neural Field Processors (NFPs), each made up of two fully fused stages:
#1 hashgrid encoders (Encoding Engines)
#2 specialized MLP calculations (MLP Engine)
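
For anyone not familiar with the two stages, here is a rough software sketch of the same dataflow: a multiresolution hash-grid lookup with bilinear interpolation, followed by a tiny MLP. It is not the paper's hardware design and says nothing about how the fusion is done in silicon. The 2D input, table sizes, layer widths, random weights, and all function names are my own assumptions for illustration; only the hash multipliers follow the spatial hash used in Instant-NGP.

```python
import math
import random

# Per-dimension multipliers of the Instant-NGP spatial hash (first two dims only).
PRIMES = (1, 2654435761)


def hash_index(ix, iy, table_size):
    """Map an integer grid corner to a slot in a fixed-size feature table."""
    return ((ix * PRIMES[0]) ^ (iy * PRIMES[1])) % table_size


def make_tables(levels, table_size, feat_dim, rng):
    """One feature table per resolution level (random here, trained in practice)."""
    return [[[rng.uniform(-0.1, 0.1) for _ in range(feat_dim)]
             for _ in range(table_size)] for _ in range(levels)]


def encode(x, y, tables, base_res=4, growth=2.0):
    """Stage 1 (encoding): bilinearly interpolated features from every level."""
    features = []
    for level, table in enumerate(tables):
        res = int(base_res * growth ** level)      # grid resolution at this level
        fx, fy = x * res, y * res
        x0, y0 = math.floor(fx), math.floor(fy)    # lower-left corner of the cell
        tx, ty = fx - x0, fy - y0                  # position inside the cell
        out = [0.0] * len(table[0])
        for dx, wx in ((0, 1 - tx), (1, tx)):
            for dy, wy in ((0, 1 - ty), (1, ty)):
                entry = table[hash_index(x0 + dx, y0 + dy, len(table))]
                for k, value in enumerate(entry):
                    out[k] += wx * wy * value
        features.extend(out)                       # concatenate across levels
    return features


def make_mlp(sizes, rng):
    """Random weight matrices for layer widths sizes = [in, hidden..., out]."""
    return [[[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
            for n_in, n_out in zip(sizes[:-1], sizes[1:])]


def mlp(features, layers):
    """Stage 2 (MLP): ReLU on hidden layers, linear output layer, no biases."""
    h = features
    for i, weights in enumerate(layers):
        h = [sum(w * v for w, v in zip(row, h)) for row in weights]
        if i < len(layers) - 1:
            h = [max(0.0, v) for v in h]
    return h


def neural_field(x, y, tables, layers):
    """Encoder output goes straight into the MLP, mirroring the two-stage flow."""
    return mlp(encode(x, y, tables), layers)


rng = random.Random(0)
tables = make_tables(levels=4, table_size=64, feat_dim=2, rng=rng)
layers = make_mlp([4 * 2, 16, 3], rng)   # 4 levels x 2 features in, 3 values out
rgb = neural_field(0.3, 0.7, tables, layers)
print(rgb)
```

In this toy version the encoder is a handful of table reads per level and the MLP is a few small matrix-vector products, which is the split between the Encoding Engines and the MLP Engine described above.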

This engine sits alongside the GPCs with a local 1MB scratchpad and is meant to handle ALL neural shading MLP calculations.

There is substantial area and power overhead, but at +4.52% area and +2.75% power an NGPC-8 configuration could be doable.

Massive speedups were observed across NeRF, NSDF, NVR, and GIA. Based on a modified 3090 design, the reported targets are 4K at 30 FPS for NeRF and 8K at 120 FPS for the other applications.
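
To put those targets in perspective, a quick back-of-the-envelope calculation of the pixel rates and frame-time budgets they imply. I am assuming 4K means 3840x2160 and 8K means 7680x4320, which the post does not spell out, and this counts output pixels only, not the number of network queries per pixel.

```python
# Assumed UHD resolutions; not taken from the paper.
RESOLUTIONS = {"4K": (3840, 2160), "8K": (7680, 4320)}


def frame_budget_ms(fps):
    """Time available per frame, in milliseconds."""
    return 1000.0 / fps


def pixels_per_second(name, fps):
    """Output pixels that must be shaded every second at this target."""
    width, height = RESOLUTIONS[name]
    return width * height * fps


nerf = pixels_per_second("4K", 30)      # NeRF target
others = pixels_per_second("8K", 120)   # NSDF, NVR, and GIA target

print(f"4K30:  {nerf:,} px/s, {frame_budget_ms(30):.2f} ms per frame")
print(f"8K120: {others:,} px/s, {frame_budget_ms(120):.2f} ms per frame")
print(f"ratio: {others // nerf}x")
```

Under those assumptions the 8K 120 FPS target needs 16 times the pixel rate of the 4K 30 FPS target.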

I know the thread says GPU only, but this is a design for augmenting a GPU with an ASIC, similar to augmenting shaders with RT cores and other earlier additions, so nothing new. Let me know if I need to repost it somewhere else.

I've taken a brief look at the rest of the literature, including things such as Instant-NGP, and it looks like everyone is trying to avoid MLPs like the plague and find other approaches that are faster and use fewer resources. MLPs are reliable and established but not particularly efficient, to say the least.

So the paper is interesting, but as for an actual HW implementation I would not bet on it. If possible, SW is always king.
 