I cannot tell you exactly how intense the matrix math in game engines is. But it is certain that matrices are used everywhere in games (you can google that if you want). Today this often means doing N vector * vector operations instead of one matrix * vector operation. In part this split into multiple vectors is useful, because it allows for easy parallelization on the wide SIMD units of a GPU. But you can do that with matrices as well, because you have millions of pixels anyway.
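A minimal NumPy sketch of that equivalence (illustration only, not engine code, and the values are made up): doing N dot products one row at a time gives the same result as a single matrix * vector product, but the matrix form hands the hardware one big, regular operation.

```python
import numpy as np

# Hypothetical example: transform one vertex by a 4x4 matrix.
M = np.arange(16, dtype=np.float32).reshape(4, 4)
v = np.array([1.0, 2.0, 3.0, 1.0], dtype=np.float32)

# N-times vector * vector: one dot product per matrix row.
row_by_row = np.array([np.dot(M[i], v) for i in range(4)])

# 1-time matrix * vector: the same math as one operation.
one_shot = M @ v

assert np.allclose(row_by_row, one_shot)
```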
I would suspect that the cooperative vector API greatly reduces the transition overhead between the matrix cores and the other parts of a CU. In the end, everything uses the same registers and caches of a CU or SM. So the main thing you need to care about is data layout (vectors vs. matrices), so that you do not need to shuffle your data around when switching between the two. How big the actual benefits will be remains to be seen. I hope we see some talks and presentations about cooperative vectors from AMD, Nvidia and game developers.
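To illustrate the layout point with a small NumPy sketch (hypothetical data, not shader code): if the buffer is laid out contiguously, switching between a "list of vectors" view and a "matrix" view is just a reinterpretation of the same memory; a mismatched layout is what forces an actual copy.

```python
import numpy as np

# Hypothetical data: four 4-component vectors in one contiguous buffer.
buf = np.arange(16, dtype=np.float32)

# Viewing the buffer as a 4x4 matrix is free: same memory, just
# reinterpreted. No data gets shuffled around.
mat = buf.reshape(4, 4)
assert np.shares_memory(mat, buf)  # a view, not a copy

# But if the layout does not match (here: column-major access on a
# row-major buffer), making it contiguous again forces a real copy.
copied = np.ascontiguousarray(mat.T)
assert not np.shares_memory(copied, buf)
```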
For me, there is another reason for matrices:
Optimizing performance in general. Game developers have used vector math for ages. Now they could reshape their algorithms to do direct matrix math. Will it be faster? That depends on the use case. But I could very well imagine that it would allow developers to push further. As you said, with matrix operations you optimize for bandwidth and power, and both are scarce on GPUs if you want optimal performance. Will that take some effort and time? Sure.
I do not see it happening too soon, at the earliest with the next console cycle, because those consoles will support WMMA acceleration. If you want to squeeze the maximum out of a console, you optimize your code and increase the utilization of the available hardware units. Most image filter kernels (e.g. Lanczos, used in post-processing) are matrix operations, vector * matrix is used for orientation and transformation of things, dot products for geometric work, and so on. If the data is laid out right, you can feed it in as vectors or as matrices; the result is the same.
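As a sketch of that last point (NumPy again, with made-up numbers): transforming many points one vector * matrix at a time and transforming them all with a single batched matrix product give identical results, and the batched form is exactly the shape of work a WMMA-style unit can accelerate.

```python
import numpy as np

rng = np.random.default_rng(0)

# A hypothetical 4x4 transform and 1000 points in homogeneous
# coordinates; the actual values do not matter for the comparison.
T = rng.standard_normal((4, 4)).astype(np.float32)
points = rng.standard_normal((1000, 4)).astype(np.float32)

# Per-point vector * matrix, as classic shader code would do it.
one_by_one = np.array([p @ T for p in points])

# One batched matrix * matrix over all points at once.
batched = points @ T

assert np.allclose(one_by_one, batched, atol=1e-5)
```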