With Kopite7kimi pretty much confirm the 512-bit memory bus interface of upcoming Blackwell GPU, here comes the thread for discussion of all future Blackwell GPU family, aka RTX 5000 series. We also know about codename of all 5 die sizes; namely GB202, GB203, GB205, GB206 and GB207. Below is the table with some speculations of my own, you guys are welcome to pitch in:-
I also put in upcoming RDNA5 just for comparison sake...
Even though we know nVidia will implement 512-bit memory bus, we don't know what types of memory they will choose, I try to list down all possibility of memory choices below:
There was rumored about usage of 384-bit GDDR7 memory before, I believe nVidia is testing both 384-bit GDDR7 and 512-bit GDDR6x and then decide to use GDDR6x options. Both memories have same bandwidth even though inteface is different.
Hmm, it seems to me usage of GDDR6x make more sense compared to GDDR7, what do you think???
As for amount of CUDA cores, nVidia is changing the architecture of Blackwell GPU, so far no leaks about structure of new Shader Model...However, we should be expecting at least 50% performance improvement due to extra 50% memory bandwidth improvement...
Update 1:
Codename | Possible Model Number | Possible Memory Configuration | Possible Mobile GPU | Possible AMD's response |
---|---|---|---|---|
GB202 | 5090 Ti | 512-bit 32GB | ||
5090 | 448-bit 28GB | |||
GB203 | 5080 Ti | 384-bit 24GB | N51 384-bit 24GB GDDR7 | |
5080 | 320-bit 20GB | N51 320-bit 20GB GDDR7 | ||
GB205 | 5070 Ti | 256-bit 16GB | 256-bit 16GB GDDR6 | N52 256-bit 16GB GDDR7 |
GB206 | 5070 | 192-bit 12GB | 192-bit 12GB GDDR6 | N53 128-bit 12GB GDDR7 |
GB207 | 5060 Ti | 128-bit 8/16 GB | 128-bit 8GB GDDR6 | |
5060 | 128-bit 8GB |
Even though we know nVidia will implement 512-bit memory bus, we don't know what types of memory they will choose, I try to list down all possibility of memory choices below:
384-bit GDDR6(X) | 384-bit GDDR7 | 512-bit GDDR7 | 512-bit GDDR6X | |
---|---|---|---|---|
RTX 4090 | 24GB 21Gbps 1TB/s | 24GB 32Gbps 1.5TB/s | 32GB 32Gbps 2TB/s | 32GB 24Gbps 1.5TB/s |
+ 50% | + 100% | + 50% | ||
Pros |
|
|
| |
Cons |
|
|
| |
AMD Radeon RX 7900 XTX | 24GB 20Gbps 960GB/s | 24GB 32Gbps 1.5TB/s | ||
+ 60% |
There was rumored about usage of 384-bit GDDR7 memory before, I believe nVidia is testing both 384-bit GDDR7 and 512-bit GDDR6x and then decide to use GDDR6x options. Both memories have same bandwidth even though inteface is different.
Hmm, it seems to me usage of GDDR6x make more sense compared to GDDR7, what do you think???
As for amount of CUDA cores, nVidia is changing the architecture of Blackwell GPU, so far no leaks about structure of new Shader Model...However, we should be expecting at least 50% performance improvement due to extra 50% memory bandwidth improvement...
Update 1:
Model | Codename | Die Size (mm2) | SM/CU | CUDA/ | L2 Cache | Memory | Memory BW | BW +/- | TPU Perf |
---|---|---|---|---|---|---|---|---|---|
RTX 3090Ti | GA102 | 628 | 84 | 10752 | 6 MB | 384-bit 24GB GDDR6X | 1 TB/s | ||
RTX 4090 | AD102 | 609 | 128 | 16384 | 72 MB | 384-bit 24GB GDDR6X | 1 TB/s | + 0% | + 45% |
RTX 5090(Ti) | GB202 | ? | ? | ? | 128 MB | 512-bit 32GB GDDR6X | 1.5 TB/s | + 50% | ? |
? | 384-bit 24/36 GB GDDR7 | 1.5 TB/s | + 50% | ? | |||||
RTX 3080Ti | GA102 | 628 | 80 | 10240 | 6 MB | 384-bit 12GB GDDR6X | 912.4 GB/s | ||
RTX 4080Ti | AD102 | 609 | ? | ? | ? | 320-bit 20GB GDDR6X | ? | ||
RTX 5080Ti | GB203 | ? | ? | ? | 96 MB | 384-bit 24GB GDDR6X | 1152 GB/s (75% of N51) | + 61% | ? |
RTX 4080 | AD103 | 379 | 76 | 9728 | 64 MB | 256-bit 16GB GDDR6X | 716 GB/s | ||
RTX 5080 | GB203 | ? | ? | ? | 80 MB | 320-bit 20GB GDDR6X | 960 GB/s (75% of N51) | + 34% | ? |
RTX 3080 | GA102 | 628 | 68 | 8704 | 5 MB | 320-bit 10GB GDDR6X | 760.3 GB/s | ||
RTX 4070Ti | AD104 | 294 | 60 | 7680 | 48 MB | 192-bit 12GB GDDR6X | 504.2 GB/s | - 34% | + 17% |
RTX 5070Ti | GB205 | ? | ? | ? | 64 MB | 256-bit 16GB GDDR6X | 768 GB/s (75% of N52) | + 52% | ? |
RTX 3070Ti | GA104 | 392 | 48 | 6144 | 4 MB | 256-bit 8GB GDDR6X | 608.3 GB/s | ||
RTX 4070 | AD104 | 294 | 46 | 5888 | 36 MB | 192-bit 12GB GDDR6X | 504.2 GB/s | - 17% | + 14% |
RTX 5070 | GB206 | ? | ? | ? | 48 MB | 192-bit 12GB GDDR6X | 576 GB/s (12.5% > N53) | + 14% | ? |
RTX 3060Ti | GA104 | 392 | 38 | 4864 | 4 MB | 256-bit 8GB GDDR6 | 448 GB/s | ||
RTX 4060Ti | AD106 | 188 | 34 | 4352 | 32 MB | 128-bit 8/16 GB GDDR6 | 288 GB/s | - 36% | + 11% |
RTX 5060Ti | GB207 | ? | ? | ? | 32 MB | 128-bit 8/16 GB GDDR6X | 384 GB/s | + 33% | ? |
RTX 3060 | GA106 | 276 | 28 | 3584 | 3 MB | 192-bit 12GB GDDR6 | 360 GB/s | ||
RTX 4060 | AD107 | 159 | 24 | 3072 | 24 MB | 128-bit 8GB GDDR6 | 272 GB/s | - 25% | + 18% |
RTX 5060 | GB207 | ? | ? | ? | 24 MB? | 128-bit 8GB GDDR6 | 320 GB/s | + 18% | ? |
RX 6950 XT | Navi 21 | 520 | 80 | 5120 | 128 MB | 256-bit 16GB GDDR6 | 576 GB/s | ||
RX 7900 XTX | Navi 31 | 529 | 96 | 6144 | 96 MB | 384-bit 24GB GDDR6 | 960 GB/s | + 67% | + 36% |
RX 8950 ? | Navi 51 | ? | ? | ? | ? | 384-bit 24GB GDDR7 | 1.5 TB/s | + 60% | ? |
RX 6750 XT | Navi 22 | 335 | 40 | 2560 | 96 MB | 192-bit 12GB GDDR6 | 432 GB/s | ||
RX 7800 XT | Navi 32 | 346 | 60 | 3840 | 64 MB | 256-bit 16GB GDDR6 | 620.8 GB/s | + 44% | + 40% |
RX 8800 ? | Navi 52 | ? | ? | ? | ? | 256-bit 16GB GDDR7 | 1 TB/s | + 65% | ? |
RX 7700 XT | Navi 32 | 346 | 54 | 3456 | 48 MB | 192-bit 12GB GDDR6 | 432 GB/s | ||
RX 8700 ? | Navi 53 | ? | ? | ? | ? | 128-bit 12GB GDDR7 | 512 GB/s | + 18.5% | ? |
Last edited: