witeken
Diamond Member
- Dec 25, 2013
- 3,899
- 193
- 106
We are talking about the design of the iGPU not the density of the process.
Performance per mm² is directly related to process used.
We are talking about the design of the iGPU not the density of the process.
Iris Pro(HD5200) iGPU is close to 100-110mm2, add 85mm2 for the eDRAM and we are close to 200mm2 vs 110-120mm2 for Kaveri.
Intel Iris (HD5000) without eDRAM is considerable slower than Kaveri at the same die size with a full node advantage. Kaveris iGPU at 22nm would be smaller than Intels HD5000 and perform faster.
You could as well have added the other caches to your die size calculation...
Performance per mm² is directly related to process used.
You could as well have added the other caches to your die size calculation...
You don't decide what's being talked about. Someone made a comment about Intel's IGP's performance per mm². I replied.Again, we are not talking about process density.
You cherry picked Intel's L4 cache, which has only indirectly to do with the performance. You could as well have added L1, L2 and L3, or while you're at it, the whole die.What other caches ??
HD5000 is the same iGPU as Iris Pro HD5200 but without the eDRAM. HD5000 is slower than both Iris Pro and Kaveri.
You don't decide what's being talked about. Someone made a comment about Intel's IGP's performance per mm². I replied.
The most dreadful thing about Intel IGP is performance/mm^2 right now. Looking at the competitors it seems they are 5-10 years behind in this and they even have a decent process to hide the problem!
Imagine the same architecture on 28 or 32nm process...
Performance per mm² isn't stellar because Intel didn't focus on density until 14nm.
You cherry picked Intel's L4 cache, which has only indirectly to do with the performance. You could as well have added L1, L2 and L3, or while you're at it, the whole die.
You could as well have added the other caches to your die size calculation...
It shows how far behind intel is.
This is how it always goes. It's like the comparison GCN vs Maxwell; Nvidia's GPU architecture is 1 iteration ahead. There's always some bias: when it launched, Iris Pro was pretty decent, and just after Broadwell-K's launch, it will also look good... until AMD comes with a new generation to leapfrog Intel etc.
If you insist on keeping eDRAM iris pro results, compare it against kaveri without memory bottleneck, which is the same as HD7750.
Compare it to a product that doesn't exist except in your mind...
Last time I checked HD7750 was a real product. (It has 20% shaders disabled)
Kaveri without memory bandwidth limits doesn't exist except in your mind.
It a sign of desperation when you have to make up hypothetical products to make your case against a real shipping product.
You're the one who brought up Kaveri, not me.
No. Some here were comparing performance of intel igp to kaveri with conclusion that intel graphics architecture is as good (or even better) than gcn since kaveri have more space dedicated to gpu than iris pro.
No. Some here were comparing performance of intel igp to kaveri with conclusion that intel graphics architecture is as good (or even better) than gcn since kaveri have more space dedicated to gpu than iris pro.
Its not unusual for people here to show amd in worst possible scenario. Here they put gcn in the most bottlenecked flavour available and dance their victory dance.
If they want to compare intel graphics to amd graphics, at least be square.
Comparing ~85mm2 iris pro to 90mm2 gcn part to draw perf/mm2 seems reasonable.
Given that one core is 3.1 mm2, extrapolating out gives the size of the die at 31.4x the size of a single core, or 97.3 mm2. The GPU area is approximately 5.2x the size of a core, giving ~16.1 mm2 for 128 GCN cores, compared to 12.4 mm2 for CPU cores. The Video Codec Engine and Unified Video Decoder are not part of these totals, located on other parts of the APU. The memory controller clocks in at ~9.4 mm2 and the display/IO portion runs at ~7.3 mm2.
To be fair, I thought an on die/APU comparison would be best. That is what is being sold after all and implementation frequently is more important than technical.
To really make a comparison the performance area of the gcn must be compared to the performance area of Gen 7.5 graphics. With process scaling and driver handicapps taken into account.
In an implementation form AMD or Intel can do stupid things. Like putting 512 shaders when you only have the bandwidth to feed 384.
Ie) http://www.anandtech.com/show/8067/amd-am1-kabini-part-2-athlon-53505150-and-sempron-38502650-tested
16.1 mm^2 for 128 sp -> 64.4 mm^2 for 512 sp.
Something similar must be done for intel.
