What? They took out SMT with LNC and now PNC, the point is that LBT saw DMR without SMT and knew it immensely hurt their competitiveness with Venice. Nothing in the letter even implies that Granite Rapids is the generation bringing back SMT. Why would the CEO announce bringing back a feature to DCAI products that hasn’t even left the segment yet, and launched before his tenure?
To support [regaining share as we ramp Granite Rapids], we are reintroducing simultaneous multi-threading (SMT).
The last time that I read something extensive about it was here at AT. I'd have to go digging in the internet archive to find it again, but they had a color coded comparison of the processor functional units showing what was unique, what was shared, what was partitioned, and what was wholely duplicated.Could you link your source for the breakdown what is statically partitioned, what is watermarked and what is shared for HT? In AMD case that information is in software optimization guide among other sources. Is that true for Intel too?
Sounds really good:It's not that deep. He's saying it because it sounds good.
That's exactly what a CEO wants to do, invent a non-existent reason to talk about his company losing competitive advantage.Moving away from SMT put us at a competitive disadvantage.
You know that it is the reason they are not confident with DMR on winning all the benchmarks a single feature lmao.That's exactly what a CEO wants to do, invent a non-existent reason to talk about his company losing competitive advantage.
Even with the first implementation in Zen 1 most of it was already competitively shared though.The last time that I read something extensive about it was here at AT. I'd have to go digging in the internet archive to find it again, but they had a color coded comparison of the processor functional units showing what was unique, what was shared, what was partitioned, and what was wholely duplicated.
Server is where the highest margins are. Server is where the highest growth is. Removing features like SMT and AVX512 was an unmitigated disaster for Intel from a financial standpoint.That particular part of that quote doesn't make sense. Intel only dropped hyperthreading on Arrow Lake and Lunar Lake. The lack of SMT is pretty low on the list of issues with those CPUs. Server is where SMT matters the most, and they haven't dropped it for any server products. That reads to me as Lip Bu Tan trying to distance himself from his predecessor.
.. and that is relevant how? Intel proclaiming that they shouldn't have removed SMT seems like pretty conclusive evidence that the removal was a mistake .... don't you think?Not out yet.
That may be less of a problem in server than it is in desktop. In server, the processor is often power constrained by the socket power limits, or thermally limited by the density of the cores in the package.Really I wouldn't expect Intel to be competitive in server again until they commit to TSMC.
My thoughts are:Process node is the least of their problems in server.
In my management positions over the last 20 years, I have always argued against layoffs of engineers:Every layoffs bleeds talent
DMR has AVX-512 as for High latency we will see with DMR it's unknown1) Lack of SMT
2) High latency between chips
3) Lack of AVX512
Intel has always had AVX-512 on servers since Skylake Xeons on all P-core server stuff. DMR will have AVX512/AVX 10.2 too.which is why I am so distraught that Intel botched the architecture by removing SMT and AVX512.
My thoughts are:
1) Lack of SMT
2) High latency between chips
3) Lack of AVX512
Intel has always had AVX-512 on servers since Skylake Xeons on all P-core server stuff. DMR will have AVX512/AVX 10.2 too.
Only E-core server stuff of SRF and CWF don't have AVX512.
If E-core based RRF happens in 2027 it will have AVX512/AVX10.2.
It has 288Cores the problem I see is the launch schedule it's too damm close to Venice dense if it had come out this year it would have been fine.The problem with CWF is the lack of SMT. It could be a tough sell.
The problem with CWF is the lack of SMT. It could be a tough sell.
Those are the diagrams that I was thinking of, but it's not the information I was looking for. I think I am confusing two separate articles. I remember HotHardware had a good one and someone else had one that addressed a characteristic of AMD's implementation that was giving them more efficient resources allocation.Even with the first implementation in Zen 1 most of it was already competitively shared though.
![]()
Zen - Microarchitectures - AMD - WikiChip
Zen (family 17h) is the microarchitecture developed by AMD as a successor to both Excavator and Puma. Zen is an entirely new design, built from the ground up for optimal balance of performance and power capable of covering the entire computing spectrum from fanless notebooks to high-performance...en.wikichip.org
![]()
AMD has an update for Zen 5 at https://www.amd.com/en/blogs/2025/simultaneous-multithreading-driving-performance-a.html
- Red - Competitively shared structures
- Turquoise - Competitively shared and SMT tagged
- Blue - Competitively shared with Algorithmic Priority
- Green - Statically Partitioned
View attachment 129504
View attachment 129505
The core count(288) is there though especially on 2S (576 cores). It should be a decent part for Cloud though it's likely not as versatile as Turin D in lots of other workloads. Has other issues as well like the launch delay, and perhaps slow L3 cache bottle-necking performance. Have to wait for reviews to see how well it performs and scales.The problem with CWF is the lack of SMT. It could be a tough sell.
Those are the diagrams that I was thinking of, but it's not the information I was looking for. I think I am confusing two separate articles. I remember HotHardware had a good one and someone else had one that addressed a characteristic of AMD's implementation that was giving them more efficient resources allocation.
It will be interesting to see how DMR competes with Venice (non D). Anyone have any idea how many cores Venice (non D) will have?DMR has AVX-512 as for High latency we will see with DMR it's unknown
Only E-core server stuff of SRF and CWF don't have AVX512.
for cloud computing, more threads will matter. Considering the target market of CWF, the lack of SMT makes it likely that Venice D will best it by quite a bit.The problem with CWF is the lack of SMT. It could be a tough sell.
Agree. Against Turin D, CWF looks very good. Against Venice D .... not so much.It has 288Cores the problem I see is the launch schedule it's too damm close to Venice dense if it had come out this year it would have been fine.
It still has to compete with a 2S Venice D which will be sporting 512c/1024t. Last round, Intel's 2S offering wasn't impressive. Hopefully they have worked out whatever issue they had going on with the last Xeon's.The core count(288) is there though especially on 2S (576 cores). It should be a decent part for Cloud though it's likely not as versatile as Turin D in lots of other workloads. Has other issues as well like the launch delay, and perhaps slow L3 cache bottle-necking performance. Have to wait for reviews to see how well it performs and scales.
At least till now E-core team has been focused on optimizing area and IPC. SMT hasn't been a focus of E-core guys throughout.
Nobody was saying that past autumn 2020 when vaccines became available en masse, and Pat got his job in Feb 2021.
He bet a lot on High NA which clearly isn't a magic wand he wanted it to be.
Will it? If SMT matters so much for cloud, then why exactly do none of the custom ARM processors deployed by AWS, Azure, Google and Alibaba have SMT? It's not like ARM is incapable of offering a core design with SMT.for cloud computing, more threads will matter. Considering the target market of CWF, the lack of SMT makes it likely that Venice D will best it by quite a bit.
I hadn't seen that particular chart before, but that addresses what I was talking about.View attachment 129522
did you mean this one by chance? source is C&C https://chipsandcheese.com/p/intels-redwood-cove-baby-steps-are-still-steps
ARM itself currently only offers three cores with SMT: Neoverse-e for network edge and a pair of A65e variants for specific markets that use it more for compute verification purposes. A big issue for SMT is that it is often performance indeterminate in that a thread can run on the same core with the same data and perform differently depending on what the other thread is up to. That behavior is unpopular in ARM world. It also increases cache pressure and can make core power draw less predictable.Will it? If SMT matters so much for cloud, then why exactly do none of the custom ARM processors deployed by AWS, Azure, Google and Alibaba have SMT? It's not like ARM is incapable of offering a core design with SMT.
I'm not saying that cloud providers don't want to have some instance types with SMT - for certain workloads it is advantageous and their customers will want it. But for plenty of cloud instance types SMT is going to provide zero benefit. Some instance types even hard disable it.
He's much more likely to make this type of decision based on (updated) reports from engineering teams. You need to keep in mind the information LBT received may not necessarily be the same as Pat. (due to changes in company environment and structure, updates in simulations etc.)
SMT provides the best performance for the transistor budget of anything I can think of.Will it? If SMT matters so much for cloud, then why exactly do none of the custom ARM processors deployed by AWS, Azure, Google and Alibaba have SMT? It's not like ARM is incapable of offering a core design with SMT.
I'm not saying that cloud providers don't want to have some instance types with SMT - for certain workloads it is advantageous and their customers will want it. But for plenty of cloud instance types SMT is going to provide zero benefit. Some instance types even hard disable it.
Depends on the definition of spitting distance. Half the total performance or better for half the price is what Ampere's latest looks like - https://www.servethehome.com/ampere...permicro-nvidia-broadcom-kioxia-server-cpu/2/Currently, there aren't any ARM servers that can come within spitting distance of x86 that I have seen (someone feel free to show me a benchmark that proves me wrong here).
NV already announcing upcoming Vera CPU comes with SMT, guess it is the feature eventually comes to all CPU.Depends on the definition of spitting distance. Half the total performance or better for half the price is what Ampere's latest looks like - https://www.servethehome.com/ampere...permicro-nvidia-broadcom-kioxia-server-cpu/2/
ARM could implement SMT on all their Neoverse offerings if their customers were asking for it. They aren't asking for it. It is fair to argue that they're not asking for it because AMD and Intel provide an acceptably priced option for servicing that market. But if SMT really was the best thing ever they wouldn't be ignoring it.