Intel's Next Gen - Conroe - what do you make of the speculation?

Page 2 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Intelia

Banned
May 12, 2005
832
0
0
Originally posted by: Hard Ball
Originally posted by: Intelia
Originally posted by: Diasper
Fixed link to anandtech's article.

More thoughts?

Anandtech's seems more probably explanation - coming out with something truely radical like what the inquirer suggests I can't imagine they'd do this generation - that's too quick.

We got two differant things from our reading . Anand really touches on all the possibilities well thought out and makes since . But he touches on the compilier issue. The Inquery article was actually fucosed on the Compiler issue. And it is a fact that Intel did infact buy that Russian company. So I am really leaning towards the Inquire article as being pretty close to what well see . My opion only.

Well Intelia, I think you really mean:
Originally posted by: Intelia
My opium only.

Since when has the inquirer been a reliable source of information; not only that, the author fully upfront admits that all of his predictions are merely speculation, mostly fueled by this line of reasoning:
The change is so big in fact, it?s the reason for Apple?s processor switch. Indeed the phrase given when Steve Jobs announced the switch, "performance per watt" is the very same phrase being used by Intel spokesmen.

Even more dubious, some of the othe justifications for his theory, were things like transfer of some of the Alpha RISC personel to Intel years ago. Please keep things in perspective before going off the deep end.
Start a thread talking about future processors from amd and I will be happy to come in and flame ya . TROLL

 

Hard Ball

Senior member
Jul 3, 2005
594
0
0
Originally posted by: Intelia
Originally posted by: Hard Ball
Originally posted by: Intelia
Originally posted by: Diasper
Fixed link to anandtech's article.

More thoughts?

Anandtech's seems more probably explanation - coming out with something truely radical like what the inquirer suggests I can't imagine they'd do this generation - that's too quick.

We got two differant things from our reading . Anand really touches on all the possibilities well thought out and makes since . But he touches on the compilier issue. The Inquery article was actually fucosed on the Compiler issue. And it is a fact that Intel did infact buy that Russian company. So I am really leaning towards the Inquire article as being pretty close to what well see . My opion only.

Well Intelia, I think you really mean:
Originally posted by: Intelia
My opium only.

Since when has the inquirer been a reliable source of information; not only that, the author fully upfront admits that all of his predictions are merely speculation, mostly fueled by this line of reasoning:
The change is so big in fact, it?s the reason for Apple?s processor switch. Indeed the phrase given when Steve Jobs announced the switch, "performance per watt" is the very same phrase being used by Intel spokesmen.

Even more dubious, some of the othe justifications for his theory, were things like transfer of some of the Alpha RISC personel to Intel years ago. Please keep things in perspective before going off the deep end.
Start a thread talking about future processors from amd and I will be happy to come in and flame ya . TROLL

If you are taking this kind of article from the inquirer to be fact, than it is you who's the biggest troll around here. You have no evidence, yet you insist that the article is with full merits, and that the EE merom will have HT. Where in the world are you getting your "information" from??

 

Elcs

Diamond Member
Apr 27, 2002
6,278
6
81
Originally posted by: Kensai
Meh, too long to read..

I read about 1/2 of the article and its rather interesting from a reading perspective.

Sounds like too much technobabble to me.
 

clarkey01

Diamond Member
Feb 4, 2004
3,419
1
0
I dont pay attention to the hype. itanic was suppose to take over the world and kill X86, Nehalem (netburst) was to hit 10.25 Ghz in 2005 and prescott was suppose to be the final nail in AMD's coffin after the poor showing of the 3200 AXP.

Itanic didnt take over the world. It didnt kill x86. Nehalem and the long terms of plans of netburst were dicthed. 10 Ghz or even 4Ghz was not seen in 2005. Prescott was delayed and became preshott and handed the performance crown over to AMD for being a backwards step interms of performance over northwood.


My point...Wait for the damn for thing.
 

Intelia

Banned
May 12, 2005
832
0
0
Originally posted by: Intelia
Originally posted by: Sentential
Originally posted by: Intelia
Originally posted by: Sentential
Personally Im very dissapointed because from what I have seen thus far neither it or Presler will have Hyperthreading. So to me there is no real point in buying it since I wont gain much of anything over a single core with HT.

It will have H/T and 4+4 of shared cache in the ee model $$$$$

Still tho, an EE @ 1k isnt going to be selling much nor will I be buying one. So its a moot point.

If the EE has 4+4 of shared cache it would be worth the primiem price 8mb of gaming cache Oh YA

Intel has a lot of experience of VLIW processors from its Itanium project which has now been going on for more than a decade. Intel also now has HP?s expertise on board as HP?s entire Itanium design team was recently transferred to Intel.

Another technology Intel has access to is DEC?s FX!32. This was written in the mid 1990s and allowed X86 software to run on Alpha RISC microprocessors. A lot of the Alpha people and technology was transferred to Intel and FX!32 most likely went with it, indeed it has already been developing similar technology to run X86 binaries on Itanium for quite some time now.

It gets better. Both the Itanium and the Transmeta designs were said to be inspired by VLIW designs built in Russia by a company called Elbrus. Intel did a deal with Elbrus in mid 2004 then went on to buy the company in August 2004. The exact nature of the deal is unclear, however, as another company continued and taped out the E2K processor earlier this year.

Most interestingly though is the E2K compiler technology which allows it to run X86 software. This is exactly the sort of technology Intel need and since last year they have had access to it and employ many of it?s designers.

So, Intel has access to VLIW technology from the Itanium and HP as well as the translation software from DEC. Most importantly it has the highly advanced technology from Elbrus which has been in development since the 1980s.

One really needs to look at the picture as a whole not just bits and piecies.

If intel gets the Itanic to run X86 without a speed penalty . It will be a great great CPU.
This is from the Anand article

Moving on to the final architecture, we come to IA-64/EPIC. While similar in some ways to VLIW (Very Long Instruction Word) architectures of the past, Intel worked to overcome some of the problems with VLIW (specifically the need to recompile code for every processor update) and called their new approach EPIC: "Explicitly Parallel Instruction Computer". In contrast to the P6, NetBurst, K7, and K8 architectures that can issue up to three instructions per cycle, the current Itanium 2 chips can issue eight instructions per clock. From a purely theoretical standpoint, the fastest Itanium 2 running at 1.6 GHz actually has more computational power than any other Intel chip. Throw in dual core designs with HyperThreading - HyperThreading that actually works much better than NetBurst HTT due to the wide design of EPIC - and each chip not only has the potential to issue eight instructions per clock, but it should actually come relatively close to that number. Another difference between Itanium and the other designs is that large amounts of cache are present in order to keep the pipelines fed with data. Current models ship with up to 9MB of L3 cache, while future parts like the Montecito will have 24MB of L3 cache (and a transistor count of 1.7 billion transistors - about eight times the transistor count of the Pentium D Smithfield core)!

Of course, with the wide issue rate of Itanium 2 (the original Itanium "only" had a 6-wide issue rate), you need a lot of execution units. NetBurst has 7 execution units in Prescott: two simple integer units (which can function as 4 integer units if you count the double pumped design), a complex integer unit, two FP/SIMD units, and dedicated memory load and store units. If you want to count the simple integer units as 2 each, you could make a stretch and say NetBurst has nine execution units. AMD's K7 and K8 both have nine execution units as well, only they go for a less customized approach and instead have three each of the integer, FP/SIMD, and memory units. Each of AMD's units is fully functional, unlike the "simple" and "complex" integer units in NetBurst. In contrast to these architectures, the current Itanium 2 chips have six ALUs (Arithmetic Logic Units), three BRUs (Branch Units), two FPUs, one SIMD, two load units, and two store units - call it 16 functional units if you prefer, though the specialization of some of them makes it slightly less than that. While Itanium 2 is very wide, the length of the pipeline is only 8 stages - less than any other modern x86 processor by a significant amount. That certainly plays a role in the reduced clock speeds, but like Athlon 64, lower clock speeds with a more efficient architecture can outperform long pipelines in many instances. In order to extract all of the potential performance from Itanium, however, a lot of work needs to be done during code compilation. This is the Achilles' heel of VLIW designs; Processor updates require the code to be recompiled. While EPIC doesn't require that you recompile the code, newer compiler optimizations can improve performance significantly.


 

Intelia

Banned
May 12, 2005
832
0
0
Originally posted by: Hard Ball
Originally posted by: Intelia
Originally posted by: Hard Ball
Originally posted by: Intelia
Originally posted by: Diasper
Fixed link to anandtech's article.

More thoughts?

Anandtech's seems more probably explanation - coming out with something truely radical like what the inquirer suggests I can't imagine they'd do this generation - that's too quick.

We got two differant things from our reading . Anand really touches on all the possibilities well thought out and makes since . But he touches on the compilier issue. The Inquery article was actually fucosed on the Compiler issue. And it is a fact that Intel did infact buy that Russian company. So I am really leaning towards the Inquire article as being pretty close to what well see . My opion only.

Well Intelia, I think you really mean:
Originally posted by: Intelia
My opium only.

Since when has the inquirer been a reliable source of information; not only that, the author fully upfront admits that all of his predictions are merely speculation, mostly fueled by this line of reasoning:
The change is so big in fact, it?s the reason for Apple?s processor switch. Indeed the phrase given when Steve Jobs announced the switch, "performance per watt" is the very same phrase being used by Intel spokesmen.

Even more dubious, some of the othe justifications for his theory, were things like transfer of some of the Alpha RISC personel to Intel years ago. Please keep things in perspective before going off the deep end.
Start a thread talking about future processors from amd and I will be happy to come in and flame ya . TROLL

If you are taking this kind of article from the inquirer to be fact, than it is you who's the biggest troll around here. You have no evidence, yet you insist that the article is with full merits, and that the EE merom will have HT. Where in the world are you getting your "information" from??
This is a discussion about future processor development and what we get out of it . speculations is all we have . What part of that don't you understand.

 

Intelia

Banned
May 12, 2005
832
0
0
Start a thread talking about future processors from amd and I will be happy to come in and flame ya . TROLL (this was meant for the meth, guy hard ball)
 

Hard Ball

Senior member
Jul 3, 2005
594
0
0
Originally posted by: Intelia
Originally posted by: Hard Ball
Originally posted by: Intelia
Originally posted by: Hard Ball
Originally posted by: Intelia
Originally posted by: Diasper
Fixed link to anandtech's article.

More thoughts?

Anandtech's seems more probably explanation - coming out with something truely radical like what the inquirer suggests I can't imagine they'd do this generation - that's too quick.

We got two differant things from our reading . Anand really touches on all the possibilities well thought out and makes since . But he touches on the compilier issue. The Inquery article was actually fucosed on the Compiler issue. And it is a fact that Intel did infact buy that Russian company. So I am really leaning towards the Inquire article as being pretty close to what well see . My opion only.

Well Intelia, I think you really mean:
Originally posted by: Intelia
My opium only.

Since when has the inquirer been a reliable source of information; not only that, the author fully upfront admits that all of his predictions are merely speculation, mostly fueled by this line of reasoning:
The change is so big in fact, it?s the reason for Apple?s processor switch. Indeed the phrase given when Steve Jobs announced the switch, "performance per watt" is the very same phrase being used by Intel spokesmen.

Even more dubious, some of the othe justifications for his theory, were things like transfer of some of the Alpha RISC personel to Intel years ago. Please keep things in perspective before going off the deep end.
Start a thread talking about future processors from amd and I will be happy to come in and flame ya . TROLL

If you are taking this kind of article from the inquirer to be fact, than it is you who's the biggest troll around here. You have no evidence, yet you insist that the article is with full merits, and that the EE merom will have HT. Where in the world are you getting your "information" from??
This is a discussion about future processor development and what we get out of it . speculations is all we have . What part of that don't you understand.

So, you are saying that you don't have evidence, nor would you need to have evidence for any of this discussion. One of the classic signs that points to a TROLL is preponderance of speculations with no fact to back it up; and passing all of this to the forum as if it were fact.


Start a thread talking about future processors from amd and I will be happy to come in and flame ya . TROLL (this was meant for the meth, guy hard ball)

That description would be best applied to you; Dude, or dudette.
 

fatty4ksu

Golden Member
Mar 5, 2005
1,282
0
0
All we know is it's going to be a multi-core, it's also going to be 64 bit and support hyper threading. The problem is trying to do all this at the same isn?t going to reduce power consumption, in fact doing all this means power consumption is more likely to increase.

Got Domination? Intel will soon folks.
 

Diasper

Senior member
Mar 7, 2005
709
0
0
Originally posted by: fatty4ksu
All we know is it's going to be a multi-core, it's also going to be 64 bit and support hyper threading. The problem is trying to do all this at the same isn?t going to reduce power consumption, in fact doing all this means power consumption is more likely to increase.

Got Domination? Intel will soon folks.

I'd rather not have 'domination' Intel or otherwise thank you!

Hush now! :p

Noone, can say what the situation will be in 2007. Maybe I should have started a separate thread about speculating on the K10 design given that's what Conroe will be coming against, but alas it'll be too early and fruitless.

Either way Intel are at least looking to be coming out with a far better chip than Netburst and for that we can all be thankful - maybe it'll even give some fanboys something worth touting ;)
 

IntelUser2000

Elite Member
Oct 14, 2003
8,686
3,787
136
The performance rumor of Merom over Yonah is 30% PER CLOCK. Remember Conroe is a desktop variant of Merom. I hate to disappoint people, but it COULD be this way. Let's look at the rumored specs of Merom first before I explain my theory:

-65nm
-Arbitration logic(manages data between two cores)
-4MB shared L2 cache(meaning one core can use all the cache when the other core is off, or two cores can share)
-Four-wide implementation of the x86 architecture
-Similar pipeline stages to Pentium M, little possibility its greater
-Dual Core
-Hyperthreading or some sort of multi-threading on Extreme Editions??
-1066MHz bus

Let's look at Yonah
-65nm
-Arbitration logic
-2MB shared L2 cache, which has the ability to increase bandwidth when two cores access data, with better data prefetch
-667MHz bus
-More advanced thermal management
-Dual Core
-Three-wide implementation of the x86 architecture, which is what all P6 cores have
-Similar pipeline stages to Pentium M


Let me explain what "three-wide" and "four-wide" means. 3-wide means that it can have theoretical IPC of 3 and 4-wide means, you guessed it, IPC of 4. Now none of the CPUs ever manufactured gets close to the theoretical IPC numbers(In contrast, Pentium 4 can have maximum IPC of 2, or 2-wide due to limitations, and all Athlons have 3).

To get as close as possible to theoretical IPC:
-You must have programs that are optimized to be parallel as possible
-The CPU must have enough bandwidth
-The CPU must have enough resources, like registers(which x86 seriously lacks with 8 registers, and x86-64 only a bit better with 16)
-It should have massively Out of Order execution capability

"My theory:
This is where it may be disappointing for people who is expecting 30% performance increase per clock from Yonah. I believe the first person who heard the 4-wide rumor, thought since 4-wide is 33% greater than 3-wide(4/3=1.33), the performance increase must be 30%. Then everyone now says performance increase is 30% since IPC is 30% greater. That's not true because it gets harder to extract IPC as IPC theoretical increases. It may be in reality, 10%. Of course that may be the real performance increase. But remember how there was a rumor that Prescott would have been 40% faster PER CLOCK over Northwood??"
 

IntelUser2000

Elite Member
Oct 14, 2003
8,686
3,787
136
Conroe/Merom is not based off P6 architecture due to the fact that it may be a 4-wide CPU.

It is nearly impossible or not feasible from either technical or marketing standpoint to have Conroe/Merom be based on EPIC. The IA-32EL(translates IA32 instructions to EPIC instructions) have a performance that's around 50% in Integer and 35% in Floating Point compared to native performance. In order to have a competitive CPU based on that architecture for 32-bit performance, the native 64-bit performance would have to be unimaginable from current point of view.

I think my theory above is right.

(Response to the Anandtech article:
First Itanium is 6-wide
Itanium 2 is 6-wide

Itanium 2 doesn't increase the issue rate, what Itanium 2 does is increase the possibility that IPC of 6 is possible by making better architecture.

Brief overview of Itanium architecture: The CPU processes the EPIC instructions by using two bundles of 3 instructions each. Each bundle can have a certain combination of different instructions.

Main execution units in Itanium consists of 4, that is Branch unit, Floating Point Unit, Memory Unit, Integer Unit. Memory and Integer unit can be considered in simple terms as ALU from what I understand.

In one bundle, you can have certain combinations of those execution units. Examples may be: MMI(memory, memory, integer), MII, MIF, BBB, and such. Remember that each bundle can have that combinations, and there is like 26 combinations or so. That means if the second bundle can't have that combinations due to the lack of execution units, 6-wide isn't possible.

Itanium had 2 M units, 2 I units, 2 FP units, 3 B units. So if the first bundle is MMI and the second bundle is MMI, it can't have 6-wide execution.

According to the article I read, first Itanium can have in theory of ~3.8 IPC due to lack of execution units, and Itanium 2 have theoretical IPC of 5.6-5.7 due to more execution units, specifically 4 M units rather than 2 as in Itanium.)

 

Fox5

Diamond Member
Jan 31, 2005
5,957
7
81
Conroe/Merom will be a true next generation architecture, or maybe current generation architecture, considering Dothan was just a suped up P3.

BTW, isn't a shared L2 cache slower than one that isn't shared?
 

DarkKnight69

Golden Member
Jun 15, 2005
1,688
0
76
it really is too bad that intel dropped P3 archetecture...

Netburst was a waste of time and money. I hope someone lost their job over that!

It is nice they are trying something new with the processors cuz amd is killing them in the gaming world right now!
 

Markfw

Moderator Emeritus, Elite Member
May 16, 2002
27,329
16,161
136
cuz amd is killing them in the gaming world right now!
At the moment, what (other than dothan laptop) does Intel have that competes at all ? Dual-core ? I don't think so, server ? I don't think so., as you pointed out, single-core no way. And the Turion at least takes a stab at dothan. My son got a new laptop with it, and it was $400 cheaper than the least exspensive pentium M, and it is smokin fast.
 

Fox5

Diamond Member
Jan 31, 2005
5,957
7
81
Originally posted by: DarkKnight69
it really is too bad that intel dropped P3 archetecture...

Netburst was a waste of time and money. I hope someone lost their job over that!

It is nice they are trying something new with the processors cuz amd is killing them in the gaming world right now!

Netburst enjoyed...something like 3-6 months on top before A64s came out, plus gave Intel a much needed boost in the server/workstation market that P3 never could have done performance wise, and really helped them in sales as people bought on the mhz, and allowed them to really cripple Celeron yet still get people to buy it.

And if Netburst hadn't run out of steam, it could possibly still be on top. 5ghz prescott versus ~3ghz athlon 64 would probably be a similar matchup to northwood versus athlon xp.
 

BitByBit

Senior member
Jan 2, 2005
474
2
81
Originally posted by: Hacp
Why is HT only good for processors with Deep pipelines? Northwoods had it and they only had 20 pipe stages.

The idea that Hyperthreading only benefits processors with longer pipelines is usually perpetuated by those who also equate pipeline depth with IPC.
Pipeline depth has absolutely nothing to do with determining whether a given processor architecture is suitable for SMT.
Hyperthreading allows the P4's schedulers to pick from instructions from independent threads. If a thread has a very low amount of ILP (often caused by a high degree of data dependency) then it is going to be very difficult for the schedulers to find instructions that can be executed in parallel. Hyperthreading allows the schedulers to pick instructions from two independent threads, meaning that the schedulers should be able to find non-dependent instructions it can issue in parellel, since data dependencies do not exist across threads.
This is the whole point of SMT; to assist superscalar architectures to manage an IPC of > 1. Indeed, SMT would have been useless on pre-Pentium architectures, as superscalar execution didn't exist on the x86 platform previously.

There are two simple ways to increase the performance of an architecture:
1. An increase in frequency through an extended pipeline (accompanied with according improvements to the branch prediction mechanism to offset the negative impact on IPC).
2. An increase in theoretical IPC.

Of course, ideally we'd implement both methods, but since the high IPC method seems to have been given the spotlight of late, let's discuss this.

The maximum number of instructions a processor can execute per clock is determined by the number of execution units of each type. A thread comprised of integer code, for example, can only be executed by the integer units, and the maximum IPC attainable by the processor when executing this code will be determined by the number of these.
The average IPC however, is determined by the issue rate, the accuracy of the branch predictor (and the number of clock cycles wasted in flushing the pipeline), and the ability of the schedulers to find instructions to execute in parallel, which was discussed above.
The Athlon can sustain a very high issue rate, thanks to its three general purpose decoders, but if the schedulers can't find an equal number of intructions to execute in parallel per clock, then its decode ability is going to go to waste. One way to increase the Athlon's (admittedly already high) IPC would be to increase the instruction window and correspondingly the number of tracked instructions. This would make life easier for the schedulers but would greatly increase the power draw.
An alternative method would be to introduce an independent instruction stream, reducing the need for a larger instruction window.
The importance of finding instructions that can be executed in parallel increases with the number of execution units.
So, if Conroe is going to be a wide-issue design, it will need to either have a very large instruction window, or it will need Hyperthreading.
Considering the impact on power requirements of the former method, I would expect Intel to place more emphasis on the latter.






 

IntelUser2000

Elite Member
Oct 14, 2003
8,686
3,787
136
Shared caches are better when you manage to implement it well. It seems that Yonah's caches are really good. One of the features of the shared cache that Intel calls called Bandwidth Adaptation Buffer, when two cores are accessing the cache, it doubles the bandwidth and implements prefetch algorithm when compared to single core accessing that same size cache.
 

Vee

Senior member
Jun 18, 2004
689
0
0
- Exactly as expected. :cool:
...Which is good because it also means no surprises... :cool:
(why ain't there any smilies for 'unbearably smug' ? )