athlon 64: why is it's poor multitasking ignored/downplayed?

Page 9 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Markfw

Moderator Emeritus, Elite Member
May 16, 2002
27,282
16,122
136
Originally posted by: CaiNaM
Originally posted by: Markfw900
CaiNam, again you are missing my point, when I encode, and multitask, I am gaming, so if it takes 30 min, or 1 hr 30 min, I don't care, but I don't like my games to lag. I give up on this thread, you have your mind set. Buy a P4, it works for you, fine.....

no, i don't "miss" your point.. that you "don't care" doesn't make the fact this condition exist conveniently "disappear"... and you certainly didn't say, "yea, but while that may be true, it doesn't really affect me..", rather you suggest the issue doesn't exist, regardless of whether it affects they way YOU use your pc.

as a matter of fact, i stated just that earlier when someone specifically asked what he should buy - that this may not bother some people as they just don't use their pc in that manner, and specifically referenced gaming and encoding.

No, I don't care if it takes 30 seconds or 31 seconds.... go buy your P4 and stop saying the Athlon64 has poor multitasking, its fine, just not the same as the P4, and better in single tasks by your own admission.

Edit: let me qualify: poor connotates a 3 or less on a scale of 1-10. The athlon64 is about an 8 compared to the P4 in multitasking by my book, and that isn't poor, just not as good.
 

CaiNaM

Diamond Member
Oct 26, 2000
3,718
0
0
Originally posted by: Markfw900
Originally posted by: CaiNaM
Originally posted by: Markfw900
CaiNam, again you are missing my point, when I encode, and multitask, I am gaming, so if it takes 30 min, or 1 hr 30 min, I don't care, but I don't like my games to lag. I give up on this thread, you have your mind set. Buy a P4, it works for you, fine.....

no, i don't "miss" your point.. that you "don't care" doesn't make the fact this condition exist conveniently "disappear"... and you certainly didn't say, "yea, but while that may be true, it doesn't really affect me..", rather you suggest the issue doesn't exist, regardless of whether it affects they way YOU use your pc.

as a matter of fact, i stated just that earlier when someone specifically asked what he should buy - that this may not bother some people as they just don't use their pc in that manner, and specifically referenced gaming and encoding.

No, I don't care if it takes 30 seconds or 31 seconds.... go buy your P4 and stop saying the Athlon64 has poor multitasking, its fine, just not the same as the P4, and better in single tasks by your own admission.

Edit: let me qualify: poor connotates a 3 or less on a scale of 1-10. The athlon64 is about an 8 compared to the P4 in multitasking by my book, and that isn't poor, just not as good.

your arguments get sillier every time you reply. so apparenlty 5 fps less in a game is "substandard", but 3x slower in a task is not "poor"? i guess that would hold substance if your intention is to hold one in a better light over the other regardless of reality. further, now it's "30 seconds vs 31 seconds", so not only do you offer nothing but opinions with no substance, but your "facts" (and i use that term sarcastically) as well in order to fit the point you want to make.

 

Markfw

Moderator Emeritus, Elite Member
May 16, 2002
27,282
16,122
136
Originally posted by: CaiNaM
Originally posted by: Markfw900
Originally posted by: CaiNaM
Originally posted by: Markfw900
CaiNam, again you are missing my point, when I encode, and multitask, I am gaming, so if it takes 30 min, or 1 hr 30 min, I don't care, but I don't like my games to lag. I give up on this thread, you have your mind set. Buy a P4, it works for you, fine.....

no, i don't "miss" your point.. that you "don't care" doesn't make the fact this condition exist conveniently "disappear"... and you certainly didn't say, "yea, but while that may be true, it doesn't really affect me..", rather you suggest the issue doesn't exist, regardless of whether it affects they way YOU use your pc.

as a matter of fact, i stated just that earlier when someone specifically asked what he should buy - that this may not bother some people as they just don't use their pc in that manner, and specifically referenced gaming and encoding.

No, I don't care if it takes 30 seconds or 31 seconds.... go buy your P4 and stop saying the Athlon64 has poor multitasking, its fine, just not the same as the P4, and better in single tasks by your own admission.

Edit: let me qualify: poor connotates a 3 or less on a scale of 1-10. The athlon64 is about an 8 compared to the P4 in multitasking by my book, and that isn't poor, just not as good.

your arguments get sillier every time you reply. so apparenlty 5 fps less in a game is "substandard", but 3x slower in a task is not "poor"? i guess that would hold substance if your intention is to hold one in a better light over the other regardless of reality. further, now it's "30 seconds vs 31 seconds", so not only do you offer nothing but opinions with no substance, but your "facts" (and i use that term sarcastically) as well in order to fit the point you want to make.

I don't know where you got 3x slower unless you reference my parody before, but I guess that wa wasted. An no, I don't need any more facts than have been presented here, and my own experience.

As I said, just go buy your P4 and leave this thread, as your logic escapes everyone that I see here.
 

FelixDeCat

Lifer
Aug 4, 2000
31,029
2,687
126
Originally posted by: Markfw900
Originally posted by: CaiNaM
Originally posted by: Markfw900
Originally posted by: CaiNaM
Originally posted by: Markfw900
CaiNam, again you are missing my point, when I encode, and multitask, I am gaming, so if it takes 30 min, or 1 hr 30 min, I don't care, but I don't like my games to lag. I give up on this thread, you have your mind set. Buy a P4, it works for you, fine.....

no, i don't "miss" your point.. that you "don't care" doesn't make the fact this condition exist conveniently "disappear"... and you certainly didn't say, "yea, but while that may be true, it doesn't really affect me..", rather you suggest the issue doesn't exist, regardless of whether it affects they way YOU use your pc.

as a matter of fact, i stated just that earlier when someone specifically asked what he should buy - that this may not bother some people as they just don't use their pc in that manner, and specifically referenced gaming and encoding.

No, I don't care if it takes 30 seconds or 31 seconds.... go buy your P4 and stop saying the Athlon64 has poor multitasking, its fine, just not the same as the P4, and better in single tasks by your own admission.

Edit: let me qualify: poor connotates a 3 or less on a scale of 1-10. The athlon64 is about an 8 compared to the P4 in multitasking by my book, and that isn't poor, just not as good.

your arguments get sillier every time you reply. so apparenlty 5 fps less in a game is "substandard", but 3x slower in a task is not "poor"? i guess that would hold substance if your intention is to hold one in a better light over the other regardless of reality. further, now it's "30 seconds vs 31 seconds", so not only do you offer nothing but opinions with no substance, but your "facts" (and i use that term sarcastically) as well in order to fit the point you want to make.

I don't know where you got 3x slower unless you reference my parody before, but I guess that wa wasted. An no, I don't need any more facts than have been presented here, and my own experience.

As I said, just go buy your P4 and leave this thread, as your logic escapes everyone that I see here.

Look whos talking.
 

FelixDeCat

Lifer
Aug 4, 2000
31,029
2,687
126
Originally posted by: CaiNaM
Originally posted by: Markfw900
Originally posted by: CaiNaM
Originally posted by: Markfw900
CaiNam, again you are missing my point, when I encode, and multitask, I am gaming, so if it takes 30 min, or 1 hr 30 min, I don't care, but I don't like my games to lag. I give up on this thread, you have your mind set. Buy a P4, it works for you, fine.....

no, i don't "miss" your point.. that you "don't care" doesn't make the fact this condition exist conveniently "disappear"... and you certainly didn't say, "yea, but while that may be true, it doesn't really affect me..", rather you suggest the issue doesn't exist, regardless of whether it affects they way YOU use your pc.

as a matter of fact, i stated just that earlier when someone specifically asked what he should buy - that this may not bother some people as they just don't use their pc in that manner, and specifically referenced gaming and encoding.

No, I don't care if it takes 30 seconds or 31 seconds.... go buy your P4 and stop saying the Athlon64 has poor multitasking, its fine, just not the same as the P4, and better in single tasks by your own admission.

Edit: let me qualify: poor connotates a 3 or less on a scale of 1-10. The athlon64 is about an 8 compared to the P4 in multitasking by my book, and that isn't poor, just not as good.

your arguments get sillier every time you reply. so apparenlty 5 fps less in a game is "substandard", but 3x slower in a task is not "poor"? i guess that would hold substance if your intention is to hold one in a better light over the other regardless of reality. further, now it's "30 seconds vs 31 seconds", so not only do you offer nothing but opinions with no substance, but your "facts" (and i use that term sarcastically) as well in order to fit the point you want to make.

of course his arguments get sillier because he doesnt know what hes saying. he stole my very words to attack me, after lying about me, and accusing me of someone elses attacks against him. true mark of an idiot. :thumbsup:

 

CaiNaM

Diamond Member
Oct 26, 2000
3,718
0
0
Originally posted by: Markfw900
I don't know where you got 3x slower unless you reference my parody before, but I guess that wa wasted.
read the thread.
An no, I don't need any more facts than have been presented here, and my own experience.
of course not, your opinion obviously defies both logic and fact.
[/quote]As I said, just go buy your P4 and leave this thread, as your logic escapes everyone that I see here.[/quote]
whether i "buy my p4 or not", why would i leave this thread? at least i'm one of the one's showing objectivity. i started this thread looking for a solution, but it seems to me what everyone is saying is that there is none, tho a couple of banner waving amd fans simply dismiss it as "normal" or "fine", which is utterly ridiculous.. but hey, some people have lower standards than others, i suppose.

and frankly, while people in here have actually shown some logic and clear thought (some who even have a different opinion from me), at this point i certainly cannot include you in that. as with most "hard core fans", your statements are repetitive, lack any facts other than your own opinion, and you've pretty much done this without responding to any pertinent points whatosever.

i see no reason to participate with you in what you've turned into a pissing match, unless you actually come up with something of substance to back up what you say (and as you haven't done that yet, i have no reason to believe you'll start now), so feel free to bask in all your amd brand loyalty, as at this point it really matters not.
 

Zebo

Elite Member
Jul 29, 2001
39,398
19
81
Lots of bickering in an otherwise important thread.

Good thread. Title a little inflamitory but very good info. Between the bikering and all that talk of quanta and I starting thinking about Schrodinger, wave functions and how much I hated pchem so I got lost. Did we ever decide if it's an OS issue or A64 just can't hang with intensive multitasking regaurdless of OS?

 

Priit

Golden Member
Nov 2, 2000
1,337
1
0
I read almost all this thread through and I can't quite understand all the whining about hardware when the problem might just as well be software-related. What is A64's "poor multitasking performance", IMHO multitasking is sth that OS does, not (single) processor. So how many OS'es (like linux, solaris, *bsd) have you tried besides M$WinXP? How's their multitasking performance compared relatively P4 vs A64?
 

DAPUNISHER

Super Moderator CPU Forum Mod and Elite Member
Super Moderator
Aug 22, 2001
32,067
32,593
146
Originally posted by: Priit
I read almost all this thread through and I can't quite understand all the whining about hardware when the problem might just as well be software-related. What is A64's "poor multitasking performance", IMHO multitasking is sth that OS does, not (single) processor. So how many OS'es (like linux, solaris, *bsd) have you tried besides M$WinXP? How's their multitasking performance compared relatively P4 vs A64?
Well spoken, but somewhat outside of the scope of this particular query by CaiNaM for the following reason. The majority of users, even in enthusiast forums liks this, are windows users first and foremost, and XP and 2kpro dominate. From the Anandtech stats-

Windows Variants:17922
Unix Variants: 2060
Mac: 130

To continue, I've seen a smattering of benchmarks that suggest the A64 platform will see some substantial gains when a complimentary 64bit OS and app are involved. That indicates, to me anyways, that this architecture has yet to see it's full potential realized for the desktop user. That makes your point an important one *obviously why you brought it up :) * in that the while the evidence may suggest the A64 isn't performing admirably in some CPU intensive multitasking in windows@the moment, some, or perhaps even most of the issues may be due to the OS and apps not being properly optimized as opposed to a deficiency of the architecture.

I haven't had the opportunity to spend any real time on an HT setup, but the data provided by users has shown me that HT is a sweet feature when optimized for or in some situations when more than one CPU intensive task is involved E.G. distributed computing projects such as SETI and F@H. I also feel its primary benefit to some end-user is it's the least expensive approach to stronger multitasking@this time. True MP or multiple systems is a more effective but also more expensive solution in most cases.

The SP A64 is already a very powerful platform that is still maturing, so I believe the best is yet to come where it's concerned. I also believe CaiNaM is doing the neophytes, lurkers, and generally curious a service by continuing the discussions that Duvie and some others initiated sometime back. The meat&potatoes of those discussions were that while the A64 has rightfully garnered the respect and accolades it deserves, the tests/benchmarks that favor HT are curiously absent from most reviews while one of the the least productive activites, gaming, recieves the bulk of attention. I'm by no means saying this isn't appropriate given the audience, just that most reviews don't provide the completed picture; i.e. there are a few puzzle pieces usually missing and topics like this help fill in the holes that's all :)
 

Markfw

Moderator Emeritus, Elite Member
May 16, 2002
27,282
16,122
136
Originally posted by: DAPUNISHER
Originally posted by: Priit
I read almost all this thread through and I can't quite understand all the whining about hardware when the problem might just as well be software-related. What is A64's "poor multitasking performance", IMHO multitasking is sth that OS does, not (single) processor. So how many OS'es (like linux, solaris, *bsd) have you tried besides M$WinXP? How's their multitasking performance compared relatively P4 vs A64?
Well spoken, but somewhat outside of the scope of this particular query by CaiNaM for the following reason. The majority of users, even in enthusiast forums liks this, are windows users first and foremost, and XP and 2kpro dominate. From the Anandtech stats-

Windows Variants:17922
Unix Variants: 2060
Mac: 130

To continue, I've seen a smattering of benchmarks that suggest the A64 platform will see some substantial gains when a complimentary 64bit OS and app are involved. That indicates, to me anyways, that this architecture has yet to see it's full potential realized for the desktop user. That makes your point an important one *obviously why you brought it up :) * in that the while the evidence may suggest the A64 isn't performing admirably in some CPU intensive multitasking in windows@the moment, some, or perhaps even most of the issues may be due to the OS and apps not being properly optimized as opposed to a deficiency of the architecture.

I haven't had the opportunity to spend any real time on an HT setup, but the data provided by users has shown me that HT is a sweet feature when optimized for or in some situations when more than one CPU intensive task is involved E.G. distributed computing projects such as SETI and F@H. I also feel its primary benefit to some end-user is it's the least expensive approach to stronger multitasking@this time. True MP or multiple systems is a more effective but also more expensive solution in most cases.

The SP A64 is already a very powerful platform that is still maturing, so I believe the best is yet to come where it's concerned. I also believe CaiNaM is doing the neophytes, lurkers, and generally curious a service by continuing the discussions that Duvie and some others initiated sometime back. The meat&potatoes of those discussions were that while the A64 has rightfully garnered the respect and accolades it deserves, the tests/benchmarks that favor HT are curiously absent from most reviews while one of the the least productive activites, gaming, recieves the bulk of attention. I'm by no means saying this isn't appropriate given the audience, just that most reviews don't provide the completed picture; i.e. there are a few puzzle pieces usually missing and topics like this help fill in the holes that's all :)
Thanks dapunisher, I already asked that question, since all my experience has been related to win2k, NOT XP, and I asked if this might be the problem, and no answer or further discussion has started. Maybe I should install winxp64, back on my Athlon64 and try some tests ? Of course it would be best if CaiNaM would do that and see for himself, its free anyway.
 

Priit

Golden Member
Nov 2, 2000
1,337
1
0
DAPUNISHER: You're of course right, but main reason why I even bothered to write on this thread was it's title. I think "athlon 64: why is it's poor multitasking with winxp downplayed/overlooked?" would be more correct (even thought most people use MS). I don't have fancy new hardware (A64/P4 HT) to try out myself, but old AXP with enough memory "multitasks" fast enough to run 10 X-terms (10 times of the load comparing to single desktop) with linux even when people using rather heavy apps (OpenOffice, Gimp, Java games). I've been reading that A64 running 64-bit linux is considerably faster than old 32-bit Athlons in anything, no linux user as so far complained about bad multitasking performance. That's why whining about hardware here seems little harsh to me...
 

CaiNaM

Diamond Member
Oct 26, 2000
3,718
0
0
hmm.. the title.. well, the reasoning is for a couple reasons.. first, it's accurate, and i don't believe particularly inflammatory unless one is looking for it to be. i think it has enough edge to interest people in reading it and contributing to the discussion. guess it depends on perspective.

secondly as DP stated, the predominent OS is windows (by a huge landslide) leaving most other OS's are rather insignificant. while i run a couple dozen servers under linux, they are simply used in other ways. i mean, if you want to look at it from the other perspective, do you ever see "fastest gaming platform (only in windows)? do when reading cpu reviews, how many include benchmarks under linux (or 64-bit windows, for that matter)? no, because again, 'nix variants are insignificant and mac OS doesn't apply at all.

but since you're the second person to mention this (and did so intelligently), and my intent is not to be overly duragotory toward amd or its products, i did "qualify" the title to show we're talking about windows.

as to whether this is an OS issue or not.. that's not really the point, nor is it really that relevant. i think from i've learned in the past couple of weeks, in some ways it could be construed as such (and not necessarily particulary to windows) as such, as it seems the primary reason is that the a64 architecture makes it alot more dependant on the OS than the p4 w/HT.

going back to the fact that athlon's shorter, 12 stage pipe is much more efficient in that it remains full doing it's primary task, it relies on OS task/thread scheduling much more so than the intel, which can make up for the "inefficiency" of the longer, 20 stage pipe as HT takes advantage of the portions unused by the first task by using it to process the subsequent tasks, maximizing the use of the longer pipeline and thereby increasing it's efficiency.

in essence, the a64 does approx. the same amount of work per cycle regardless of how many tasks it's doing, while the amount of work per cycle performed by the p4 actually increases with multiple tasks.

as i've continually mentioned before, it's certainly possible that under 64b win we'll see different behavior, but it's beta, and with the inmaturity of the drivers it's just not an everday OS.

i'll also reiterate that i think dual core a64's will be rdy before 64b win, and that would obviously eliminate the behavior we're currently discussing, which i also believe is intels only compelling advantage over amd at this time.

edit: oh, and i've also not eliminated the possibility that s939, due to it's additional bandwidth from dual channel memory, might help under these condidions. glug has duplicated the behavior, tho on his 939 platform (same amt of cache and same nf3 chipset) it seems to have lesser impact on his system than mine (s754). unfortunately i have no way of testing that.

 

CaiNaM

Diamond Member
Oct 26, 2000
3,718
0
0
Originally posted by: Markfw900
Thanks dapunisher, I already asked that question, since all my experience has been related to win2k, NOT XP, and I asked if this might be the problem, and no answer or further discussion has started. Maybe I should install winxp64, back on my Athlon64 and try some tests ? Of course it would be best if CaiNaM would do that and see for himself, its free anyway.

ok, now that's getting back to relevant discussion :)

sorry, i missed anything regarding testing this under w2k. unfortunately, while i have a couple of real win2k cd's, i can't find my product #'s (haven't needed em in a couple years)... but perhaps i'll toss in another drive and install win64, if for no other reason than to see what the future might have in store regarding this condition.

 

Zebo

Elite Member
Jul 29, 2001
39,398
19
81
It's accurate in your small selection of tests. Not the tests I've seen from reviews and mutlitasking benchmarks programs they show.. More data is all before we jump to broad statements otherwise We'll have guys starting threads about "Intel sucking at rendering" while only using POVray as thier "proof". Don't we have enough of that type of stuff around here? Intelligent threads deserve intelligent titles is all.:)
 

CaiNaM

Diamond Member
Oct 26, 2000
3,718
0
0
Originally posted by: Zebo
It's accurate in your small selection of tests. Not the tests I've seen from reviews and mutlitasking benchmarks programs they show.. More data is all before we jump to broad statements otherwise We'll have guys starting threads about "Intel sucking at rendering" while only using POVray as thier "proof". Don't we have enough of that type of stuff around here? Intelligent threads deserve intelligent titles is all.:)

i disagree with your first statement. the few instances of of reviews doing actual multitasking shows the same behavior, and the "business winmark" (is that the correct name?) used in some bencmarks to show multitasking ability is a joke. we're talking simultaneous instances of apps requiring the cpu, not multiple instances of maps taking up memory...

if you have some suggestions on particular conditions you'd like me to check, i'd be happy to do so if i'm able to.
 

Lithan

Platinum Member
Aug 2, 2004
2,919
0
0
Pulled off aces hardware. WaltC author.

"> Last, I want to briefly address the topic in my heading--multithreaded
> software. Recall that multithreaded software long predates
> in terms of concept and execution the HT P4 (as the P4 itself
> long predates the HT P4.) Its best use to date has been,
> of course, in dual and multi-cpu systems which have the literal
> capacity to execute multithreads simultaneously. But I can
> dimly recall a few pieces of multithreaded software running
> on my old and ancient PII systems--dimly, at least that far
> back...;)
>
> P4 HT simply won't, because it cannot, do SMT like SMP systems
> can do SMT. Despite the marketing aphorisms, P4 HT has several
> bugaboos which are P4-specific. It's in reality of course
> a single core cpu, and what it does with multithreaded software
> through HT is to essentially accelerate the multitasking
> (time share) of multiple threads in the single core, thereby
> increasing per-clock work efficiency of the P4 above what
> the P4 normaly achieves when running a single thread at the
> same clock speed. At no time does the HT P4 ever actually
> run more than a single thread at the time, but since the
> HT circuitry increases the P4's per-clock working efficiency
> in the advent of multithreaded software, the performance
> within the multithreaded application improves as the internal
> per-clock thread multitasking performance of the cpu improves.
> The extra theoretical per-clock efficiency of the HT circuitry
> is of no use for single threaded software, since there are
> no multiple threads within the software for the HT cpu circuitry
> to (basically) "accelerate the multitasking of those threads
> per clock." I liken the distinction to the HT circuitry being
> piggybacked onto the normal P4, as opposed to the HT circuitry
> being basic to the P4's architecture in some fundamental
> fashion. The bios control for HT on/off, a very clumsy way
> to handle the condition, imo, is probably the only possible
> way to have a single-core cpu effectively fool the OS into
> seeing a second cpu where none exists in order to accelerate
> per-clock multithreaded performance. (There are other considerations
> which I consider peripheral to the discussion here that P4
> HT circuitry enablment creates, such as higher power consumption
> and heat dissipation demands, etc. Also, don't confuse a
> "logical" cpu as reported by the OS with a real physical
> additional cpu, anymore than you would confuse a physical
> hard drive in your system with a logical drive partition
> placed on it.)
>
> Here we can understand the crux of the most serious of the
> P4's HT problems: inconsistency and unpredictability. Because
> we are not dealing with two separate cpu cores but rather
> with a single core which by definition is incapable of executing
> more than a single thread at a time, HT performance increases
> when running multithreaded software are wildly inconsistent
> to the point where they can range from as much as a 50% increase
> in rare cases of heavily P4 HT-optimized software to an actual
> *minus 10-15%* in the performance of other software when
> compared with running that software on the same cpu with
> HT turned off. In fact, as you may have noted recently, Intel
> is now *officially* advocating that HT be *turned off* in
> some cases as it is seen to be a drag on even non-HT performance,
> sometimes. This is all happening because you in effect have
> one cpu core which is trying to pretend it is two cores--but
> only sometimes--and the situation can as easily provoke inefficiency
> in the core as it can provoke the desired efficiency increases
> you look for in processing...;) In short, two physical cpu
> cores are always much better than a single physical core
> & a logical core.
>
> So let's consider the case of the A64 and a multithreaded
> software application. It'll run the multithreaded software
> exactly like the HT-enabled P4 runs it, by multitasking the
> threads. The difference is that the A64's fundamental architecture
> is designed to run everything at maximum per-clock efficiency
> all the time and doesn't conditionally and functionally speed
> up per-clock multithreaded multitasking or slow down per
> clock while running single threads exclusively, as happens
> in the P4 HT-enabled cpu--sometimes (as sometimes it's best
> to just turn P4-HT off)...;) If a specific application is
> heavily optimized for the A64, for instance, it will run
> the application faster than normal just as a heavily P4-HT
> optimized multithreaded application will run faster than
> normal on the P4 with HT enabled. How "much" faster for
> either depends upon the depth and quality of the respective
> optimizations, of course. Look at the spread in MHz between
> a 3.4GHz HT P4 and a 2.2GHz A64--it's darn near 50%, isn't
> it? Depending on the cache config for these cpus, of course,
> which can vary in either case, these cpus are considered
> very close to each other in IA-32 x86 general application
> performance (with, as you note, Athlon64 winning most contests--except
> those running heavily P4-HT optimized software and not offering
> a heavily A64-optimized version of the test or bench to compare.)
> This tells us that basically the A64 is ~50% faster than
> the HT P4 *per clock.* When you consider that the A64 will
> make further performance gains running in 64-bit mode, well,
> the choice is pretty clear to me.
>
> Last but not least, none of this is lost on Intel which, with
> the Dothan cores and their upcoming dual core cpus, seems
> pretty clearly to have consigned P4 HT to the ash-can of
> history. AMD actually announced a dual-core cpu direction
> years ago when it announced K8 and for awhile there was a
> lot of speculation that k8 would debut as a dual-core cpu.
> Unlike Intel, though, AMD seems to have done a lot more
> prepatory work in setting up the kinds of system buses and
> other things amenable to a dual-core cpu reaching its performance
> potential. Dual cores from AMD and Intel are just far better
> SMT strategies than something like P4 HT ever was, because
> of one very important thing they bring to the table that
> P4 HT never did: not only much better multithreaded performance,
> of course, but mainly dual-core cpus will introduce a consistency
> and predictability into multithreaded software performance
> that P4 HT simply failed to do. That's why P4 HT was never
> the catalyst for smt software development some thought it
> would be--the performance potential is simply too unpredictable
> for many software firms to justify its sometimes much higher
> development costs. SMP, really, has always been a far greater
> catalyst for smt software development than P4 HT ever was,
> imo. It's certain, though, I think, that when dual-core
> cpus become commodity cpus that general smt software development
> will soon after accelerate like a rocket...;)"
 

CaiNaM

Diamond Member
Oct 26, 2000
3,718
0
0
Originally posted by: Lithan
Pulled off aces hardware. Daniel Rwizi author.

"> Last, I want to briefly address the topic in my heading--multithreaded
<snip>

so are you agreeing with him or disagreeing with him? i ask as while i don't agree with every conclusion he's made, his post, in basis agrees with what i've been saying (while you, of course, have been disagreeing with me the entire time w/o any substance):

"the HT circuitry increases the P4's per-clock working efficiency in the advent of multithreaded software, the performance within the multithreaded application improves as the internal per-clock thread multitasking performance of the cpu improves."

"The difference is that the A64's fundamental architecture is designed to run everything at maximum per-clock efficiency all the time and doesn't conditionally and functionally speed up per-clock multithreaded multitasking or slow down per clock while running single threads exclusively, as happens in the P4 HT-enabled cpu."

i'm surprised the first post containing any substance reaches the the same basic conclusion as i've reached.

a couple of points to consider further, is that at no time did i ever state HT was better or equal to dual core/dual cpu setup. i also agree with him in that,

"Dual cores from AMD and Intel are just far better SMT strategies than something like P4 HT ever was.."

although that's not what this discussion is all about. he makes another fundamentally sound statement when he writes,

"In short, two physical cpu cores are always much better than a single physical core &amp; a logical core."

but that doesn't preclude the point of this thread, which is a single physical core &amp; a logical core is better than a single physical core", at least in the context of the p4 as it has wasted pipeline resources which can be used to increase it's computational power when running more than one task, and again i quote his post,

"the HT circuitry increases the P4's per-clock working efficiency in the advent of multithreaded software, the performance within the multithreaded application improves as the internal per-clock thread multitasking performance of the cpu improves."

another important point of this which he doesn't mention is that it doesn't necessarily have a large impact on the first task either (tho this is where i agree with him on inconsistencies; it depends on the app and how it uses the cpu cycles).

of course, multithreaded software is much more prevelant than it was a year or two ago. at any rate, dual cores will render HT passe (well at least in considering NEW components, and depending on price of dual cores of course), and smp is a far better solution than a single core pretending to be 2, but in the meantime, it's quite obvious p4's architecture does offer some benefits (which will affect some more than others).

also, would be nice if you provided a link so we could easily see the comments others have made regarding his conclusions.

thanks for the post tho; interesting read.

ahh.. i did find the start of that thread.. gonna start reading it now. for those interested, you can find it HERE

a couple points since following that thread. first, the auther was not Daniel Rwizi, rather it was By WaltC (or perhaps his real name is Daniel? at any rate, that was the name penned on the original post).

the other point was a post in response to a fundamental inaccuracy in WaltC's post:

"P4 SMT does run two threads at once. It may fetch from alternate threads each cycle, but it issues, dispatches, executes, and retires instructions from multiple threads each cycle."

while the last part of his statement about retirement seems erroneous (was stated in a followup that, "P4 retirement alternates between logical processors", his basic argument is sound.

good stuff tho...
 

Lithan

Platinum Member
Aug 2, 2004
2,919
0
0
You've made the claim that A64's fail miserably in multitasking situations. Presumably because of some magic 'Shitty-multitasking fairy'. Your evidence of this amounts to stacked reviews and your own claims of lag when playing two DOAC clients. You've made it perfectly clear that you are more interested in proving your case against the A64 than actually discovering where your problem arises from or how you can fix it.

Author credit corrected.


MidnightWriter put it well...
"Every processor changes processes (not to mention threads) many time per second. Issues of "smoothness" are generally related to applications or processes that cause the interface to stall waiting for I/O - things like disk access or network access. For these tangible delays, the benefits of AMD's HyperTransport are at least as helpful as Intel's HyperThreading.

HyperThreading is most useful when there are two specific things happening at the same time. One, a background operation, and two, a foreground operation that frequently stalls the pipeline. One of the operations has to make little or no use of any I/O.

While it isn't difficult to contrive this combination of circumstances in a benchmark, the reality is that it doesn't occur very often in typical computing. You're more likely to need to run an application that uses some older libraries (or just an old app) where Pentium generally stumbles badly. You almost never see benchmarks for those.... "
 

Accord99

Platinum Member
Jul 2, 2001
2,259
172
106
Originally posted by: CaiNaM
"In short, two physical cpu cores are always much better than a single physical core &amp; a logical core."
but that doesn't preclude the point of this thread, which is a single physical core &amp; a logical core is better than a single physical core", at least in the context of the p4 as it has wasted pipeline resources which can be used to increase it's computational power when running more than one task, and again i quote his post,
There is also other factors. Dual cores essentially double up the transistors and die size, and given the recent AMD presentation, dual-core Opterons will run at 3-5 speed grades below single-core Opterons, sacrificing single-threaded performance in order to meet a power limit. HT, on the other hand uses less than 5% more transistors for a 20-30% gain in multithreaded performance and does not affect single-threaded performance. And they can be complementary technologies. The Power 5, probably the most powerful 64-bit server processor available today uses both. It's dual core and has SMT, because dual core do not solve the problem that most applications are unable to extract the maximum theoretical performance out of a single processor. The now-canceled Alpha EV8, which if it had been released would have easily gained the performance crown, also featured SMT.

Originally posted by: Lithan
MidnightWriter put it well...
"Every processor changes processes (not to mention threads) many time per second. Issues of "smoothness" are generally related to applications or processes that cause the interface to stall waiting for I/O - things like disk access or network access. For these tangible delays, the benefits of AMD's HyperTransport are at least as helpful as Intel's HyperThreading.
How does HyperTransport improve a disk seek or a packet travelling through a network?

HyperThreading is most useful when there are two specific things happening at the same time. One, a background operation, and two, a foreground operation that frequently stalls the pipeline. One of the operations has to make little or no use of any I/O.
HyperThreading works when there are two or more CPU intensive applications or threads running simultaneously.

 

Lithan

Platinum Member
Aug 2, 2004
2,919
0
0
1. HT can degrade system performance. I seem to remember one of the pi calculaters takes (or took at one time) a significant hit when HT was enabled. I also remember some people testing and finding that disabling HT boosted 3dmarks slightly. There are other examples of this. Also, as motherboard/psu/etc manufacturers start producing parts designed to handle the load of Dual core cpu's, I would wager the dual cores will follow only slightly behind single core cpu's. Much like Opterons are right on the tail of FX these days.

2. Hypertransport > Pci bus

3. Note the words "most useful". What you said, while technically accurate, fails to tell the full story.


 

Brian23

Banned
Dec 28, 1999
1,655
1
0
I didn't read the whole thread so I don't know if this was mentioned, but one thing that significantly affects multitasking is critical sections. Anyone familiar with multithreaded apps in C++ should know what they are. Basically a critical section is a block of code that MUST exicute completely before the CPU is allowed to switch to another thread/process. If the application is poorly written, the critical sections will be very long causing the application to hog CPU resources. Normal programs don't use critical sections so they can be paused at any point during execution to give CPU time to another process. Multithreaded applications MUST use critical sections. If you have 2 instances of an app that has a long critical section, they will both seem choppy because they both are trying to get a big chunk of CPU resources.

I'm not sure why this would affect a A64 more than a P4, but it could be it. If you really want to test A64's multitasking capabilities, a better test would be one that is designed to test that. (I can't think of any benchmarks off the top of my head that test that, but I"m sure they exist.)

I remember the original Unreal Tournament caused task switching to be really slow too. On both my Celeron and my Athlon XP. I would always run winamp while I played UT, and if I wanted to change the song, I would have to minimize UT which took FOREVER. I don't think this is platform, processor or OS related. I think it's the program. Thats why I said that you should run a benchmark that tests multitasking.
 

CaiNaM

Diamond Member
Oct 26, 2000
3,718
0
0
so after posting something that actually had some relevant in interesting information, you now go back to the same old useless rhetoric?
Originally posted by: Lithan
You've made the claim that A64's fail miserably in multitasking situations. Presumably because of some magic 'Shitty-multitasking fairy'. Your evidence of this amounts to stacked reviews and your own claims of lag when playing two DOAC clients.
so.. it's only "unstacked" and objective if it fits within your own (obviously narrow) opinions?

as far as evidence, i've showed a way it's simple for others to reproduce and example of the effect i'm speaking of, i've provided a video (my apologies that the quality is lacking) to show this behavior on my system, and DAOC is just one example (the one that started my questioning what's going on) of this effect. it is hardly the only application affected.

regarding this 'Shitty-multitasking fairy' you pulled out of your ass for whatever reason, i've stated clearly a logical explanation for why this issue likely exists, and it has nothing to do with 'faires' of any sort.
You've made it perfectly clear that you are more interested in proving your case against the A64 than actually discovering where your problem arises from or how you can fix it.
have i now? how is that? this thread started out asking questions on whether it was isolated to something in my hardware, and what i could do to "fix" it. i've swapped hardware, reinstalled operating systems, and tried every one of the few suggestions on this thread, with an obvious lack of results. the course of this thread has dictated we arrive at the conclusion that it was not isolated, and that the behavior is apparenlty not "fixable", but rather inherit at least when running winxp on this platform. i certainly assumed no conclusions when this thread started some time back.

you've also never offered an explanation, nor given any examples of how this may be resolved. all you've done is at varous times threadcrapped, denied there is any issue at all, blamed it on software - whatever suits your mood at the time - and gone on with your obvious "amd rules" mentality. the one thing you offered that was constructive doesn't even support your view of this issue, and again you show your propensity to ignore anything relevant or factual.
MidnightWriter put it well...
"Every processor changes processes (not to mention threads) many time per second. Issues of "smoothness" are generally related to applications or processes that cause the interface to stall waiting for I/O - things like disk access or network access. For these tangible delays, the benefits of AMD's HyperTransport are at least as helpful as Intel's HyperThreading.

HyperThreading is most useful when there are two specific things happening at the same time. One, a background operation, and two, a foreground operation that frequently stalls the pipeline. One of the operations has to make little or no use of any I/O.

While it isn't difficult to contrive this combination of circumstances in a benchmark, the reality is that it doesn't occur very often in typical computing. You're more likely to need to run an application that uses some older libraries (or just an old app) where Pentium generally stumbles badly. You almost never see benchmarks for those.... "
who is "midnightwriter", in what context were these comments made, and what relevance does I/O functions have to this dicussion?

you've certainly shown the ability to use someone else's words, but don't seem to have the ability to put it in context or show what relevance it has or how it pertains to the argument you're trying to make.

Originally posted by: Brian23
I didn't read the whole thread so I don't know if this was mentioned, but one thing that significantly affects multitasking is critical sections. Anyone familiar with multithreaded apps in C++ should know what they are. Basically a critical section is a block of code that MUST exicute completely before the CPU is allowed to switch to another thread/process. If the application is poorly written, the critical sections will be very long causing the application to hog CPU resources. Normal programs don't use critical sections so they can be paused at any point during execution to give CPU time to another process. Multithreaded applications MUST use critical sections. If you have 2 instances of an app that has a long critical section, they will both seem choppy because they both are trying to get a big chunk of CPU resources.

I'm not sure why this would affect a A64 more than a P4, but it could be it. If you really want to test A64's multitasking capabilities, a better test would be one that is designed to test that. (I can't think of any benchmarks off the top of my head that test that, but I"m sure they exist.)

I remember the original Unreal Tournament caused task switching to be really slow too. On both my Celeron and my Athlon XP. I would always run winamp while I played UT, and if I wanted to change the song, I would have to minimize UT which took FOREVER. I don't think this is platform, processor or OS related. I think it's the program. Thats why I said that you should run a benchmark that tests multitasking.
i see what you're getting at, but let's assume the application was not properly written. why would it behave fine on another platform, but poorly on the athlon64 platform?

as for a specific benchmark to test this performance, while it certainly may prove useful, it would seem to me this is still a realworld issue which exists regardless of what a static benchmark may or may not reveal.

having said that, i'd certainly be happy to use such a benchmark (if anyone knows of one) to compare results with others.
 

Brian23

Banned
Dec 28, 1999
1,655
1
0
I'm not an expert on multithreading, but perhaps HT allows multiple critical sections to run at once? I don't know for sure.

Time to stop reading AT and have a few more :beer: :beer:
 

gururu

Platinum Member
Jul 16, 2002
2,402
0
0
[read through probably 80% of the thread]

i think it is an interesting observation considering that pre-Athlon64 cores have 'less' of an issue running two resource-intensive tasks. I'm wondering if it is along the lines of Vee's suggestion on it being an OS thing. These issues might get resolved with the release of Longhorn; I'm guessing it will take advantage of some 64-bit processes that may remedy the apparent deficiency.

The phenomenon probably doesn't make the rounds too often in tech/gaming forums, because I tend to think that most of us try to squeeze so much from our machines that we would die before running a serious background task while gaming. But I see think your DAOC problem is a great demonstration of a potential A64 pitfall. I have never learned so much about HT before going through this thread. Awesome info.