There is one thing I'd like to add to this discussion. That is that the P4 is highly dependent upon software to be optimized for it. More specifically, loops have to be autovectorized for SSE2. If this is not the case, i.e. application is old, compiled for PIII, or librares are old, or application is not suitable for packed ints and fp, no P4 is competitive with either K7 or K8. I would even hazard a guess that the 'lowly' XP2500+ beats even the 3.4EE.
Primarily, media and benchmarks can be, and are highly SSE2 optimized.
If you run other, more general types of applications, or older applications, typical benchmarks does not reflect the performance you will "enjoy" from the P4.
There's also a reverse side of this. Packed optimization for 3DNow+ have been rather late coming to many media apps. If you happen to run a such media app (lacking 3DNow+, but optimized for SSE2), performance is going to be dramatically better on a P4. This means you are generally much better off with a P4, if your major activity is media. A64 supports SSE2, but again, older apps seem sometimes to fail identifying that.
Personally, I'm not much concerned or interested in these issues any more. 64-bit software is so much more important to me. But I am a latecomer to AMD. And the primary reason I discovered and explored AMD, was the great disappointment in performance from the P4 compared to PIII. AMD suits me better since it's more like a PIII on steroids.
In closing, I think the question "XP<P4<A64?" need to be put in relation to a specific class of applications.
P4 is *only* ">" to XP in regards of supporting SSE2. In all other respects, 386/387, MMX, SSE, scalar math and int, conditionals, AthlonXP is superior to the P4.
P.S. I also note that on these forums, there are frequent comments concerning the great performance of early Athlons and poor performance of early P4s.
Well... Even though P4B and P4C picked up a good general boost from their faster FSB, I can't see much have really changed. - So I can't really help asking the question: How many realize that benchmarks have changed?
I kinda think that fact (and also changing apps) is the primary reason why the perception of AthlonXP vs P4 performance have also changed.