You are being disingenuous here, John. The same constraints apply to the shared hardware in a BD module, unless you are saying there is never a case where Bulldozer could become hardware constrained on the front end, or have its pipeline filled 100%.
Also, as you state, it is about overall system throughput. You are choosing your words carefully to imply the second thread runs slower than the first or sometimes doesn't run at all, and somehow overall system throughput is affected. You know that is false (excluding corner cases).
Be proud of AMD, you should be. Work your butt off to market your products (I'm sure you are). But spin is unbecoming. Saying in one post that you don't like benchmarks, and then posting a cherry-picked benchmark to prove a point, raises questions, you know?