Software: GPGPU Is the Key
X-bit labs: Performance of Zacate (and Ontario) in x86-based applications appears to be pretty low, it barely beats Atom in a lot of cases and it offers lower performance than Pentium dual-core. But obviously it should be much faster in GPGPU-based applications. I wonder whether you have plans to ensure that actual system vendors install those GPGPU apps onto their netbooks or nettops with Fusion chips?
Godfrey Cheng: Benchmarks like Bapco Sysmark are not the things that the consumer does. When we looked at consumers and asked what did they want their computer to do, we learnt that consumers want better Internet video, better video in general, better Web-surfing. These are the things that our [Ontario and Zacate] APUs are designed for; they are not designed to be benchmark winners. Instead, we are delivering great performance, great functionality and the great experience at very good TDPs. We are not going to chase the benchmark wars with our friends at Intel, that is not our goal. But if you look at products based on those APUs, people will get great experience out of them.
We are definitely working with third-party software application companies to take advantage of parallel compute and fixed-function capabilities of our APUs. You will see companies shipping software next year that will be optimized for our accelerated processing units.
X-bit labs: When do you expect a wide range of consumer applications to start using GPUs to speed up stream processing?
Godfrey Cheng: We have been working with software developers for a while, we should expect to see such applications in 2011. [...] We are working with multiple companies and there are multiple applications that benefit from general purpose computing on GPUs and thus from our ALUs in the pipeline as a part of our Fusion Fund initiative.
X-bit labs: At present software determines which (x86 or GPU) cores to use. But would you expect future chips to automatically perform the load balance and decide which hardware to use for particular operations?
Godfrey Cheng: That part is really done on the compiler, API and application levels. We are definitely working on OpenCL application programming interface and at some point an OpenCL variation will allow you to choose whether you put [your workload on] your GPU or your CPU. Basically, we are agnostic as to where run the code as long as it runs efficiently; but software companies will have controls as to where exactly it runs, it is not at the chip level.
When we work with software developers, we say, "we have an APU that does this, this and this, why don't you try your code-path in different ways?" Ultimately, they will decide what is best for their software architecture.
What has never happened is that by offering them x86, ALUs and fixed-function units they can approach patterning their problems differently. If we can offer them 90GFLOPS+ of compute power in an eighteen watt package, they might approach that and use those capabilities that they never had access to. That is what is different [with APUs].