The idea looks very interesting to me from programming point of view.Is the APU responsible for keeping this operation thread safe or the burden lies with programmers and languages?
Some in the compiler, for sure. Reasonably strong ordering and coherency cost lots in hardware, and GPUs can get away with lacking some, using fancy scheduling and SMT to get around the impact of barriers and such. I can see thread safety implementations being quasi-software for years to come, on the GPU side of things (GCN's "OOO" write ordering should offer good enough HW guarantees to do the rest in software, if it eats too much in hardware resources, for now). But, it can mostly be compiled away, so you should never have to deal with it.
Memory coherency, however, really needs to be done in hardware. Microcode is fine; and doing it, "in hardware," by actually doing it in firmware, is also fine. Leaving it up to software has, and will again, result in longer development cycles, subtle bugs, and some software never being able to reach the potential its makers thought it could, because the average case ends up so much worse than the paper specs and tailored benchmark scores.
Guys, layman question of the day:
- Won't this more complex memory management add latency?
Yes, in best-case terms. It always does. That's nothing new. The problem is that, just like virtual memory v. physical memory, the vast majority of software does not live inside that best case bubble.
Of course, Intel has been there for awhile. They just need to improve their IGP and software for it. Good slides, and it's good they're finally doing it, but Intel beat them to this integration step. Intel needs to improve their software and IGP, but they've already been here, done that, and been sharing the LLC (it's like getting a the T-shirt, but for the CPU

).
That is the big issue isn't it? Nobody is targeting GPUs, they're targeting GPU drivers. I think one of the advantages of Intel's Phi is that you target x86.
But, a
very special x86. Also, it's a small niche-market product.