Well, it is mostly no-net. I run on a cluster where only one node has net access, but the disks are cross-mounted. So for each node that I run on (20 currently) I must:
1) from the net node, download a WU, kill client
2) from the work node, start client with no intermediate blocks, terminate after one failed attempt to contact the server
3) keep track of when WU will finish
4) from the net node, send finished WU, get new one
5) rinse, repeat
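
For step 3, the bookkeeping could be sketched roughly like this in Python. This is only a minimal sketch, not anything the client provides: the `WorkUnit` class, the `due_nodes` helper, the node names, and the per-WU hour estimates are all made up for illustration, and it assumes you note the start time yourself and guess the run time from past WUs.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class WorkUnit:
    node: str          # hypothetical node name, e.g. "node01"
    started: datetime  # when the client was started on the work node
    est_hours: float   # your own run-time estimate, in hours (an assumption)

    @property
    def finish(self) -> datetime:
        # Expected finish time = start time + estimated run time.
        return self.started + timedelta(hours=self.est_hours)


def due_nodes(units, now):
    """Return the nodes whose WU should be finished by `now`, earliest first."""
    done = [u for u in units if u.finish <= now]
    return [u.node for u in sorted(done, key=lambda u: u.finish)]


if __name__ == "__main__":
    start = datetime(2004, 1, 1, 0, 0)
    units = [
        WorkUnit("node01", start, 10.0),
        WorkUnit("node02", start, 5.0),
        WorkUnit("node03", start, 30.0),
    ]
    # Nodes to visit from the net node at noon on the same day.
    print(due_nodes(units, datetime(2004, 1, 1, 12, 0)))
```

Run from the net node, something like this would at least say which of the 20 nodes are ready for step 4, instead of keeping the list in my head.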
It takes a good chunk of my time, but for some reason I do it anyway. If the client worked well in SMP mode, I could add another 20 nodes and be up over 15 McEMs/sec. But it doesn't, so oh well.