Further the Pro cards might use two GPUs, which (due to much smaller texture space requirements) would be fine for pro graphics while keeping redundant data. While for compute tasks they could actually store different working sets in the local RAM.
Some compute tasks requires sizable amounts of RAM, and slow system RAM doesn't cut it.
