You can design the microprocessor so that it fetches multiple words and puts them side by side and treats the result as the effective opcode. So, even if your microprocessor only had a 4-bit data bus, you could still have 8, 12, 16 or 32-bit effective opcodes. It wouldn't be very efficient. But, it would be possible.
A 4-bit word has 16 possibilities. An n-bit word has 2^n possibilities.