It is also possible to have two chips with no layout changes between the two which are available at different speeds. This is because the fab process isn't totally exact; i mean they're good, but they're not good at say a level of atoms.
For example, when they hit the masks with light in etching process, its hard to get the light to disperse totally evenly across the whole wafer, meaning some chips experience a different angle of light penetrating through the openings in the masks, meaning they have slightly (usually very slightly) different physical characteristics.
This and other similar fabrication "errors" lead to chips that come back being operational at slightly different speeds. A company then takes those chips and tests them to see what they [the company] feels they will work at comfortably (and perhaps there's internal political and marketing influences too) and then set their chip multiplier to achieve the target speed.
That's why people can overclock chips cause they "can" work at faster speeds, the company (intel, motorola, etc.) just doesn't support it. And that's also why overclocking depends on the chip generation and some luck, because depending on better layouts in later generations (or maybe even earlier generations) and/or the exact happenings during your chips fab, it may be able to withstand the faster switching speeds and be overclocked high, or it might produce errors because some circuitry has currently leakeage, lockup, or other electrical quality problems, or it could theoretically fry the chip through something like electromigration.
I'm not very knowledgable of intel's definition of stepping, but i'd assume its a change on one or more of the metal masks for the same architectural chip.
Side Note: You COULD have the die size change if you took a design and shrunk it to a new process, without adding functionality. I'd consider this the same chip. However, its usually a fair amount of work to just shrink a chip because of the new process properties (i.e coupling, rail bounce, current leakage, etc.) so its usually not much more work to redesign parts of you chip at the same time thus leading to a new chip. But i think the pentiums go through die changes when change process to get more speed out a current line.