Performance concerns about individual instructions aren't worth much effort. Things like operand alignment, data and instruction cache retention, locality of reference, branch frequency etc. can have really significant effects.
Remember that CPU speeds have increased much faster than memory speeds -- getting an operand from cache can take a cycle or two, but from memory can take hundreds or thousands (try causing a page fault!).