This series removes lazy FPU mode, and cleans up various bits and pieces around the FPU code.
I have run this through a basic floating point test that involves about 1.5 billion context switches and 45 minutes of swapping at 250MB/second. This seems to tease out bugs fairly well, though I would not mind an actual floating point test suite...