Hey Stchu! Not exactly sure what the "messy code" is except the it looks like non printable binary data. Depending on where you look I think it is values, offset etc.
The reason that we are keeping the family stored in the files are to leave the door open for something called locality groups. There was a lot of talk about keeping them or not when doing the big 0.20 rewrite, but we decided to leave them in. When using compression they add very little overhead and makes it easy to send out results without having to copy anything. Regards Erik