On Mon, 01 Apr 2013 06:11:50 -0700, rusi wrote:

> On Apr 1, 5:15 pm, Roy Smith <r...@panix.com> wrote:
>> The import job isn't done yet, but so far we've processed 116 million
>> records and had to clean up four of them. I can live with that.
>> Sometimes practicality trumps correctness.
>
> That works out to 0.000003%. Of course I assume it is US only data.
> Still, it's good to know how skewed the distribution is.

If the data included Japanese names, or used Emoji, it would be much closer
to 100% than 0.000003%.

--
Steven