Re: [Tutor] name shortening in a csv module output

Peter Otten Thu, 23 Apr 2015 05:07:54 -0700

Jim Mooney wrote:

> ..
> 
>> Ï»¿
>>
>> is the UTF-8 BOM (byte order mark) interpreted as Latin 1.
>>
>> If the input is UTF-8 you can get rid of the BOM with
>>
>> with open("data.txt", encoding="utf-8-sig") as csvfile:
>>
> 
> Peter Otten
> 
> I caught the bad arithmetic on name length, but where is the byte order
> mark coming from?


Did you touch the data with an editor? That might be the culprit.

> My first line is plain English so far as I can see - no
> umlauts or foreign characters.
> first_name|last_name|email|city|state or region|address|zip
> 
> Is this an artifact of csv module output, or is it the data from
> generatedata.com, which looks global? More likely it means I have to
> figure out unicode ;'(


_______________________________________________
Tutor maillist  -  [email protected]
To unsubscribe or change subscription options:
https://mail.python.org/mailman/listinfo/tutor

Re: [Tutor] name shortening in a csv module output

Reply via email to