Re: [Tutor] input file encoding

2007-09-11 Thread Tim Michelsen
> Not sure what you mean by "standard encoding" (is this an Ubuntu > thing?) but essentially whenever you're pulling stuff into Python As it was lined out by others I was printing to a linux terminal which had the encoding set to UTF-8. Therefore and for further processing of the data I had to ope

Re: [Tutor] input file encoding

2007-09-11 Thread Kent Johnson
Tim Golden wrote: > Ah, I see. I'm so used to Windows where there is, technically an encoding > for the console window, but you can't really do anything about > it (apart from the awkward chcp) and it isn't really in your face. I > do *use* Linux sometimes, but I don't really think in it :) Actua

Re: [Tutor] input file encoding

2007-09-11 Thread Tim Golden
Kent Johnson wrote: > Tim Golden wrote: >> Not sure what you mean by "standard encoding" (is this an Ubuntu >> thing?) > > Probably referring to the encoding the terminal application expects - > writing latin-1 chars when the terminal expects utf-8 will not work well. Ah, I see. I'm so used to

Re: [Tutor] input file encoding

2007-09-11 Thread Kent Johnson
Tim Golden wrote: > Tim Michelsen wrote: >> Hello, >> I want to process some files encoded in latin-1 (iso-8859-1) in my >> python script that I write on Ubuntu which has UTF-8 as standard encoding. > > Not sure what you mean by "standard encoding" (is this an Ubuntu > thing?) Probably referrin

Re: [Tutor] input file encoding

2007-09-11 Thread Tim Golden
Tim Michelsen wrote: > Hello, > I want to process some files encoded in latin-1 (iso-8859-1) in my > python script that I write on Ubuntu which has UTF-8 as standard encoding. Not sure what you mean by "standard encoding" (is this an Ubuntu thing?) but essentially whenever you're pulling stuff in

[Tutor] input file encoding

2007-09-10 Thread Tim Michelsen
Hello, I want to process some files encoded in latin-1 (iso-8859-1) in my python script that I write on Ubuntu which has UTF-8 as standard encoding. When I use the "print lines_in_myfile" is get some wired symbols. How shold I read those files in or convert their encoding to utf-8? Thanks in ad