Re: [Python-3000] Help on text editors

2006-09-09 Thread David Hopwood
Michael Urman wrote: > On 9/7/06, David Hopwood <[EMAIL PROTECTED]> wrote: > >>Yes. However, this is not a good idea for precisely the reason described >>on that page (false detection of Unicode), and so any Unicode detection >>algorithm in Python should only be based on detecting a BOM, IMHO. >

Re: [Python-3000] Help on text editors

2006-09-09 Thread David Hopwood
Michael Urman wrote: > On 9/7/06, Jeff Wilcox <[EMAIL PROTECTED]> wrote: > >>>From: "Paul Prescod" <[EMAIL PROTECTED]> >>>1. On US English Windows, Notepad defaults to an encoding called "ANSI". >>>"ANSI" is not a real encoding at all (and certainly not one from the >> >>On Japanese Windows 2000,

Re: [Python-3000] Help on text editors

2006-09-09 Thread David Hopwood
Michael Urman wrote: > As for whether cp932 is the same as Shift JIS, David and I seem to > disagree. While I lack hard data, the string '\\' round trips through > either on my box. I missed this part. On any single implementation, '\\' will usually round-trip from Unicode -> Shift-JIS -> Unicode;

Re: [Python-3000] Help on text editors

2006-09-09 Thread Marcin 'Qrczak' Kowalczyk
David Hopwood <[EMAIL PROTECTED]> writes: >> Right, except BOMs break tons of Unix applications (and even >> occasional Windows ones) which do not expect them. > > This problem is overstated. A BOM anywhere in a text causes no > problem with display, and *should* be treated as an ignorable > chara

[Python-3000] Offtopic: declaring encoding

2006-09-09 Thread Paul Prescod
On 9/9/06, Marcin 'Qrczak' Kowalczyk <[EMAIL PROTECTED]> wrote: > Note that there are plenty of other characters that should be> treated as ignorable, so the applications that are broken for BOMs> are broken more generally.I disagree. UTF-8 BOM should not be used on Unix. It's not a reliable method

Re: [Python-3000] Help on text editors

2006-09-09 Thread Paul Prescod
On 9/7/06, Jeff Wilcox <[EMAIL PROTECTED] > wrote:> From: "Paul Prescod" < [EMAIL PROTECTED]>> 1. On US English Windows, Notepad defaults to an encoding called "ANSI".> "ANSI" is not a real encoding at all (and certainly not one from theOn Japanese Windows 2000, Notepad defaults to ANSI as it does

Re: [Python-3000] Offtopic: declaring encoding

2006-09-09 Thread Marcin 'Qrczak' Kowalczyk
"Paul Prescod" <[EMAIL PROTECTED]> writes: > text data cannot even be reliably processed on the desktop on which > it was created (yes, even on Unix: look back in this thread). Where? > Do you have a positive prescription? New communication protocols and newly created file formats designed for

Re: [Python-3000] Offtopic: declaring encoding

2006-09-09 Thread Paul Prescod
On 9/9/06, Marcin 'Qrczak' Kowalczyk <[EMAIL PROTECTED]> wrote: "Paul Prescod" <[EMAIL PROTECTED]> writes:> text data cannot even be reliably processed on the desktop on which> it was created (yes, even on Unix: look back in this thread). Where?http://mail.python.org/pipermail/python-3000/2006-Sept

Re: [Python-3000] Help on text editors

2006-09-09 Thread Jan Claeys
Op do, 07-09-2006 te 12:21 -0700, schreef Paul Prescod: > Guido has asked me to do some research in aid of a file encoding > detection/defaulting PEP. > > I only have access to a small number of operating systems and language > variants so I need help. > > If you have access to "German Windows XP

[Python-3000] Pre-PEP: Easy Text File Decoding

2006-09-09 Thread Paul Prescod
PEP: XXXTitle: Easy Text File Decoding Version: $Revision$Last-Modified: $Date$ Author: Paul Prescod <[EMAIL PROTECTED]> Status: DraftType: Standards Track Content-Type: text/x-rstCreated: 09-Sep-2006 Post-History: 09-Sep-2006Python-Version: 3.0 Abstract Python 3000 will use Unicode as the

Re: [Python-3000] The future of exceptions

2006-09-09 Thread Greg Ewing
Michael Chermside wrote: > It doesn't necessarily imply that > the traceback be materialized immediately upon exception creation > (which is undesirable because we want exceptions lightweight enough > to use for things like for loop control!)... but it might mean that > pieces of the stack frame ne