Re: [Python-Dev] Help with Unicode arrays in NumPy

2006-02-08 Thread Travis Oliphant
Thank you, Martin and Stephen, for the suggestions and comments. For your information: We decided that all NumPy arrays of unicode strings will use UCS4 for internal representation. When an element of the array is selected, a unicodescalar (which inherits directly from the unicode builtin type

Re: [Python-Dev] Help with Unicode arrays in NumPy

2006-02-07 Thread Stephen J. Turnbull
> "Travis" == Travis E Oliphant <[EMAIL PROTECTED]> writes: Travis> Numpy supports arrays of arbitrary fixed-length "records". Travis> It is much more than numeric-only data now. One of the Travis> fields that a record can contain is a string. If strings Travis> are supported

Re: [Python-Dev] Help with Unicode arrays in NumPy

2006-02-07 Thread Martin v. Löwis
Travis E. Oliphant wrote: > Numpy supports arrays of arbitrary fixed-length "records". It is > much more than numeric-only data now. One of the fields that a > record can contain is a string. If strings are supported, it makes > sense to support unicode strings as well. Hmm. How do you support

Re: [Python-Dev] Help with Unicode arrays in NumPy

2006-02-07 Thread Travis E. Oliphant
Martin v. Löwis wrote: > Travis E. Oliphant wrote: > >>Currently that means that they are "unicode" strings of basic size UCS2 >>or UCS4 depending on the platform. It is this duality that has some >>people concerned. For all other data-types, NumPy allows the user to >>explicitly request a bi

Re: [Python-Dev] Help with Unicode arrays in NumPy

2006-02-07 Thread Martin v. Löwis
Travis E. Oliphant wrote: > Currently that means that they are "unicode" strings of basic size UCS2 > or UCS4 depending on the platform. It is this duality that has some > people concerned. For all other data-types, NumPy allows the user to > explicitly request a bit-width for the data-type.

[Python-Dev] Help with Unicode arrays in NumPy

2006-02-07 Thread Travis E. Oliphant
This is a design question which is why I'm posting here. Recently the NumPy developers have become more aware of the difference between UCS2 and UCS4 builds of Python. NumPy arrays can be of Unicode type. In other words a NumPy array can be made of up fixed-data-length unicode strings. Cur