Re: [SQL] regexp_replace and UTF8

2009-02-04 Thread Jasen Betts
On 2009-02-02, Bart Degryse wrote:
> Thanks for the ideas!
> The function Jasen suggests works partially.
> There are also entities like
> ‘
> ’
> “
> ”
> …

these work fine here.

Re: [SQL] regexp_replace and UTF8

2009-02-02 Thread Bart Degryse
Thanks for the ideas! The function Jasen suggests works partially. There are also entities like ‘ ’ “ ” … The output of the htmlent function for these looks like | Was that what you meant with "characters outside of the LATIN-1 space are not handled but apparently this is what you want." ? Because

Re: [SQL] regexp_replace and UTF8

2009-01-31 Thread Jasen Betts
On 2009-01-30, Bart Degryse wrote:
> Hi,
> I have a text field with data like this: 'de pati&#235;nt niet'
> (without the quotes).
> I would like to convert this string to look like this

Re: [SQL] regexp_replace and UTF8

2009-01-30 Thread Harald Fuchs
In article <87ljstm4eq@oxford.xeocode.com>, Gregory Stark writes:
> "Bart Degryse" writes:
>> Hi,
>> I have a text field with data like this: 'de pati&#235;nt niet'
>> Can anyone help me fix this or point me to a better approach.
>> By the way, changing the way data is put into the field is
>> u

Re: [SQL] regexp_replace and UTF8

2009-01-30 Thread Gregory Stark
"Bart Degryse" writes: > Hi, > I have a text field with data like this: 'de patiënt niet' > Can anyone help me fix this or point me to a better approach. > By the way, changing the way data is put into the field is > unfortunately not an option. > Many thanks in advance. You could use a plperl

[SQL] regexp_replace and UTF8

2009-01-30 Thread Bart Degryse
Hi,
I have a text field with data like this: 'de pati&#235;nt niet' (without the quotes).
I would like to convert this string to look like this: 'de patiënt niet'
Basically what I need to do (I think) is
- get rid of the &, # and ;
- convert the number to hex
- make a UTF8 from that (thus: \xEB)
- conve
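
The steps in the original post can be sketched outside the database to make the intended transformation concrete. This is only an illustration in Python, not the plperl or SQL solution discussed in the thread. It assumes the field holds decimal numeric character references such as &#235;, and the function name `decode_numeric_entities` is made up for this example.

```python
import re

# Matches decimal numeric character references such as &#235;
_NUMERIC_ENTITY = re.compile(r"&#(\d+);")


def decode_numeric_entities(text: str) -> str:
    """Replace each &#NNN; reference with the character at that code point."""

    def _replace(match: re.Match) -> str:
        code_point = int(match.group(1))
        try:
            # chr() maps the code point straight to a character, so no
            # manual hex conversion or byte building is needed.
            return chr(code_point)
        except (ValueError, OverflowError):
            # Not a valid Unicode code point: leave the reference untouched.
            return match.group(0)

    return _NUMERIC_ENTITY.sub(_replace, text)


if __name__ == "__main__":
    decoded = decode_numeric_entities("de pati&#235;nt niet")
    print(decoded)
    # Encoding happens as a separate step, after decoding the references.
    print(decoded.encode("utf-8"))
```

Note that 235 is the code point U+00EB, not a UTF-8 byte: encoded as UTF-8, ë takes two bytes (C3 AB). Mapping the number to a character first and letting the encoding layer produce the bytes avoids mixing the two, and it also covers references above 255 such as the curly quotes mentioned later in the thread.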