Re: Stripping out Unicode combining characters (diacritics)

2008-05-07 Thread Brad Baxter
ton > # 817-272-5326 office > # 817-688-1926 mobile > # [EMAIL PROTECTED] > # http://rocky.uta.edu/doran/ > > > > -Original Message- > > From: Doran, Michael D [mailto:[EMAIL PROTECTED] > > Sent: Monday, May 05, 2008 7:27 PM > > To:

RE: Stripping out Unicode combining characters (diacritics)

2008-05-07 Thread Doran, Michael D
y, May 05, 2008 7:27 PM > To: [EMAIL PROTECTED] > Cc: Perl4lib > Subject: Stripping out Unicode combining characters (diacritics) > > I'm trying to strip out combining diacritics from some form > input using this code: > > > > > > >

Re: Stripping out Unicode combining characters (diacritics)

2008-05-07 Thread David Kaufman
Hi Michael,

"Doran, Michael D" <[EMAIL PROTECTED]> wrote:
> I'm trying to strip out combining diacritics from some form input using
> this code:
> [...]
> $sans_diacritics =~ s/\p{M}*//g;

I do it like this:

    use Encode;
    use Unicode::Normalize qw(normalize);
    my $ascii = encode('ascii', normali
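The snippet above is cut off in the archive, so the exact call David used is not recoverable. As a rough sketch of the same general idea (decompose, then encode to ASCII and discard whatever does not fit), something like the following should work. The helper name `to_ascii`, the choice of normalization form 'D', and the empty-string fallback are all assumptions made for illustration, not taken from the original message.

```perl
use strict;
use warnings;
use Encode qw(encode);
use Unicode::Normalize qw(normalize);

# Hypothetical helper (name and details are assumptions, not from the thread).
sub to_ascii {
    my ($text) = @_;                          # expects a decoded character string

    # Canonical decomposition: a precomposed letter becomes a base letter
    # followed by one or more combining marks. 'D' is the same form as NFD.
    my $decomposed = normalize('D', $text);

    # Encode to ASCII. The code reference is Encode's fallback hook: it is
    # called for each character ASCII cannot represent, and returning ''
    # drops that character from the output.
    return encode('ascii', $decomposed, sub { '' });
}

# "Dvo\x{159}\x{e1}k" is "Dvořák" written with escapes so the source stays ASCII.
print to_ascii("Dvo\x{159}\x{e1}k"), "\n";
```

One consequence of this design is that any non-ASCII character without a canonical decomposition (for example "ø", U+00F8) is removed entirely rather than mapped to a lookalike, so it suits search-key normalization better than display text.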

RE: Stripping out Unicode combining characters (diacritics)

2008-05-06 Thread Doran, Michael D
TECTED] # http://rocky.uta.edu/doran/ > -Original Message- > From: Leif Andersson [mailto:[EMAIL PROTECTED] > Sent: Tuesday, May 06, 2008 3:33 AM > To: Doran, Michael D > Subject: Re: Stripping out Unicode combining characters (diacritics) > > Oh, now I see your REAL

Re: Stripping out Unicode combining characters (diacritics)

2008-05-06 Thread Leif Andersson
ander Cc: [EMAIL PROTECTED]; Perl4lib Subject: RE: Stripping out Unicode combining characters (diacritics) Hi Mike, I appreciate the quick reply. I am familiar with the Unicode::Normalize module (and will also be using that), but I left it out of this question because it's not relevant to t

RE: Stripping out Unicode combining characters (diacritics)

2008-05-05 Thread Doran, Michael D
Mon 5/5/2008 8:52 PM To: Doran, Michael D Cc: [EMAIL PROTECTED]; Perl4lib Subject: Re: Stripping out Unicode combining characters (diacritics) On Mon, May 5, 2008 at 8:26 PM, Doran, Michael D <[EMAIL PROTECTED]> wrote: [snip] > > I'm pulling my hair out on this... so an

Re: Stripping out Unicode combining characters (diacritics)

2008-05-05 Thread Mike Rylander
On Mon, May 5, 2008 at 8:26 PM, Doran, Michael D <[EMAIL PROTECTED]> wrote:
[snip]
> > I'm pulling my hair out on this... so any help would be appreciated. If
> there's any other info I can provide, let me know.
>

You'll want to transform the text to NFD format (nominally, base characters plus
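Mike's reply is truncated here, but the approach he names (normalize to NFD, then remove the combining marks) can be sketched roughly as below. The helper name `strip_diacritics` is hypothetical, and the sketch assumes the input is already a decoded Perl character string rather than raw UTF-8 bytes from a form.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD);

# Hypothetical helper (the name is an assumption, not from the thread).
sub strip_diacritics {
    my ($text) = @_;                 # expects a decoded character string

    # NFD splits precomposed characters into a base character followed by
    # its combining marks, e.g. U+00E1 becomes "a" + U+0301.
    my $decomposed = NFD($text);

    # \p{M} matches any character in the Unicode Mark category; using +
    # instead of * avoids matching the empty string between characters.
    $decomposed =~ s/\p{M}+//g;

    return $decomposed;
}

# "Dvo\x{159}\x{e1}k" is "Dvořák" written with escapes so the source stays ASCII.
print strip_diacritics("Dvo\x{159}\x{e1}k"), "\n";
```

Without the NFD step, a precomposed character such as U+00E1 contains no separate mark for `\p{M}` to match, so the substitution alone leaves it untouched.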

Stripping out Unicode combining characters (diacritics)

2008-05-05 Thread Doran, Michael D
I'm trying to strip out combining diacritics from some form input using this code:

    #!/usr/local/bin/perl
    use CGI;
    $query = CGI::new();
    $search_term = $query->param('text');
    $sans_diacritics = $search_term;
    $sans_diacritics =~ s/\p{M}*//g;
    #$sans_diacritics =~ s/o//g;