Re: OT: strip hebrew vowels and accents from utf-8 text

2009-11-05 Thread David Purton
On Thu, Nov 05, 2009 at 04:00:59PM -0500, Celejar wrote:
> > Can anyone suggest a simple way to strip vowels out of utf-8 encoded
> > hebrew text, leaving just the consonants?
>
> #! /usr/bin/perl -w
>
> use strict;
> use Encode;
>
> while (<>) {
>     $_ = Encode::decode('utf-8', $_);
>

Re: OT: strip hebrew vowels and accents from utf-8 text

2009-11-05 Thread Celejar
On Thu, 5 Nov 2009 14:50:56 +1030 David Purton wrote:
> Can anyone suggest a simple way to strip vowels out of utf-8 encoded
> hebrew text, leaving just the consonants?
>
> i.e., given something like בָָּ֟֟רָא, pipe it through something so that the
> output is ברא. The unicode characters to id
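
For readers of this archive, here is a minimal sketch of the kind of filter being asked for. It is not the script posted in the thread, which is cut off above. The helper name strip_hebrew_marks is hypothetical, and the code point ranges are an assumption taken from the Unicode Hebrew block: accents at U+0591 to U+05AF, points at U+05B0 to U+05BD, plus U+05BF, U+05C1, U+05C2, U+05C4, U+05C5 and U+05C7. Punctuation such as maqaf (U+05BE) is deliberately left alone.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not from the thread): delete Hebrew accents and
# vowel points from an already-decoded character string.
# Assumed ranges: U+0591-U+05BD, U+05BF, U+05C1, U+05C2, U+05C4, U+05C5, U+05C7.
sub strip_hebrew_marks {
    my ($text) = @_;
    $text =~ s/[\x{0591}-\x{05BD}\x{05BF}\x{05C1}\x{05C2}\x{05C4}\x{05C5}\x{05C7}]//g;
    return $text;
}

# Emit UTF-8 on standard output.
binmode(STDOUT, ':encoding(UTF-8)');

# bet + qamats + dagesh, resh + qamats, alef
my $pointed = "\x{05D1}\x{05B8}\x{05BC}\x{05E8}\x{05B8}\x{05D0}";
print strip_hebrew_marks($pointed), "\n";    # prints the bare consonants: ברא
```

To use this as a pipe filter, one would presumably decode each input line from UTF-8 (for example with Encode::decode, as in the quoted script), pass it through the helper, and encode it again on output.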