2009/2/9  <jimston...@aol.com>:
> I know there is probably a simple solution for this, but can anyone help me
> with code to filter out messages with extended characters like these:
>
> àïåëüñèí
> êàïèëêà
> òàáëåòêè
> ïåðåêèñü
> òåëåôîí
> òåòðàäè
> êàðàíäàøè
> îá¸ðòêè  îò êàíôåò
> êîðîáêà îò òåëåôîíà
> âàòà
> âàçà
> äèñêè
> êíèãè è  òåòðàäè
> È åùå êó÷à áóìàæåê è âñÿêîãî õëàìà
>
> Just something to dump any messages with these characters. Is there a  simple
> way?
>


Just by guess to decode them to utf8 since you didn't know which way
they are encoded. :-)

use Encode;
my @list = Encode->encodings(":all");  # get all encoding ways

for my $encoding (@list) {
    print "decoded with $encoding:\n";
    print encode("utf8",decode($encoding,$your_string) );
}


-- 
Jeff Peng
Office: +86-20-38350822
AIM: jeffpang
www.dtonenetworks.com

Reply via email to