Hi Dimitar,

My apologies for this slow reply, I have been traveling lately. I have
not used SenseClusters for these languagues, but all of the
tokenization and is done using Perl, so if you can define a suitable
--token file, then I *think* things should work pretty seamlessly. The
--token file contains Perl regular expressions that define what a
token is, for English this is quite simple and is what contained in
SenseClusters by default.

This assumes that Perl supports these languages and their encodings, I
know it supports UTF-8, I am not sure about the others to be honest.

I hope this helps, but do let us know if you run into problems and we
can look at little deeper.

Cordially,
Ted

On 2/13/07, Dimitar Vasilev <[EMAIL PROTECTED]> wrote:
> Hello
> Does SenseClusters support Slavic languages  Bulgarian,
> Russian,Serbian and their encodings
> (CP1251,KOI8R,UTF-8,IS0-8859-5)?
> My goal is to analyse a dataset and graph it.
> Could you recommend some software that accepts the output of senseclusters?
> thank you in advance,
>
> --
> Димитър Василев
> Dimitar Vassilev
>
> GnuPG key ID: 0x4B8DB525
> Keyserver: pgp.mit.edu
> Key fingerprint: D88A 3B92 DED5 917E 341E D62F 8C51 5FC4 4B8D B525
> -------------------------------------------------------------------------
> Using Tomcat but need to do more? Need to support web services, security?
> Get stuff done quickly with pre-integrated technology to make your job easier.
> Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
> http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
> _______________________________________________
> senseclusters-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/senseclusters-users
>


-- 
Ted Pedersen
http://www.d.umn.edu/~tpederse
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
senseclusters-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/senseclusters-users

Reply via email to