RE: Farsi Stemming Algorithm

2004-04-30 Thread Ehsan Akhgari
> (I'll just reply to your other post here) I guess I didn't know about > a new pc-parse release. Where did you get the newest source code? > That's terrific news for me. Well, the release I downloaded is approximately one year old, but here's the URL I downloaded it from: ftp://ftp.sil.org/sof

RE: IranL10nInfo

2004-04-30 Thread C Bobroff
Peter, Please send me your Tajik keyboard and we can discuss it further off the list. I don't think Arial Unicode MS will do but TITUS may work. I'll have to check. My particular project was for the web so even if we do find a font, it will boil down to the eternal question of whether to embed,

RE: Farsi Stemming Algorithm

2004-04-30 Thread Jon D.
--- Ehsan Akhgari <[EMAIL PROTECTED]> wrote: > I tried nmiAim, but unfortunately didn't get > any results. Yeah, that's been fixed in the newest version. > Are you > interested in the fixed sources? (I'll just reply to your other post here) I guess I didn't know about a new pc-parse release

RE: Farsi Stemming Algorithm

2004-04-30 Thread Ehsan Akhgari
> One of the things that drives me nuts about the software is that it > claims to run on Solaris/Sparc, Win/x86, MacOS, or BSD, but apparently > no Linux (I have a Sparc box, so I'm lucky :-). The source code is > downloadable, but it currently doesn't seem to compile on Linux/x86. > It does have

RE: IranL10nInfo

2004-04-30 Thread Linguasoft
Arial Unicode MS should do, plus (probably) Code2000 by James Kass or (possibly) Bitstream TITUS Unicode -- I've to check the latter ones. I am quite certain that there are a couple of Russian-made (not hacked) fonts around, too. Peter -Original Message- From: C Bobroff [mailto:[EMAIL P

RE: IranL10nInfo

2004-04-30 Thread Linguasoft
"Tadzhik" is also valid, as a (re-)transliteration of the language's name as spelled in Cyrillic. The Cyrillic alphabet uses two graphemes to represent the sound of Perso-Arabic . Similar as used in French transliteration of Arabic, etc. Peter ___ P

RE: Farsi Stemming Algorithm

2004-04-30 Thread Ehsan Akhgari
> It's a two-level morphology engine, so basically it resolves a surface > form to a lexical form, or lexical to surface form. > For example, if I give it a newspaper word like 'nmiAim' > (نميايم -- I am not coming), it > will resolve to 'n+mi+A+m', taking into account any morpheme boundary > chan