Saurabh Hota <saurabh060...@gmail.com>
writes:

> Hi
> I have gone through the archives, and Akshay has a good data set
> of shortened words which can be used to learn which vowels are
> dropped. Also, note that abbreviations and shortened forms
> are different, e.g. brb -> be right back versus bday -> birthday. So we
> have to handle them separately, and to do that we first have to
> classify them.
>
> For translation I have just written a bash script:
> while read line; do echo $line | apertium en-eo; done < Tweets

That would run much faster as a single invocation, since apertium then
starts its pipeline once instead of once per line:

    apertium en-eo < Tweets
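
To illustrate the difference, here is a minimal sketch that runs without
Apertium installed. The `translate` function is a hypothetical stand-in
for `apertium en-eo` (it only upper-cases its input), and `Tweets.sample`
is a made-up two-line file. It assumes the translator handles each line
independently, so both forms give the same output and differ only in how
many processes are started.

```shell
#!/bin/sh
# Hypothetical stand-in for `apertium en-eo`; it just upper-cases stdin.
translate() { tr 'a-z' 'A-Z'; }

# Made-up sample input, one tweet per line.
printf 'hello world\nbrb\n' > Tweets.sample

# Per-line form: starts one translator process for every input line.
per_line=$(while IFS= read -r line; do
    printf '%s\n' "$line" | translate
done < Tweets.sample)

# Whole-file form: starts a single translator process for all lines.
whole_file=$(translate < Tweets.sample)

# Same result either way; only the number of process start-ups differs.
if [ "$per_line" = "$whole_file" ]; then
    echo "outputs match"
fi

rm -f Tweets.sample
```

With the real `apertium` the start-up cost per process is much larger
than with `tr`, so the gap grows with the number of lines in the file.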

-- 
Kevin Brubeck Unhammer

GPG: 0x766AC60C


_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
