Hey all, Tihomir and myself are proud to announce the first release of apertium-mk-bg (Macedonian and Bulgarian). This is the first Slavic language pair in Apertium -- and hopefully the first of many!
You can try it online now. Here are some stats: ==Linguistic data== * Macedonian morphology: 4,010 * Macedonian morphology: 4,364 * Bilingual dictionary: 4,083 * Disambiguation rules Macedonian->Bulgarian: 9 * Transfer Macedonian->Bulgarian: 19 * Transfer Bulgarian->Macedonian: 18 ==Coverage== * Bulgarian Wikipedia: Total: 9834480, Known: 7391855 (75.16%) (other numbers pending, but on SETimes should be around 80%) ==Accuracy== We have so far only tested the accuracy from Macedonian to Bulgarian. Approximately 1,000 words were taken from the SETimes corpus, translated using the system and then posteditted. Unknown words were allowed. ------------------------------------------------------- Edit distance: 292 Word error rate (WER): 26.67 % Number of position-independent word errors: 278 Position-independent word error rate (PER): 25.39 % ------------------------------------------------------- Fran ------------------------------------------------------------------------------ Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
