On Thu, 26 Aug 2010 at 03:35 +0100, Jimmy O'Regan wrote:
> On 25 August 2010 21:00, Francis Tyers <[email protected]> wrote:
> > Hey all,
> >
> > Tihomir and myself are proud to announce the first release of
> > apertium-mk-bg (Macedonian and Bulgarian). This is the first Slavic
> > language pair in Apertium -- and hopefully the first of many!
> >
>
> Well, Slavic Lite - all the Slavic flavour, low in cholesterol and inflection.
>
> > You can try it online now.
> >
> > Here are some stats:
> >
> > ==Linguistic data==
> >
> > * Macedonian morphology: 4,010
> > * Macedonian morphology: 4,364
> > * Bilingual dictionary: 4,083
> >
>
> 4,083? The poor thing needs a good meal :)
>
> > * Disambiguation rules Macedonian->Bulgarian: 9
> >
> > * Transfer Macedonian->Bulgarian: 19
> > * Transfer Bulgarian->Macedonian: 18
> >
> > ==Coverage==
> >
> > * Bulgarian Wikipedia: Total: 9834480, Known: 7391855 (75.16%)
> >
> > (other numbers pending, but on SETimes should be around 80%)
> >
>
> I'd be interested in how the numbers look on the Bulgarian portion of
> the JRC Acquis.

The numbers for the JRC Acquis are:

* JRC Acquis: Total: 12697749, Known: 10328060 (81.33%)

Actually, higher than I expected.

Top 10 unknown words:

  38457 ^ЕИО/*ЕИО$
  31014 ^параграф/*параграф$
  18587 ^Регламент/*Регламент$
  17451 ^Директива/*Директива$
  17288 ^относно/*относно$
  14771 ^Наименование/*Наименование$
  14462 ^съгласно/*съгласно$
  14220 ^следва/*следва$
  12507 ^след/*след$
  11861 ^във/*във$

Fran

_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
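
For anyone wanting to reproduce figures like these, here is a minimal sketch of how coverage and an unknown-word list could be computed from analyser output. It assumes the analysed text is available as a string in the stream format shown in the list above, where an unknown word appears as `^word/*word$`. The function name `coverage` and the sample string are made up for illustration, and this is not necessarily the script that produced the numbers in this thread.

```python
import re
from collections import Counter

# Matches one lexical unit such as ^във/*във$.
# Backslash-escaped characters inside a unit are skipped over; escaped
# slashes in the surface form are not handled (a simplification).
LEXICAL_UNIT = re.compile(r"\^((?:\\.|[^\\$])*)\$")


def coverage(analysed_text):
    """Return (total, known, percent_known, unknown_counter)."""
    total = 0
    unknown = Counter()
    for match in LEXICAL_UNIT.finditer(analysed_text):
        # Everything before the first slash is the surface form,
        # everything after it is the list of analyses.
        surface, _, analyses = match.group(1).partition("/")
        total += 1
        # The analyser marks an unknown word with a leading asterisk.
        if analyses.startswith("*"):
            unknown[surface] += 1
    known = total - sum(unknown.values())
    percent = 100.0 * known / total if total else 0.0
    return total, known, percent, unknown


if __name__ == "__main__":
    # Hypothetical four-token sample, not taken from any corpus.
    sample = "^след/след<pr>$ ^ЕИО/*ЕИО$ ^във/*във$ ^ЕИО/*ЕИО$"
    total, known, percent, unknown = coverage(sample)
    print(f"Total: {total}, Known: {known} ({percent:.2f}%)")
    for word, count in unknown.most_common(10):
        print(f"{count:7d} ^{word}/*{word}$")
```

Counting by surface form keeps the unknown list directly comparable to the top-10 format above; lowercasing before counting would be a reasonable variation if capitalised and lowercase forms should be merged.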
