> X-Mailer: MessagingEngine.com Webmail Interface > Date: Thu, 05 Apr 2012 09:02:23 +0200 > From: Per Tunedal <[email protected]> > To: [email protected] > Reply-To: [email protected] > Subject: Re: [Apertium-stuff] The pair SV-ENG > > Hi Keld, > I haven't decided yet on the corpus to work on. My initial idea was to > do some web crawling after texts in the two languages involved. I > believe there is a tool available. Maybe at Uppsala University? Anyone > who has any idea on how to proceed? > > I thought there was a pair SV - EN to work on, thus I intended to > quickly set up a minor working system (maybe limited to a restraint > domain). Now, when there isn't any such pair, nor any DA - EN pair, I > will have to think it over once more. My main interest is French, a > language completely different from Swedish. If I would work on the pair > SV - FR instead, I will have to consider using an other tool than > Apertium. I will have a look at Matxin and MOSES (statistical). > > Is it possible to use dictionaries and other resources from Apertium > with Matxin? > > Yours, > Per Tunedal > >
As the sv-en pair is already in nursery with tx files making thinking that somebody worked on this pair for translations on the two directions, I am also surprised to see that the bilingual dictionary has a size of about 2Kbytes only, and tx files are only starting files doing nothing special. Some other pairs in incubator may be more advanced. Of course, there are English and Swedish morphological dictionaries taken from other pairs, but it is as much easy to take French dictionary (the most complete seems the one of the pair eo-fr) and the same Swedish dictionary. To answer an old question of you, as nobody worked on sv-en pair since about a year (see "List of language pairs"), AND this pair has not a maintainer indicated in "Language and pair maintainer" wiki page, there should be absolutly no problem if you start working on it. In fact, I think if Francis put several files for this pair one year ago, as for others pairs he does not develop, it can be considered as a help for people who would like to start working on these pairs. Something more personal now, sv-fr pair should have been the forth language pair for me to work on (in chronological order). But before that, I need to start by the first one and presently I did not do a bit more than translating Apertium wiki pages, witch is a way to learn how system works. For Swedish language, except for several words like : Hey, tack, kronor, systembolaget, yag (no) jag alskar sverige, and for following your song "du gamla, du fria" with a translation (sentences seems to have a similar structure between French and Swedish), my swedish is really too limited (never followed lessons). So, if you are swedish native and know a little French, don't hesitate creating this pair. -------------------------------- Bernard Chardonneau (France) Phone : [33] 1 64 90 87 04 (from Sept to June except holidays) GSM phone : [33] 6 49 95 13 95 (french scholl holidays, C zone) Multilingual websites for my free softwares : http://libremail.free.fr and http://libremail.tuxfamily.org http://cyloop.tuxfamily.org (mainly translated with Apertium) My general website (in french only) http://bech.free.fr ------------------------------------------------------------------------------ For Developers, A Lot Can Happen In A Second. Boundary is the first to Know...and Tell You. Monitor Your Applications in Ultra-Fine Resolution. Try it FREE! http://p.sf.net/sfu/Boundary-d2dvs2 _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
