El dg 16 de 12 de 2012 a les 14:21 +0100, en/na Per Tunedal va escriure: > Hi, > I consider info on primarily domain and secondly style useful for > disambiguation. As a first step it would be very nice to be able to add > a domain-tag to words. Adding info on style would make it possible to > further improve translation results. > > The translation would improve considerably if the user could choose the > appropriate domain when demanding a translation. Consider e.g. the > example of translating the English word "key", or similarly the French > word "clé/clef", to Swedish. If the domain is e.g. > Tourism/accommodation/real estate or similar, the word would most likely > translate to "nyckel" (to lock/unlock the door of a house). On the other > hand if the domain is e.g. information technology (or even music) the > word would most likely translate to "tangent" (on your keyboard or > piano). Obviously, a lexical selector/disambiguator could be trained on > a corpus from a specific domain as well, further improving to the > translation.
I did this in my thesis. It's quite effective. It's possible to tune the vocabulary to a domain with either parallel or monolingual corpora using apertium-lex-tools.[1] You won't be interested in it though as it doesn't work with the Java version. > On more reason to take domains into account is that it's fairly easy to > get a high coverage and accuracy when concentrating on translating a > domain, rather than trying to make a system that makes a general > translation. Further, it's a fruitful way to build a new language pair: > adding a domain at a time. Thus, the system quickly gets useful. > > 1. I'm planning to spend some days on adding translation for some > domains to the pair Swedish-Danish (sv-da). It would be nice to be able > to include the domain-info from the start. Put it in a comment <e c="domain:foo"> > 2. Has any work been done on using domain-info in lexical > selection/disambiguation? Any plans? Yes, that is the subject of my thesis. Fran 1. http://wiki.apertium.org/wiki/Constraint-based_lexical_selection_module ------------------------------------------------------------------------------ LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial Remotely access PCs and mobile devices and provide instant support Improve your efficiency, and focus on delivering more value-add services Discover what IT Professionals Know. Rescue delivers http://p.sf.net/sfu/logmein_12329d2d _______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff