El dt 13 de 11 de 2012 a les 09:31 +0100, en/na Per Tunedal va escriure: > OK. It works, but please explain why it's not enough to compile the > dictionaries with: > > lt-comp lr apertium-sv-da.sv.dix sv-da.automorf.bin > lt-comp rl apertium-sv-da.sv.dix da-sv.autogen.bin > etc > > How do I know when I have to recompile the whole pair?
The autotools setup, when you type 'make' will recompile only what has been changed. > Secondly, I'm curious how Apertium handles word splitting. The points in > abbreviations must be handled somehow, wouldn't they? I just thought > about simple scripts for aligning, like Bligner, or even OmegaT. They > split sentences at punctuation marks. Thus, they have a list of what not > to split, i.e. the abbreviations for the languages in concern. That's > why I started this tread. How does Apertium know not to split? Does the > tagger look for the tag <abbr> ? Is this a standard solution for > Apertium? Or do I have to add it in each language pair somehow? Left-to-right longest match with tokenise-as-you-analyse. http://www.dlsi.ua.es/~mlf/docum/garrido02p.pdf Section 3 describes it. Fran ------------------------------------------------------------------------------ Monitor your physical, virtual and cloud infrastructure from a single web console. Get in-depth insight into apps, servers, databases, vmware, SAP, cloud infrastructure, etc. Download 30-day Free Trial. Pricing starts from $795 for 25 servers or applications! http://p.sf.net/sfu/zoho_dev2dev_nov _______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff