Re: [Apertium-stuff] Abbreviations

Francis Tyers Tue, 13 Nov 2012 00:57:42 -0800

El dt 13 de 11 de 2012 a les 09:31 +0100, en/na Per Tunedal va escriure:
> OK. It works, but please explain why it's not enough to compile the
> dictionaries with:
> 
> lt-comp lr apertium-sv-da.sv.dix sv-da.automorf.bin
> lt-comp rl apertium-sv-da.sv.dix da-sv.autogen.bin
> etc
> 
> How do I know when I have to recompile the whole pair?


The autotools setup, when you type 'make' will recompile only what has
been changed.

> Secondly, I'm curious how Apertium handles word splitting. The points in
> abbreviations must be handled somehow, wouldn't they? I just thought
> about simple scripts for aligning, like Bligner, or even OmegaT. They
> split sentences at punctuation marks. Thus, they have a list of what not
> to  split, i.e. the abbreviations for the languages in concern. That's
> why I started this tread. How does Apertium know not to split? Does the
> tagger look for the tag <abbr> ? Is this a standard solution for
> Apertium? Or do I have to add it in each language pair somehow?

Left-to-right longest match with tokenise-as-you-analyse.

http://www.dlsi.ua.es/~mlf/docum/garrido02p.pdf

Section 3 describes it.

Fran


------------------------------------------------------------------------------
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] Abbreviations

Reply via email to