Hi Kevin and Fran,

I tried Kevin's patch, but something changes that causes mistakes when
reformatting. I will try adding some "rubbish" at the end of my segments instead.
Anyway, thank you so much for your help!
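
For the "rubbish" workaround, here is a rough sketch of what I have in mind.
It is only an illustration: the padding token XYZZY and both helper names are
made up, and I am assuming the token goes through the pipeline as an unknown
word (Apertium marks unknown words with a leading "*" by default), so that any
dot the deformatter adds attaches to the padding and not to the n-gram.

```python
import re

# Hypothetical padding token; assumed to be unknown to the dictionaries.
SENTINEL = "XYZZY"


def wrap_segments(ngrams):
    """Wrap each n-gram in its own <p> element, padded with the sentinel."""
    paragraphs = "".join(f"<p>{ng} {SENTINEL}</p>" for ng in ngrams)
    return f"<html><body>{paragraphs}</body></html>"


def unwrap_segments(translated_html):
    """Recover the translated n-grams, dropping the sentinel and any dot after it."""
    segments = re.findall(r"<p>(.*?)</p>", translated_html, flags=re.S)
    # The optional \* covers the unknown-word mark in front of the sentinel.
    tail = re.compile(r"\s*\*?" + re.escape(SENTINEL) + r"\s*\.?\s*$")
    return [tail.sub("", seg).strip() for seg in segments]
```

The output of wrap_segments() would be what I feed to the translator, and
unwrap_segments() would be run on the translated HTML afterwards.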

Miquel.

2012/1/18 Kevin Brubeck Unhammer <unham...@fsfe.org>

> Miquel Esplà <miqueles...@gmail.com>
> writes:
>
> > Hi everybody,
> >
> > I am performing some experiments with the fr-es pair. I am trying to
> > translate small n-grams (with 1<=n<=3). To be sure that they are
> > translated independently, I enclose each n-gram in HTML paragraph tags
> > (<p></p>). Now, this is my problem: the deformatter adds a dot at the end
> > of each n-gram. One of my n-grams in French ends with the word "avr" (I
> > know it means nothing, but it is automatically extracted from a text), and
> > when the dot is added, lt-proc recognises it as an abbreviation of avril.
> > As a consequence, this paragraph and the following one are concatenated
> > in the resulting translation.
> >
> > This is an example of the output of the deformatter:
> > .[][<html><body><p>]- avr.[][<\/p><p>]- axes.[][<\/p><\/body><\/html>
> > and this is what the lt-proc outputs:
> > ^./.<sent>$[][<html><body><p>]- ^avr./avr.<n><m><sg>$[][<\/p><p>]- ^axes/axe<n><m><pl>/axer<vblex><pri><p2><sg>/axer<vblex><prs><p2><sg>$^./.<sent>$[][<\/p><\/body><\/html>
> >
> > I have taken a look at the deformatter definition, but I am not sure how
> > to solve this. I guess that if the deformatter added a space before the
> > dot, the problem would be solved, but I am not sure where to add this
> > feature. Can anybody help me?
>
>    cd trunk/apertium
>    wget http://paste.pocoo.org/raw/536603/ -O nodot.patch
>    patch -p0 < nodot.patch
>    make && make install
>
> This will ensure none of the deformatters add any dots that weren't in
> the input text. I'm not sure why they do so in the first place. Perhaps it
> helps tagging sometimes. I just find it a nuisance, so I keep an
> install in another prefix for when I want a deformatter that doesn't
> mess up punctuation.
>
>
> hope this helps,
> Kevin Brubeck Unhammer
>
>
>
> ------------------------------------------------------------------------------
> Keep Your Developer Skills Current with LearnDevNow!
> The most comprehensive online learning library for Microsoft developers
> is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3,
> Metro Style Apps, more. Free future releases when you subscribe now!
> http://p.sf.net/sfu/learndevnow-d2d
> _______________________________________________
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>