Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
> Are the changes being implemented going to alter the behavior of the > punctuation marks that are not analyzed as tokens? > Yes, as was discussed in the thread about markup handling in Apertium, input blanks are now read as a queue and output in order in the available spots in the rule output.

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Jaume Ortolà i Font
Missatge de Tino Didriksen del dia dg., 30 d’ag. 2020 a les 11:15: > Why is - a blank in the first place? If it's needed in contexts, it should > be fully analyzed as a token. > This goes for all Apertium languages and pairs. I don't understand why > punctuation generally isn't analyzed. I

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
Also, I agree with Tino, if the punctuation is important for the context then it should probably be analysed as a token. *तन्मय खन्ना * *Tanmai Khanna* On Sun, Aug 30, 2020 at 3:16 PM Tanmai Khanna wrote: > Hèctor what I mean is, if you don't want a space in the output of rules > you have to

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
Hèctor what I mean is, if you don't want a space in the output of rules you have to remove the . For eg., $ echo "f.75v." | apertium -d .. fra-frp *f.75 v.. This space between 75 and v is now there because the output rule has a and so if you want the output to come without a space, you should

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Hèctor Alòs i Font
Missatge de Tanmai Khanna del dia dg., 30 d’ag. 2020 a les 12:06: > Hi Hèctor, > I'm dealing with the issues I see one by one. > 1. I was flushing the remaining blanks after processOut because I thought > usually we only have one .. block in the rule, but in some of > your rules there's

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tino Didriksen
Why is - a blank in the first place? If it's needed in contexts, it should be fully analyzed as a token. This goes for all Apertium languages and pairs. I don't understand why punctuation generally isn't analyzed. I assume it's just historic. -- Tino Didriksen On Sun, 30 Aug 2020 at 08:27,

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
Hey, It solved them Franco-Japanese issue as well :D Can you check the diff once and see if there's any more issues Hèctor? (After updating apertium). *तन्मय खन्ना * *Tanmai Khanna* On Sun, Aug 30, 2020 at 2:35 PM Tanmai Khanna wrote: > Hi Hèctor, > I'm dealing with the issues I see one by

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
Hi Hèctor, I'm dealing with the issues I see one by one. 1. I was flushing the remaining blanks after processOut because I thought usually we only have one .. block in the rule, but in some of your rules there's multiple, so in the latest commit to apertium/apertium, I made them flush after the

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Hèctor Alòs i Font
Missatge de Tanmai Khanna del dia dg., 30 d’ag. 2020 a les 9:49: > My guess is, the transfer rule for Franco-Japanese has a two word input, > so the stored blank is "-". Now the output has 3 words "una > Franco-Japonêsa", since the blanks are printed in order, they're printed in > the first

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
My guess is, the transfer rule for Franco-Japanese has a two word input, so the stored blank is "-". Now the output has 3 words "una Franco-Japonêsa", since the blanks are printed in order, they're printed in the first available spot in the output rules. There's a few possible solutions for

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
Hèctor, No worries I'll look into this. Can you send the input sentences? I want to see the transfer rules that are applying to the erroneous parts. They might need some changing. तन्मय खन्ना Tanmai Khanna From: Hèctor Alòs i Font Sent: Sunday, August 30, 2020

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Hèctor Alòs i Font
Unfortunately, I found a lot of problems cased by superblanks, especially with the handling of hyphens. See a couple of differences in translations of my French test corpus into Arpitan before and after the update: < 00607. Tandis que les Tétes Broulâyes sont en *permission sur *Espritos Marcos,