I tried different combinations: apertium-destxt -n | lt-proc -z -w 'apertium-tat/tat.automorf.bin' | cg-proc -z 'apertium-tat/tat.rlx.bin' | cg-proc -z -w -1 'apertium-tat/dev/mansur.bin' | apertium-retxt |
And it does not merge if I remove "cg-proc -z -w -1 'apertium-tat/dev/mansur.bin' | " from the pipeline. Could you take a look at the rules there? Am Fr., 9. Nov. 2018 um 12:06 Uhr schrieb Kevin Brubeck Unhammer < unham...@fsfe.org>: > mansur <6688...@gmail.com> čálii: > > > Hi! > > > > root@apertium:~# locale > > LANG=ru_RU.UTF-8 > > LANGUAGE= > > LC_CTYPE="ru_RU.UTF-8" > > LC_NUMERIC="ru_RU.UTF-8" > > LC_TIME="ru_RU.UTF-8" > > LC_COLLATE=C > > LC_MONETARY="ru_RU.UTF-8" > > LC_MESSAGES="ru_RU.UTF-8" > > LC_PAPER="ru_RU.UTF-8" > > LC_NAME="ru_RU.UTF-8" > > LC_ADDRESS="ru_RU.UTF-8" > > LC_TELEPHONE="ru_RU.UTF-8" > > LC_MEASUREMENT="ru_RU.UTF-8" > > LC_IDENTIFICATION="ru_RU.UTF-8" > > LC_ALL= > > > > What did you mean by "each of you"? > > I'm guessing "each of mansur & kevin" :-) > > LANG=nn_NO.UTF-8 > LANGUAGE=nn_NO:nn:no_NO:no:nb_NO:nb:en > LC_CTYPE="nn_NO.UTF-8" > LC_NUMERIC="nn_NO.UTF-8" > LC_TIME="nn_NO.UTF-8" > LC_COLLATE="nn_NO.UTF-8" > LC_MONETARY="nn_NO.UTF-8" > LC_MESSAGES="nn_NO.UTF-8" > LC_PAPER="nn_NO.UTF-8" > LC_NAME="nn_NO.UTF-8" > LC_ADDRESS="nn_NO.UTF-8" > LC_TELEPHONE="nn_NO.UTF-8" > LC_MEASUREMENT="nn_NO.UTF-8" > LC_IDENTIFICATION="nn_NO.UTF-8" > LC_ALL= > > It seems we both have UTF-8, the only difference is in LANGUAGE and > LC_COLLATE – I wouldn't have thought any of them would matter, but I get > U+1F609 WINKING FACE 😉 > (and a newline) where you get > U+FFFD REPLACEMENT CHARACTER � > so it definitely seems encoding-related. > > Is it lt-proc or cg-proc that does it? > > > > > Am Fr., 9. Nov. 2018 um 11:44 Uhr schrieb Xavi Ivars < > xavi.ivars-re5jqeeqqe8avxtiumw...@public.gmane.org>: > > > >> What are the encodings that each of you are using in the shell? Is it a > >> UTF one in both cases? > >> > >> > >> -- > >> Xavi Ivars > >> < http://xavi.ivars.me > > >> > >> El dv., 9 de nov. 2018, 09:41, mansur < > 6688000-re5jqeeqqe8avxtiumw...@public.gmane.org> va escriure: > >> > >>> Strange. > >>> I uploaded my output here: https://filebin.net/c7mikerq2vwv08ql > >>> > >>> > >>> Am Fr., 9. Nov. 2018 um 11:31 Uhr schrieb Kevin Brubeck Unhammer < > >>> unham...@fsfe.org>: > >>> > >>>> I still get 5 lines for that, could you upload the output you get too? > >>>> I get: > >>>> > >>>> http://sprunge.us/fJYZbm > >>>> > >>>> -Kevin > >>>> > >>>> mansur < > 6688000-re5jqeeqqe8avxtiumwx3w-xmd5yjdbdmrexy1tmh2...@public.gmane.org> > čálii: > >>>> > >>>> > Hi! > >>>> > I uploaded it here: > >>>> > https://filebin.net/46e383wip8h2qcrc > >>>> > > >>>> > > >>>> > Am Fr., 9. Nov. 2018 um 11:00 Uhr schrieb Kevin Brubeck Unhammer < > >>>> > unham...@fsfe.org>: > >>>> > > >>>> >> mansur < > 6688000-re5jqeeqqe8avxtiumwx3w-xmd5yjdbdmrexy1tmh2...@public.gmane.org> > čálii: > >>>> >> > >>>> >> > One more example: > >>>> >> > > >>>> >> > - Фәнис Яруллин � > >>>> >> > - Фәнис Яруллинга багышланган чараларның һәрберсендә катнашырга > >>>> тырышам, > >>>> >> - > >>>> >> > диде әдипнең дусты Мохтар Афзалов. > >>>> >> > > >>>> >> > ^-/-<guio>$ ^Фәнис/Фәнис<np><ant><m><nom>$ > >>>> >> > ^Яруллин/Яруллин<np><cog><m><nom>$ �-/-<guio>$ > >>>> >> > ^Фәнис/Фәнис<np><ant><m><nom>$ > ^Яруллинга/Яруллин<np><cog><m><dat>$ > >>>> >> > ^багышланган/багышла<v><tv><pass><gpr_past>$ > >>>> >> ^чараларның/чара<n><pl><gen>$ > >>>> >> > ^һәрберсендә/*һәрберсендә$ ^катнашырга/катнаш<v><tv><inf>$ > >>>> >> > ^тырышам/тырыш<v><tv><pres><p1><sg>$^,/,<cm>$ ^-/-<guio>$ > >>>> >> > ^диде/ди<v><tv><ifi><p3><sg>$ ^әдипнең/әдип<n><sg><gen>$ > >>>> >> > ^дусты/дуст<n><sg><px3sp><nom>$ ^Мохтар/Мохтар<np><ant><m><nom>$ > >>>> >> > ^Афзалов/Афзалов<np><cog><m><nom>+и<cop><aor><p3><sg>$^./.<sent>$ > >>>> >> > > >>>> >> > Here it happens because of some broken char... But why? > >>>> >> > >>>> >> I can't reproduce it, but maybe the broken character didn't survive > >>>> the > >>>> >> e-mail. Could you e.g. put a text file with it on > >>>> https://filebin.net/ ? > >>>> >> > >>>> >> _______________________________________________ > >>>> >> Apertium-stuff mailing list > >>>> >> Apertium-stuff@lists.sourceforge.net > >>>> >> https://lists.sourceforge.net/lists/listinfo/apertium-stuff > >>>> >> > >>>> > > >>>> > _______________________________________________ > >>>> > Apertium-stuff mailing list > >>>> > > apertium-stuff-5nwgofrqmnerv+lv9mx5uipxlwaovq5f-xmd5yjdbdmrexy1tmh2...@public.gmane.org > >>>> > https://lists.sourceforge.net/lists/listinfo/apertium-stuff > >>>> > > >>>> > >>>> _______________________________________________ > >>>> Apertium-stuff mailing list > >>>> Apertium-stuff@lists.sourceforge.net > >>>> https://lists.sourceforge.net/lists/listinfo/apertium-stuff > >>>> > >>> _______________________________________________ > >>> Apertium-stuff mailing list > >>> Apertium-stuff@lists.sourceforge.net > >>> https://lists.sourceforge.net/lists/listinfo/apertium-stuff > >>> > >> _______________________________________________ > >> Apertium-stuff mailing list > >> Apertium-stuff@lists.sourceforge.net > >> https://lists.sourceforge.net/lists/listinfo/apertium-stuff > >> > > > > _______________________________________________ > > Apertium-stuff mailing list > > Apertium-stuff@lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/apertium-stuff > > > > _______________________________________________ > Apertium-stuff mailing list > Apertium-stuff@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/apertium-stuff >
_______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff