Ok, thanks Hector. We're pretty happy with the Pos-tagger for now. I
think we won't use the texts for now. When we have a larger Pos-tagged
occitan corpus, we may try again.
Regards,
AureSÉGUIER
Responsabla del pòle informatic
Congrès permanent de la lenga occitana
mobilePhone
+33 (0)5 32 00 00 64 <tel:+33 (0)5 32 00 00 64>
website
www.locongres.org <//www.locongres.org>
address
La Ciutat - Creem! , 5-7 rue de la Fontaine, 64000 Pau
facebook <https://www.facebook.com/lo.congres>
twitter <https://twitter.com/locongres>
linkedin
<https://www.linkedin.com/company/congres-permanent-de-la-lenga-occitane/>
instagram <https://www.instagram.com/locongres/>
Le 15/10/2025 à 15:05, Hèctor Alòs i Font a écrit :
J'ai trouvé cela dans la documentation :
https://wiki.apertium.org/wiki/Paire_Occitan-Fran%C3%A7ais#D.C3.A9sambigu.C3.AFsateur_statistique_2
Missatge de Hèctor Alòs i Font <[email protected]> del dia dc., 15
d’oct. 2025 a les 16:02:
Adiu, Aure,
The texts for training the tagger, if I remember correctly, were
something we tried back with Claudi Balaguer, but I don't think we
managed to get a post-tagger that worked better than the one that
already existed. Consequently, we didn't use them, and simply left
them in case they might be useful to someone in the future. I
don't have access to Apertium stuff right now. I'll try to look
into it tonight.
Best,
Hèctor
Missatge de Aure Séguier <[email protected]> del dia dc., 15
d’oct. 2025 a les 15:39:
Hi,
I changed the organization of occitan language words to merge
words which are variants in many varieties. For instance «
veire » (oci), « véser » (oci@gascon), « véder » (oci@gascon)
and « veir » (oci@aran) are now merged in only one verb « véser ».
The apertium-oci repository has a « texts » subdirectory with
pos-tagged .vislcg.txt texts. I understood these texts are
used to fine-tune the pos-tagger with statistical techniques.
I corrected these texts so they reflect the new verbs
organization in the monodix.
But now I have no idea what to do with these texts. How do I
use them to fine-tune the pos-tagger ? I found this page on
Apertium wiki :
https://wiki.apertium.org/wiki/Tagger_training. But it doesn't
mention any vislcg text. Where can I found the procedure to
fine-tune again the pos-tagger with the corrected texts ?
Thanks
--
AureSÉGUIER
Responsabla del pòle informatic
Congrès permanent de la lenga occitana
mobilePhone
+33 (0)5 32 00 00 64 <tel:+33%20(0)5%2032%2000%2000%2064>
website
www.locongres.org <//www.locongres.org>
address
La Ciutat - Creem! , 5-7 rue de la Fontaine, 64000 Pau
facebook <https://www.facebook.com/lo.congres>
twitter <https://twitter.com/locongres>
linkedin
<https://www.linkedin.com/company/congres-permanent-de-la-lenga-occitane/>
instagram <https://www.instagram.com/locongres/>
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff