Hello, As you know, I developped a shell to generate list of language pairs and list of dictionaries for the wiki.
I put the result in a tar file sent on this list on the 22 of last January. But nobody used it to put it on the computer hosting the wiki (I think it is xixona.dlsi.ua.es). So, these pages are updated only when I think to do that. At the end of this month, as it happens 4 times a year between September and June, I will spend 2 weeks of holidays without internet (but not without computer), and next summer, from 26th June to begin of September, I will use Internet as few as possible. So, if updates are still done in manual mode, they will wait. Last month I improved my shell. The main problem is to keep informations about langages pairs to prevent using a lot the network and spending dizains of minutes exploring every pair even if they did not change between two calls of the command. For the existing pairs and dictionaries, the first version of the command used 731 files with 29k usefull bytes. As on ext3 filesystem, a non empty file uses at least 4k of disk space, these informations needed 2.9 Mbytes of disk space. The new version keeps these informations in 8 files with 27k usefull bytes, and the disk spaces used for that on ext3 filesystem is 56k bytes. The first version was made of a single commented shell of 341 lines and 12 K bytes. For the present version, 4 smaller files are used : - the first to set up common environment variables (dirnames and filenames) - another to test if something changed since the last call and on what branche did pairs changed - the third to update (memorise) characteristics of pairs changed since the last time in a branche - the last one to generate sources of wiki pages. I also made changes to make my commands working on different linux distributions. For an automatic update, a command similar to a python script francis Tyers indicated in another mail will have to be added. The new version of my script is at : http://bech.free.fr/apertium/genpage.tar.bz2 As nobody did that before, I propose to install these commands on the computer hosting the wiki if I have an account on it. My logname could be bech. There will be a little maintenance to do (extending 2 files) when a new langage will stard to be supported, or if a language is indicated with a 2 letters code in several pairs and 3 letters code in others. Presently, it is the case for : - fi and fin - nb and nob - ru and rus - tr and tur (I think that's all) and to remove a patch if for es-an pair if it starts using .t1x file (I may try to do that in May if not done before). -------------------------------- Bernard Chardonneau (France) Phone : [33] 1 64 90 87 04 (from Sept to June except holidays) GSM phone : [33] 6 49 95 13 95 (french scholl holidays, C zone) Multilingual websites for my free softwares : http://libremail.free.fr and http://libremail.tuxfamily.org http://cyloop.tuxfamily.org (mainly translated with Apertium) My general website (in french only) http://bech.free.fr ------------------------------------------------------------------------------ Better than sec? Nothing is better than sec when it comes to monitoring Big Data applications. Try Boundary one-second resolution app monitoring today. Free. http://p.sf.net/sfu/Boundary-dev2dev _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
