Hello,

As you know, I developped a shell to generate list of language pairs
and list of dictionaries for the wiki.

I put the result in a tar file sent on this list on the 22 of last
January.

But nobody used it to put it on the computer hosting the wiki (I
think it is xixona.dlsi.ua.es). So, these pages are updated only
when I think to do that.

At the end of this month, as it happens 4 times a year between
September and June, I will spend 2 weeks of holidays without internet
(but not without computer), and next summer, from 26th June to begin
of September, I will use Internet as few as possible.

So, if updates are still done in manual mode, they will wait.

Last month I improved my shell.

The main problem is to keep informations about langages pairs to
prevent using a lot the network and spending dizains of minutes
exploring every pair even if they did not change between two calls
of the command.

For the existing pairs and dictionaries, the first version of the
command used 731 files with 29k usefull bytes. As on ext3 filesystem,
a non empty file uses at least 4k of disk space, these informations
needed 2.9 Mbytes of disk space.

The new version keeps these informations in 8 files with 27k usefull
bytes, and the disk spaces used for that on ext3 filesystem is 56k
bytes.

The first version was made of a single commented shell of 341 lines
and 12 K bytes.

For the present version, 4 smaller files are used :

- the first to set up common environment variables (dirnames and
  filenames)
- another to test if something changed since the last call and
  on what branche did pairs changed
- the third to update (memorise) characteristics of pairs changed
  since the last time in a branche
- the last one to generate sources of wiki pages.

I also made changes to make my commands working on different linux
distributions.

For an automatic update, a command similar to a python script
francis Tyers indicated in another mail will have to be added.

The new version of my script is at :
http://bech.free.fr/apertium/genpage.tar.bz2

As nobody did that before, I propose to install these commands
on the computer hosting the wiki if I have an account on it.
My logname could be bech.

There will be a little maintenance to do (extending 2 files)  when
a new langage will stard to be supported, or if a language is
indicated with a 2 letters code in several pairs and 3 letters code
in others.
Presently, it is the case for :
- fi and fin
- nb and nob
- ru and rus
- tr and tur
(I think that's all)

and to remove a patch if for es-an pair if it  starts using .t1x file
(I may try to do that in May if not done before).



--------------------------------
Bernard Chardonneau (France)
Phone : [33] 1 64 90 87 04 (from Sept to June except holidays)
GSM phone : [33] 6 49 95 13 95 (french scholl holidays, C zone)

Multilingual websites for my free softwares :
http://libremail.free.fr and http://libremail.tuxfamily.org
http://cyloop.tuxfamily.org (mainly translated with Apertium)

My general website (in french only)
http://bech.free.fr

------------------------------------------------------------------------------
Better than sec? Nothing is better than sec when it comes to
monitoring Big Data applications. Try Boundary one-second 
resolution app monitoring today. Free.
http://p.sf.net/sfu/Boundary-dev2dev
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to