Dutch spell checking dictionaries (Was: Trouble updating the ro spellfile)

2008-11-10 Thread Tony Mechelynck

On 10/11/08 09:26, Bram Moolenaar wrote:
 Tony Mechelynck wrote:

 On 09/11/08 11:24, Bram Moolenaar wrote:
 [...]
 Fixing bugs currently has priority.  I also still have a note to update
 the Dutch spell file, I think already for half a year...

 Shall I do it? (Which tools are required, besides Vim?) Dutch is not my
 mother tongue, but I own good, not too outdated ;-) monolingual and
 bilingual dictionaries.

 Well, I started with the OpenOffice list, merged it with a list I scraped
 off a website, and then had it filtered against an official list.  This is
 too much work; we probably need to drop this and go back to the
 OpenOffice.org list.  You could try downloading that one and check that
 it works well.


Ah, you mean spelling dictionaries; I thought you meant message/menu
translations. Sorry. I still did some detective work, though; here are
the results:

I followed the OOo link at :help :mkspell and found several dictionary
archives there. The format changed between OOo versions 2 and 3.
AFAICT, the version 3 dictionaries are distributed as OOo extensions,
which don't seem compatible with Vim.

For version 2 (with MySpell), three Dutch archives are listed, apparently
all dated 2007. The fullest one is
http://ftp.services.openoffice.org/pub/OpenOffice.org/contrib/dictionaries/nl_NL-pack.zip
which includes the other two (nl_NL .dic and .aff, and hyph_nl_NL.dic),
plus a medical dictionary, nl_MED .dic and .aff. (README license texts
are also included, see below.) Personally I don't use spell checking,
and in addition I don't write much in Dutch. Maybe someone (or some
people) using Dutch spell checking more than I do could test those
spelling dictionaries? The structure of the file at the above-mentioned
URL is as follows:

nl_NL-pack.zip
  hyph.txt
  hyph_nl_NL.zip
    hyph_nl_NL.dic
    README_hyph_nl_NL.txt
  nl_MED.zip
    nl_MED.dic
    nl_MED.aff
    README_nl_MED.txt
  nl_NL.zip
    nl_NL.dic
    nl_NL.aff
    README_nl_NL.txt
  README_nl_NL.txt
  spell.txt

...so there will be two levels of unzipping. The README files (at least
those that I checked) are in both Dutch and English. Good luck!


Best regards,
Tony.
-- 
Real Users never use the Help key.

--~--~-~--~~~---~--~~
You received this message from the vim_use maillist.
For more information, visit http://www.vim.org/maillist.php
-~--~~~~--~~--~--~---



Re: Dutch spell checking dictionaries (Was: Trouble updating the ro spellfile)

2008-11-10 Thread Bram Moolenaar


Tony Mechelynck wrote:

 [...]
 For version 2 (with MySpell) 3 Dutch archives are listed, apparently all 
 dated 2007. The fullest one is 
 http://ftp.services.openoffice.org/pub/OpenOffice.org/contrib/dictionaries/nl_NL-pack.zip
  
 [...]

That's still an older version.  Apparently that's the latest that was
distributed officially, even though many improvements have been
discussed.

If you want to work on this, and are able to communicate in Dutch, you
can subscribe to the mailing list: http://www.opentaal.nl/mailinglist.php

-- 
hundred-and-one symptoms of being an internet addict:
222. You send more than 20 personal e-mails a day.

 /// Bram Moolenaar -- [EMAIL PROTECTED] -- http://www.Moolenaar.net   \\\
///        sponsor Vim, vote for features -- http://www.Vim.org/sponsor/ \\\
\\\        download, build and distribute -- http://www.A-A-P.org        ///
 \\\            help me help AIDS victims -- http://ICCF-Holland.org    ///




Re: Dutch spell checking dictionaries (Was: Trouble updating the ro spellfile)

2008-11-10 Thread Tony Mechelynck

On 10/11/08 21:41, Bram Moolenaar wrote:
[...]
 If you want to work on this, and are able to communicate in Dutch, you
 can subscribe to the mailing list: http://www.opentaal.nl/mailinglist.php


When I try to access the administration page ("beheerpagina") for
(un)subscribing to the mailing list, SeaMonkey tells me that:
- The certificate is not trusted because its issuer is unknown
- The certificate is only valid for *.uitwisselplatform.nl (and thus not
  for uitwisselplatform.nl with no prefix)
- The certificate expired on 16 September 2007 at 01:59

I could add it to my browser as a permanent exception, but what is the
purpose of having certificates expire if you are to accept them more
than a year after their expiration date?

Maybe someone on the list could give an admin of uitwisselplatform.nl a
phone call? Or should I open a bug at Mozilla for the Tech Evangelism
product, Dutch language component, about the fact that the site
presents a certificate which expired more than a year ago?


Best regards,
Tony.
-- 
What good is having someone who can walk on water if you don't follow
in his footsteps?




Re: Trouble updating the ro spellfile

2008-11-09 Thread Bram Moolenaar


Andrei Popescu wrote:

 On Sat,08.Nov.08, 22:17:42, Bram Moolenaar wrote:
  
   and now I have a working spellfile (before that it was using only the
   .aff file to build the .spl). Could you please replace the file on
   ftp.vim.org with one built from the wordlist indicated by me? The old
   file is plain useless (actually it could be doing more harm than good).
  
  To be able to allow everybody to reproduce the .spl file we need to
  write a .aap file that does the work.  It basically does all the work
  that you would do manually.  In this case it would also do the
  conversion from utf-8 to iso-8859-2 to make a spell file in that
  encoding.  
 
 No, please don't. The new diacritics (șțȘȚ) cannot be
 represented in ISO-8859-2; they need UTF-8. Only the -cedilla wordlist
 uses them, but we (as in the Romanian FLOSS community) are trying to
 switch to the new diacritics.
 
 Even MS (responsible for creating the old/wrong diacritics
 (şŞţŢ)) is using the correct ones in Vista and there is an
 update available for XP.

So what about people who have iso-8859-2 files on a system that doesn't
support utf-8?  Would the old spell file be better than nothing?
Or would a conversion from utf-8 to iso-8859-2 of the new spell file
give better results?
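
The encoding constraint behind this question can be checked directly. The sketch below uses Python's built-in codecs to show that the comma-below letters have no ISO-8859-2 representation while the cedilla ones do. The helper to_cedilla is a hypothetical, lossy fallback added only to illustrate what a utf-8 to iso-8859-2 conversion would have to do; nobody in this thread proposed that exact mapping.

```python
# Comma-below letters (the new diacritics): s, t, S, T with comma below.
NEW_DIACRITICS = "\u0219\u021b\u0218\u021a"
# Cedilla letters (the old diacritics), in the same order.
OLD_DIACRITICS = "\u015f\u0163\u015e\u0162"


def encodable(text: str, encoding: str) -> bool:
    """Return True if every character of text exists in the given encoding."""
    try:
        text.encode(encoding)
    except UnicodeEncodeError:
        return False
    return True


def to_cedilla(text: str) -> str:
    """Hypothetical lossy fallback: replace comma-below letters by cedilla ones."""
    return text.translate(str.maketrans(NEW_DIACRITICS, OLD_DIACRITICS))


if __name__ == "__main__":
    for label, chars in (("new", NEW_DIACRITICS), ("old", OLD_DIACRITICS)):
        print(label, "fits ISO-8859-2:", encodable(chars, "iso-8859-2"))
```

A word list converted this way would match only text typed with the old letters, so it would need to be labelled as such.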

  Do you think you can write this .aap file?  You can use the
  files of other languages as an example.  If not then I can put it on my
  todo list (which probably means it will take a couple of months before I
  get to it).
 
 I'll see what I can do about it, but no promises. Better do put it on 
 your TODO list, just in case I'm too busy. The spare time I have now I 
 tend to use for translation work for the Debian Lenny release, which 
 should happen Real Soon Now (tm).

Fixing bugs currently has priority.  I also still have a note to update
the Dutch spell file, I think already for half a year...

-- 
I noticed my daughter's Disney-net password on a sticky note:
MickeyMinnieGoofyPluto.  I asked her why it was so long.
Because they say it has to have at least four characters.





Re: Trouble updating the ro spellfile

2008-11-09 Thread Andrei Popescu
On Sun,09.Nov.08, 11:24:03, Bram Moolenaar wrote:
 
 So what about people who have iso-8859-2 files on a system that doesn't
 support utf-8?  Would the old spell file be better than nothing?

No, the old spellfile is obsolete.

 Or would a conversion from utf-8 to iso-8859-2 of the new spell file
 give better results?

The hunspell-ro.3.2.tar.gz file contains an ISO-8859-2 word list. I 
guess it could be used to create a .spl, but it should be clearly marked 
as obsolete or something similar.

I guess the following will be needed for this file:

FOL ăâîşţ
LOW ăâîşţ
UPP ĂÂÎŞŢ

Regards,
Andrei
-- 
If you can't explain it simply, you don't understand it well enough.
(Albert Einstein)




Trouble updating the ro spellfile

2008-11-08 Thread Andrei Popescu
Hello everybody,

Lately I've been using vim to edit .po files and I wanted to use the
built-in spell checker. The .spl file available for my native language
(Romanian) is... lacking, to say the least. So I tried to build a new 
one using the word list available here

http://sourceforge.net/project/showfiles.php?group_id=144374&package_id=236406

I downloaded the ro_RO.3.2.zip file, unpacked it and ran

:mkspell ro ro_RO.aff

Reading word file ro_RO.aff ...
Compressing word tree...
Compressed 13638 of 21005 nodes; 7367 (35%) remaining
Compressed 13684 of 21054 nodes; 7370 (35%) remaining
Writing spell file ro.utf-8.spl ...
Done!
Estimated runtime memory use: 79385 bytes

Then I tried to use it (after copying it to ~/.vim/spell/):

:set spelllang=ro
:set spell

but it just won't work: anything I write in Romanian is highlighted as
bad and z= doesn't give any suggestions. I also tried running vim with
-u NONE -U NONE, but the same thing happens. Spelling with 'en' or 'de'
works fine.

What am I doing wrong?
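
A likely cause, consistent with the "Reading word file ro_RO.aff" line in the output above and with the later remark in this thread that the first build used only the .aff file: :mkspell takes the base name of the MySpell pair ({inname}.aff and {inname}.dic). If {inname}.aff does not exist, Vim treats {inname} as a plain word list, so passing ro_RO.aff reads the affix file as if it were the words. The Python sketch below builds the command from a file path and checks that both files are present; mkspell_command is a made-up helper name, not part of Vim.

```python
from pathlib import Path


def mkspell_command(outname: str, path: str) -> str:
    """Build a ':mkspell' Ex command from a path to a MySpell .aff or .dic file.

    The extension is stripped so that Vim receives the shared base name;
    both <base>.aff and <base>.dic must exist.
    """
    p = Path(path)
    base = p.with_suffix("") if p.suffix in (".aff", ".dic") else p
    missing = [
        base.name + ext
        for ext in (".aff", ".dic")
        if not base.with_name(base.name + ext).exists()
    ]
    if missing:
        raise FileNotFoundError("missing " + ", ".join(missing))
    return f":mkspell {outname} {base}"
```

With ro_RO.aff and ro_RO.dic in the current directory, mkspell_command("ro", "ro_RO.aff") returns ":mkspell ro ro_RO", which is the form that reads both files.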

:r !vim --version

VIM - Vi IMproved 7.2 (2008 Aug 9, compiled Oct 20 2008 19:21:29)
Included patches: 1-25
Compiled by [EMAIL PROTECTED]
Huge version without GUI.  Features included (+) or not (-):
+arabic +autocmd -balloon_eval -browse ++builtin_terms +byte_offset +cindent 
-clientserver -clipboard +cmdline_compl +cmdline_hist +cmdline_info +comments 
+cryptv +cscope +cursorshape +dialog_con +diff +digraphs -dnd -ebcdic 
+emacs_tags +eval +ex_extra +extra_search +farsi +file_in_path +find_in_path 
+float +folding -footer +fork() +gettext -hangul_input +iconv +insert_expand 
+jumplist +keymap +langmap +libcall +linebreak +lispindent +listcmds +localmap 
+menu +mksession +modify_fname +mouse -mouseshape +mouse_dec +mouse_gpm 
-mouse_jsbterm +mouse_netterm -mouse_sysmouse +mouse_xterm +multi_byte 
+multi_lang -mzscheme -netbeans_intg -osfiletype +path_extra +perl +postscript 
+printer +profile +python +quickfix +reltime +rightleft +ruby +scrollbind 
+signs +smartindent -sniff +statusline -sun_workshop +syntax +tag_binary 
+tag_old_static -tag_any_white +tcl +terminfo +termresponse +textobjects +title
 -toolbar +user_commands +vertsplit +virtualedit +visual +visualextra +viminfo 
+vreplace +wildignore +wildmenu +windows +writebackup -X11 -xfontset -xim -xsmp
 -xterm_clipboard -xterm_save 
   system vimrc file: $VIM/vimrc
 user vimrc file: $HOME/.vimrc
  user exrc file: $HOME/.exrc
  fall-back for $VIM: /usr/share/vim
Compilation: gcc -c -I. -Iproto -DHAVE_CONFIG_H -Wall -g -O2
-D_REENTRANT -D_GNU_SOURCE -DDEBIAN  -I/usr/local/include -D_LARGEFILE_SOURCE 
-D_FILE_OFFSET_BITS=64  -I/usr/lib/perl/5.10/CORE  -I/usr/include/python2.5 
-pthread -I/usr/include/tcl8.4  -D_REENTRANT=1  -D_THREAD_SAFE=1  
-D_LARGEFILE64_SOURCE=1  -I/usr/lib/ruby/1.8/x86_64-linux 
Linking: gcc   -L.  -rdynamic -Wl,-export-dynamic  -Wl,-E  -Wl,--as-needed -o 
vim   -lncurses -lselinux  -lacl -lgpm -Wl,-E  -L/usr/local/lib  
-L/usr/lib/perl/5.10/CORE -lperl -L/usr/lib/python2.5/config -lpython2.5 -lutil 
-Xlinker -export-dynamic -Wl,-O1 -Wl,-Bsymbolic-functions -L/usr/lib -ltcl8.4 
-lieee -lruby1.8 -lm   

:r !locale

LANG=ro_RO.UTF-8
LC_CTYPE=ro_RO.UTF-8
LC_NUMERIC=ro_RO.UTF-8
LC_TIME=ro_RO.UTF-8
LC_COLLATE=ro_RO.UTF-8
LC_MONETARY=ro_RO.UTF-8
LC_MESSAGES=ro_RO.UTF-8
LC_PAPER=ro_RO.UTF-8
LC_NAME=ro_RO.UTF-8
LC_ADDRESS=ro_RO.UTF-8
LC_TELEPHONE=ro_RO.UTF-8
LC_MEASUREMENT=ro_RO.UTF-8
LC_IDENTIFICATION=ro_RO.UTF-8
LC_ALL=

Regards,
Andrei
-- 
If you can't explain it simply, you don't understand it well enough.
(Albert Einstein)




Re: Trouble updating the ro spellfile

2008-11-08 Thread Bram Moolenaar


Andrei Popescu wrote:

 [...]
 :mkspell ro ro_RO.aff
 
 Reading word file ro_RO.aff ...
 Compressing word tree...
 Compressed 13638 of 21005 nodes; 7367 (35%) remaining
 Compressed 13684 of 21054 nodes; 7370 (35%) remaining
 Writing spell file ro.utf-8.spl ...
 Done!
 Estimated runtime memory use: 79385 bytes
 
 Then I try to use it (after copying it to ~/.vim/spell/)
 
 :set spelllang=ro
 :set spell
 
 but it just won't work, anything I write in Romanian is highlighted as
 bad and z= doesn't give any suggestions. I also tried running vim with 
 -u NONE -U NONE, but the same thing happens. Spelling with 'en' or 'de' 
 works fine.
 
 What am I doing wrong?

The spell file is written as utf-8.  Are you sure the input is also
utf-8?

Perhaps the diff files for the old spell files give a few hints about
what's different in the Vim spell input files:

ftp://ftp.vim.org/pub/vim/runtime/spell/ro/

You can also try :spelldump, it will list all the valid words.

-- 
hundred-and-one symptoms of being an internet addict:
213. Your kids start referring to you as that guy in front of the monitor.

