Dutch spell checking dictionaries (Was: Trouble updating the ro spellfile)
On 10/11/08 09:26, Bram Moolenaar wrote: Tony Mechelynck wrote: On 09/11/08 11:24, Bram Moolenaar wrote: [...] Fixing bugs currently has priority. I also still have a note to update the Dutch spell file, I think already for half a year... Shall I do it? (Which tools are required, beside Vim?) Het Nederlands is mijn moedertaal niet, maar ik bezit goede, niet te verouderde ;-) één- en tweetalige woordenboeken. Well, I started with the open office list, merged with a list I scraped of a website and then had it filtered against an official list. This is too much work, we probably need to drop this and go back to the OpenOffice.org list. You could try downloading that one and check that it works well. Ah, you mean spelling dictionaries, I thought you meant message/menu translations. Sorry. I still did some detective work, though; here are the results: I followed the OOo link at :help Mkspell and found there several dictionary archives. The format has changed between OOo versions 2 and 3. AFAICT, the version 3 dictionaries are distributed as OOo extensions which don't seem compatible with Vim. For version 2 (with MySpell) 3 Dutch archives are listed, apparently all dated 2007. The fullest one is http://ftp.services.openoffice.org/pub/OpenOffice.org/contrib/dictionaries/nl_NL-pack.zip which includes the other two (nl_NL .dic and .aff and hyph_nl_NL.dic), plus a medical dictionary, nl_MED .dic and .aff. (README license texts are also included, see below.) Personally I don't use spell checking, and in addition I don't write much in Dutch. Maybe someone (or some people) using Dutch spell checking more than I do could test those spelling dictionaries? The structure of the file at the above-mentioned URL is as follows: nl_NL-pack.zip hyph.txt hyph_nl_NL.zip hyph_nl_NL.dic README_hyph_nl_NL.txt nl_MED.zip nl_MED.dic nl_MED.aff README_nl_MED.txt nl_NL.zip nl_NL.dic nl_NL.aff README_nl_NL.txt README_nl_NL.txt spell.txt ...so there will be two levels of unzipping. The README files (at least those that I checked) are in both Dutch and English. Goed geluk! Best regards, Tony. -- Real Users never use the Help key. --~--~-~--~~~---~--~~ You received this message from the vim_use maillist. For more information, visit http://www.vim.org/maillist.php -~--~~~~--~~--~--~---
Re: Dutch spell checking dictionaries (Was: Trouble updating the ro spellfile)
Tony Mechelynck wrote: On 09/11/08 11:24, Bram Moolenaar wrote: [...] Fixing bugs currently has priority. I also still have a note to update the Dutch spell file, I think already for half a year... Shall I do it? (Which tools are required, beside Vim?) Het Nederlands is mijn moedertaal niet, maar ik bezit goede, niet te verouderde ;-) één- en tweetalige woordenboeken. Well, I started with the open office list, merged with a list I scraped of a website and then had it filtered against an official list. This is too much work, we probably need to drop this and go back to the OpenOffice.org list. You could try downloading that one and check that it works well. Ah, you mean spelling dictionaries, I thought you meant message/menu translations. Sorry. I still did some detective work, though; here are the results: I followed the OOo link at :help Mkspell and found there several dictionary archives. The format has changed between OOo versions 2 and 3. AFAICT, the version 3 dictionaries are distributed as OOo extensions which don't seem compatible with Vim. For version 2 (with MySpell) 3 Dutch archives are listed, apparently all dated 2007. The fullest one is http://ftp.services.openoffice.org/pub/OpenOffice.org/contrib/dictionaries/nl_NL-pack.zip which includes the other two (nl_NL .dic and .aff and hyph_nl_NL.dic), plus a medical dictionary, nl_MED .dic and .aff. (README license texts are also included, see below.) Personally I don't use spell checking, and in addition I don't write much in Dutch. Maybe someone (or some people) using Dutch spell checking more than I do could test those spelling dictionaries? The structure of the file at the above-mentioned URL is as follows: nl_NL-pack.zip hyph.txt hyph_nl_NL.zip hyph_nl_NL.dic README_hyph_nl_NL.txt nl_MED.zip nl_MED.dic nl_MED.aff README_nl_MED.txt nl_NL.zip nl_NL.dic nl_NL.aff README_nl_NL.txt README_nl_NL.txt spell.txt ...so there will be two levels of unzipping. The README files (at least those that I checked) are in both Dutch and English. Goed geluk! That's still an older version. Apparently that's the latest that was distributed officially, even though many improvements have been discussed. If you want to work on this, and are able to communicate in Dutch, you can subscribe on the maillist: http://www.opentaal.nl/mailinglist.php -- hundred-and-one symptoms of being an internet addict: 222. You send more than 20 personal e-mails a day. /// Bram Moolenaar -- [EMAIL PROTECTED] -- http://www.Moolenaar.net \\\ ///sponsor Vim, vote for features -- http://www.Vim.org/sponsor/ \\\ \\\download, build and distribute -- http://www.A-A-P.org/// \\\help me help AIDS victims -- http://ICCF-Holland.org/// --~--~-~--~~~---~--~~ You received this message from the vim_use maillist. For more information, visit http://www.vim.org/maillist.php -~--~~~~--~~--~--~---
Re: Dutch spell checking dictionaries (Was: Trouble updating the ro spellfile)
On 10/11/08 21:41, Bram Moolenaar wrote: [...] If you want to work on this, and are able to communicate in Dutch, you can subscribe on the maillist: http://www.opentaal.nl/mailinglist.php When I try to access the beheerpagina about (un)subscribing to the mailing list, SeaMonkey tells me that: - The certificate is not trusted because its issuer is unknown - The certificate is only valid for *.uitwisselplatform.nl (and thus not for uitwisselplatform.nl with no prefix) - The certificate has expired on 16 September 2007 at 01:59 I could add it to my browser as a permanent exception, but what is the purpose of having certificates expire if you are to accept them more than a year after their expiration date? Maybe someone on the list could give an admin of uitwisselplatform.nl a phone call? Or should I open a bug at Mozilla for the Tech Evangelism product, Dutch language component, about the fact that that site presents a certificate which is more than a year overdue? Best regards, Tony. -- What good is having someone who can walk on water if you don't follow in his footsteps? --~--~-~--~~~---~--~~ You received this message from the vim_use maillist. For more information, visit http://www.vim.org/maillist.php -~--~~~~--~~--~--~---
Re: Trouble updating the ro spellfile
Andrei Popescu wrote: On Sat,08.Nov.08, 22:17:42, Bram Moolenaar wrote: and now I have a working spellfile (before that it was using only the .aff file to build the .spl). Could you please replace the file on ftp.vim.org with one built from the wordlist indicated by me? The old file is plain useless (actually it could be doing more harm then good). To be able to allow everybody to reproduce the .spl file we need to write a .aap file that does the work. It basically does all the work that you would do manually. In this case it would also do the conversion from utf-8 to iso-8859-2 to make a spell file in that encoding. No, please don't. The new diacritics (șțȘȚ) can not be represented in ISO-8859-2, they need UTF-8. Only the -cedilla wordlist uses them, but we (as in the Romanian FLOSS community) are trying to switch to the new diacritics. Even MS (responsible for creating the old/wrong diacritics (şŞţŢ)) is using the correct ones in Vista and there is an update available for XP. So what about people who have iso-8859-2 files on a system that doesn't support utf-8? Would the old spell file be better than nothing? Or would a conversion from utf-8 to iso-8859-2 of the new spell file give better results. Do you think you can write this .aap file? You can use the files of other languages as an example. If not then I can put it on my todo list (which probably means it will take a couple of months before I get to it). I'll see what I can do about it, but no promises. Better do put it on your TODO list, just in case I'm too busy. The spare time I have now I tend to use for translation work for the Debian Lenny release, which should happen Real Soon Now (tm). Fixing bugs currently has priority. I also still have a note to update the Dutch spell file, I think already for half a year... -- I noticed my daughter's Disney-net password on a sticky note: MickeyMinnieGoofyPluto. I asked her why it was so long. Because they say it has to have at least four characters. /// Bram Moolenaar -- [EMAIL PROTECTED] -- http://www.Moolenaar.net \\\ ///sponsor Vim, vote for features -- http://www.Vim.org/sponsor/ \\\ \\\download, build and distribute -- http://www.A-A-P.org/// \\\help me help AIDS victims -- http://ICCF-Holland.org/// --~--~-~--~~~---~--~~ You received this message from the vim_use maillist. For more information, visit http://www.vim.org/maillist.php -~--~~~~--~~--~--~---
Re: Trouble updating the ro spellfile
On Sun,09.Nov.08, 11:24:03, Bram Moolenaar wrote: So what about people who have iso-8859-2 files on a system that doesn't support utf-8? Would the old spell file be better than nothing? No, the old spellfile is obsolete. Or would a conversion from utf-8 to iso-8859-2 of the new spell file give better results. The hunspell-ro.3.2.tar.gz file contains an ISO-8859-2 word list. I guess it could be used to create a .spl, but it should be clearly marked as obsolete or something similar. I guess the following will be needed for this file: FOL ăâîşţ LOW ăâîşţ UPP ĂÂÎŞŢ Regards, Andrei -- If you can't explain it simply, you don't understand it well enough. (Albert Einstein) signature.asc Description: Digital signature
Trouble updating the ro spellfile
Hello everybody, Lately I've been using vim to edit .po files and I wanted to use the built-in spell checker. The .spl file available for my native language (Romanian) is... lacking, to say the least. So I tried to build a new one using the word list available here http://sourceforge.net/project/showfiles.php?group_id=144374package_id=236406 I downloaded the ro_RO.3.2.zip file, unpacked it and ran :mkspell ro ro_RO.aff Reading word file ro_RO.aff ... Compressing word tree... Compressed 13638 of 21005 nodes; 7367 (35%) remaining Compressed 13684 of 21054 nodes; 7370 (35%) remaining Writing spell file ro.utf-8.spl ... Done! Estimated runtime memory use: 79385 bytes Then I try to use it (after copying it to ~/.vim/spell/) :set spelllang=ro :set spell but it just won't work, anything I write in Romanian is highlighted as bad and z= doesn't give any suggestions. I also tried running vim with -u NONE -U NONE, but the same thing happens. Spelling with 'en' or 'de' works fine. What am I doing wrong? :r !vim --version VIM - Vi IMproved 7.2 (2008 Aug 9, compiled Oct 20 2008 19:21:29) Included patches: 1-25 Compiled by [EMAIL PROTECTED] Huge version without GUI. Features included (+) or not (-): +arabic +autocmd -balloon_eval -browse ++builtin_terms +byte_offset +cindent -clientserver -clipboard +cmdline_compl +cmdline_hist +cmdline_info +comments +cryptv +cscope +cursorshape +dialog_con +diff +digraphs -dnd -ebcdic +emacs_tags +eval +ex_extra +extra_search +farsi +file_in_path +find_in_path +float +folding -footer +fork() +gettext -hangul_input +iconv +insert_expand +jumplist +keymap +langmap +libcall +linebreak +lispindent +listcmds +localmap +menu +mksession +modify_fname +mouse -mouseshape +mouse_dec +mouse_gpm -mouse_jsbterm +mouse_netterm -mouse_sysmouse +mouse_xterm +multi_byte +multi_lang -mzscheme -netbeans_intg -osfiletype +path_extra +perl +postscript +printer +profile +python +quickfix +reltime +rightleft +ruby +scrollbind +signs +smartindent -sniff +statusline -sun_workshop +syntax +tag_binary +tag_old_static -tag_any_white +tcl +terminfo +termresponse +textobjects +title -toolbar +user_commands +vertsplit +virtualedit +visual +visualextra +viminfo +vreplace +wildignore +wildmenu +windows +writebackup -X11 -xfontset -xim -xsmp -xterm_clipboard -xterm_save system vimrc file: $VIM/vimrc user vimrc file: $HOME/.vimrc user exrc file: $HOME/.exrc fall-back for $VIM: /usr/share/vim Compilation: gcc -c -I. -Iproto -DHAVE_CONFIG_H -Wall -g -O2 -D_REENTRANT -D_GNU_SOURCE -DDEBIAN -I/usr/local/include -D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64 -I/usr/lib/perl/5.10/CORE -I/usr/include/python2.5 -pthread -I/usr/include/tcl8.4 -D_REENTRANT=1 -D_THREAD_SAFE=1 -D_LARGEFILE64_SOURCE=1 -I/usr/lib/ruby/1.8/x86_64-linux Linking: gcc -L. -rdynamic -Wl,-export-dynamic -Wl,-E -Wl,--as-needed -o vim -lncurses -lselinux -lacl -lgpm -Wl,-E -L/usr/local/lib -L/usr/lib/perl/5.10/CORE -lperl -L/usr/lib/python2.5/config -lpython2.5 -lutil -Xlinker -export-dynamic -Wl,-O1 -Wl,-Bsymbolic-functions -L/usr/lib -ltcl8.4 -lieee -lruby1.8 -lm :r !locale LANG=ro_RO.UTF-8 LC_CTYPE=ro_RO.UTF-8 LC_NUMERIC=ro_RO.UTF-8 LC_TIME=ro_RO.UTF-8 LC_COLLATE=ro_RO.UTF-8 LC_MONETARY=ro_RO.UTF-8 LC_MESSAGES=ro_RO.UTF-8 LC_PAPER=ro_RO.UTF-8 LC_NAME=ro_RO.UTF-8 LC_ADDRESS=ro_RO.UTF-8 LC_TELEPHONE=ro_RO.UTF-8 LC_MEASUREMENT=ro_RO.UTF-8 LC_IDENTIFICATION=ro_RO.UTF-8 LC_ALL= Regards, Andrei -- If you can't explain it simply, you don't understand it well enough. (Albert Einstein) signature.asc Description: Digital signature
Re: Trouble updating the ro spellfile
Andrei Popescu wrote: Hello everybody, Lately I've been using vim to edit .po files and I wanted to use the built-in spell checker. The .spl file available for my native language (Romanian) is... lacking, to say the least. So I tried to build a new one using the word list available here http://sourceforge.net/project/showfiles.php?group_id=144374package_id=236406 I downloaded the ro_RO.3.2.zip file, unpacked it and ran :mkspell ro ro_RO.aff Reading word file ro_RO.aff ... Compressing word tree... Compressed 13638 of 21005 nodes; 7367 (35%) remaining Compressed 13684 of 21054 nodes; 7370 (35%) remaining Writing spell file ro.utf-8.spl ... Done! Estimated runtime memory use: 79385 bytes Then I try to use it (after copying it to ~/.vim/spell/) :set spelllang=ro :set spell but it just won't work, anything I write in Romanian is highlighted as bad and z= doesn't give any suggestions. I also tried running vim with -u NONE -U NONE, but the same thing happens. Spelling with 'en' or 'de' works fine. What am I doing wrong? The spell file is written as utf-8. Are you sure the input is also utf-8? Perhaps the diff files for the old spell files give a few hints about what's different in the Vim spell input files: ftp://ftp.vim.org/pub/vim/runtime/spell/ro/ You can also try :spelldump, it will list all the valid words. -- hundred-and-one symptoms of being an internet addict: 213. Your kids start referring to you as that guy in front of the monitor. /// Bram Moolenaar -- [EMAIL PROTECTED] -- http://www.Moolenaar.net \\\ ///sponsor Vim, vote for features -- http://www.Vim.org/sponsor/ \\\ \\\download, build and distribute -- http://www.A-A-P.org/// \\\help me help AIDS victims -- http://ICCF-Holland.org/// --~--~-~--~~~---~--~~ You received this message from the vim_use maillist. For more information, visit http://www.vim.org/maillist.php -~--~~~~--~~--~--~---