Bug#979439: hunspell-it: Italian spell check cannot recognize simple words
Yes, it works fine! Thank you very much for this quick fix. Gian Luigi On 1/14/21 9:50 PM, Rene Engelhard wrote: recode ISO8859-15...UTF-8 /usr/share/hunspell/it_IT.dic -- Gian Luigi Gragnani Diten - University of Genoa Via Opera Pia 11A, 16145 Genova, Italy phone: + 39 010 33 52756
Bug#979439: hunspell-it: Italian spell check cannot recognize simple words
Hi again, Am 13.01.21 um 20:59 schrieb Rene Engelhard: > Am 13.01.21 um 19:43 schrieb Antonio: >> I solved it by changing this line of the file >> "/usr/share/hunspell/it_IT.aff": >> >> --- it_IT.aff.orig.bad 2021-01-13 19:28:38.184457124 +0100 >> +++ it_IT.aff.new.ok 2021-01-13 19:38:06.141194317 +0100 >> @@ -36,7 +36,7 @@ >> LANG it_IT >> HOME https://libreitalia.org >> VERSION 5.1.0 (13/10/2020) >> -SET UTF-8 >> +SET ISO8859-15 >> TRY >> aioertnsclmdpgubzfvhà q'ACMSkBGPLxEyRTVòIODNwFéùÚìjUZKHWJYQXÃà >> # Nota: non presenti nel dizionario: ÃÃ, vanno aggiunte alla TRY >> rigenerata >> > Interesting. > > rene@frodo:~/LibreOffice/git/master/dictionaries/it_IT$ file it_IT.aff > it_IT.aff: UTF-8 Unicode text > rene@frodo:~/LibreOffice/git/master/dictionaries/it_IT$ file it_IT.dic > it_IT.dic: ISO-8859 text > $ > > > Someone wasn't able to decide ;-). I'd argue that UTF-8 is the future > and that would possible mean a simply recode of it_IT.dic to UTF-8 > should also fix it? In a quick try here with the UDHR in Italian as suggested by the reporter a simple $ sudo recode ISO8859-15...UTF-8 /usr/share/hunspell/it_IT.dic makes it work indeed. Regards, Rene
Bug#979439: hunspell-it: Italian spell check cannot recognize simple words
Hi Antonio, Am 13.01.21 um 19:43 schrieb Antonio: > I solved it by changing this line of the file > "/usr/share/hunspell/it_IT.aff": > > --- it_IT.aff.orig.bad 2021-01-13 19:28:38.184457124 +0100 > +++ it_IT.aff.new.ok 2021-01-13 19:38:06.141194317 +0100 > @@ -36,7 +36,7 @@ > LANG it_IT > HOME https://libreitalia.org > VERSION 5.1.0 (13/10/2020) > -SET UTF-8 > +SET ISO8859-15 > TRY > aioertnsclmdpgubzfvhà q'ACMSkBGPLxEyRTVòIODNwFéùÚìjUZKHWJYQXÃà > # Nota: non presenti nel dizionario: ÃÃ, vanno aggiunte alla TRY > rigenerata > Interesting. rene@frodo:~/LibreOffice/git/master/dictionaries/it_IT$ file it_IT.aff it_IT.aff: UTF-8 Unicode text rene@frodo:~/LibreOffice/git/master/dictionaries/it_IT$ file it_IT.dic it_IT.dic: ISO-8859 text $ Someone wasn't able to decide ;-). I'd argue that UTF-8 is the future and that would possible mean a simply recode of it_IT.dic to UTF-8 should also fix it? I probably would prefer that over patching the .aff to use the old ISO8859-15. Regards, Rene
Bug#979439: hunspell-it: Italian spell check cannot recognize simple words
I solved it by changing this line of the file "/usr/share/hunspell/it_IT.aff": --- it_IT.aff.orig.bad 2021-01-13 19:28:38.184457124 +0100 +++ it_IT.aff.new.ok 2021-01-13 19:38:06.141194317 +0100 @@ -36,7 +36,7 @@ LANG it_IT HOME https://libreitalia.org VERSION 5.1.0 (13/10/2020) -SET UTF-8 +SET ISO8859-15 TRY aioertnsclmdpgubzfvhà q'ACMSkBGPLxEyRTVòIODNwFéùÚìjUZKHWJYQXÃà # Nota: non presenti nel dizionario: ÃÃ, vanno aggiunte alla TRY rigenerata
Bug#979439: hunspell-it: Italian spell check cannot recognize simple words
Hi, Am 13.01.21 um 10:36 schrieb g.l. gragnani: > Package: hunspell-it > Version: 1:7.1.0~rc1-1 > Followup-For: Bug #979439 > > Dear Maintainer, > > I can confirm this bug for both LibreOffice and Thunderbird (1:78.6.0-1) > According to the program isutf8 (package moreutils) > the file /usr/share/hunspell/it_IT.dic IS NOT utf-8 encoded. > Could it be the problem? Could theoretically be, yes. But then I wonder why it's claimed to work with a stock LO, which doesn't specify any encoding either to that file. And it_IT.dic is not the only one: rene@frodo:~/LibreOffice/git/master/dictionaries$ find . -name "*.dic" -exec file {} \; | grep -v UTF ./en/en_AU.dic: ASCII text ./en/en_US.dic: ASCII text ./en/en_CA.dic: ASCII text ./sw_TZ/sw_TZ.dic: ASCII text ./lt_LT/hyph_lt.dic: ISO-8859 text ./lt_LT/lt.dic: ISO-8859 text ./et_EE/et_EE.dic: ISO-8859 text ./et_EE/hyph_et_EE.dic: ISO-8859 text ./el_GR/hyph_el_GR.dic: ISO-8859 text ./el_GR/el_GR.dic: ISO-8859 text ./sr/hyph_sr.dic: ISO-8859 text ./sr/hyph_sr-Latn.dic: ISO-8859 text ./an_ES/an_ES.dic: data ./nl_NL/hyph_nl_NL.dic: troff or preprocessor input, ISO-8859 text ./lv_LV/hyph_lv_LV.dic: ISO-8859 text ./sk_SK/hyph_sk_SK.dic: ISO-8859 text ./pt_PT/hyph_pt_PT.dic: ISO-8859 text ./pl_PL/pl_PL.dic: ISO-8859 text ./pl_PL/hyph_pl_PL.dic: troff or preprocessor input, ISO-8859 text ./da_DK/hyph_da_DK.dic: ISO-8859 text ./gl/hyph_gl.dic: troff or preprocessor input, ASCII text ./id/hyph_id_ID.dic: ASCII text ./id/id_ID.dic: ASCII text ./de/hyph_de_DE.dic: ISO-8859 text ./de/hyph_de_AT.dic: ISO-8859 text ./de/de_AT_frami.dic: ISO-8859 text ./de/de_CH_frami.dic: ISO-8859 text ./de/hyph_de_CH.dic: ISO-8859 text ./de/de_DE_frami.dic: ISO-8859 text ./no/nb_NO.dic: ISO-8859 text ./no/nn_NO.dic: ISO-8859 text ./no/hyph_nb_NO.dic: troff or preprocessor input, ISO-8859 text ./no/hyph_nn_NO.dic: troff or preprocessor input, ISO-8859 text ./bs_BA/bs_BA.dic: ISO-8859 text ./hr_HR/hyph_hr_HR.dic: ISO-8859 text ./cs_CZ/cs_CZ.dic: ISO-8859 text ./cs_CZ/hyph_cs_CZ.dic: troff or preprocessor input, ISO-8859 text ./zu_ZA/hyph_zu_ZA.dic: ASCII text ./sl_SI/sl_SI.dic: ISO-8859 text ./sl_SI/hyph_sl_SI.dic: ISO-8859 text ./af_ZA/hyph_af_ZA.dic: ISO-8859 text ./it_IT/hyph_it_IT.dic: ASCII text ./it_IT/it_IT.dic: ISO-8859 text ./sv_SE/hyph_sv.dic: ISO-8859 text $ Regards, Rene
Bug#979439: hunspell-it: Italian spell check cannot recognize simple words
Package: hunspell-it Version: 1:7.1.0~rc1-1 Followup-For: Bug #979439 Dear Maintainer, I can confirm this bug for both LibreOffice and Thunderbird (1:78.6.0-1) According to the program isutf8 (package moreutils) the file /usr/share/hunspell/it_IT.dic IS NOT utf-8 encoded. Could it be the problem? Regards. G.L. G. -- System Information: Debian Release: bullseye/sid APT prefers unstable APT policy: (500, 'unstable') Architecture: amd64 (x86_64) Kernel: Linux 5.10.0-1-amd64 (SMP w/4 CPU threads) Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), LANGUAGE=en_US:en Shell: /bin/sh linked to /bin/dash Init: systemd (via /run/systemd/system) LSM: AppArmor: enabled Versions of packages hunspell-it depends on: ii dictionaries-common 1.28.3 hunspell-it recommends no packages. Versions of packages hunspell-it suggests: pn hunspell ii libreoffice-writer 1:7.0.4-3 -- no debconf information
Bug#979439: hunspell-it: Italian spell check cannot recognize simple words
Package: hunspell-it Version: 1:7.1.0~rc1-1 Severity: normal X-Debbugs-Cc: livm...@hotmail.com Dear Maintainer, the Italian spell check is broken and cannot recognize common words like "di" or "è" and other terms with apostrophe or accent. It is possible to notice the problem simply copying the text of the Universal Declaration of Human Rights in Italian and trying a spell check on it. It will highlight in applications like LibreOffice or Gedit some part of the text even if it is right. Normally I should expect no mistake in the Declaration. If I try to use aspell-it in Gedit, the terms are correctly checked. Also the AppImage version of LibreOffice is not affected. My Debian version is Bullseye freshly updated. Thank you in advance, Best Regards -- System Information: Debian Release: bullseye/sid APT prefers testing APT policy: (500, 'testing'), (1, 'experimental') Architecture: amd64 (x86_64) Kernel: Linux 5.9.0-5-amd64 (SMP w/12 CPU threads) Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), LANGUAGE not set Shell: /bin/sh linked to /usr/bin/dash Init: systemd (via /run/systemd/system) LSM: AppArmor: enabled Versions of packages hunspell-it depends on: ii dictionaries-common 1.28.3 hunspell-it recommends no packages. Versions of packages hunspell-it suggests: ii hunspell1.7.0-3 ii libreoffice-writer 1:7.1.0~rc1-1 -- no debconf information