Bug#979439: hunspell-it: Italian spell check cannot recognize simple words

2021-01-14 Thread g.l. gragnani

Yes, it works fine!
Thank you very much for this quick fix.
Gian Luigi

On 1/14/21 9:50 PM, Rene Engelhard wrote:

recode ISO8859-15...UTF-8 /usr/share/hunspell/it_IT.dic


--
Gian Luigi Gragnani
Diten - University of Genoa
Via Opera Pia 11A, 16145 Genova, Italy
phone: + 39 010 33 52756



Bug#979439: hunspell-it: Italian spell check cannot recognize simple words

2021-01-14 Thread Rene Engelhard
Hi again,

Am 13.01.21 um 20:59 schrieb Rene Engelhard:
> Am 13.01.21 um 19:43 schrieb Antonio:
>> I solved it by changing this line of the file
>> "/usr/share/hunspell/it_IT.aff":
>>
>> --- it_IT.aff.orig.bad  2021-01-13 19:28:38.184457124 +0100
>> +++ it_IT.aff.new.ok   2021-01-13 19:38:06.141194317 +0100
>> @@ -36,7 +36,7 @@
>>  LANG it_IT
>>  HOME https://libreitalia.org
>>  VERSION 5.1.0 (13/10/2020)
>> -SET UTF-8
>> +SET ISO8859-15
>>  TRY
>> aioertnsclmdpgubzfvhàq'ACMSkBGPLxEyRTVòIODNwFéùÚìjUZKHWJYQXÃÃ
>>  # Nota: non presenti nel dizionario: ÃÃ, vanno aggiunte alla TRY
>> rigenerata
>>
> Interesting.
>
> rene@frodo:~/LibreOffice/git/master/dictionaries/it_IT$ file it_IT.aff
> it_IT.aff: UTF-8 Unicode text
> rene@frodo:~/LibreOffice/git/master/dictionaries/it_IT$ file it_IT.dic
> it_IT.dic: ISO-8859 text
> $
>
>
> Someone wasn't able to decide ;-). I'd argue that UTF-8 is the future
> and that would possible mean a simply recode of it_IT.dic to UTF-8
> should also fix it?

In a quick try here with the UDHR in Italian as suggested by the
reporter a simple

$ sudo recode ISO8859-15...UTF-8 /usr/share/hunspell/it_IT.dic

makes it work indeed.


Regards,


Rene



Bug#979439: hunspell-it: Italian spell check cannot recognize simple words

2021-01-13 Thread Rene Engelhard
Hi Antonio,

Am 13.01.21 um 19:43 schrieb Antonio:
> I solved it by changing this line of the file
> "/usr/share/hunspell/it_IT.aff":
>
> --- it_IT.aff.orig.bad  2021-01-13 19:28:38.184457124 +0100
> +++ it_IT.aff.new.ok   2021-01-13 19:38:06.141194317 +0100
> @@ -36,7 +36,7 @@
>  LANG it_IT
>  HOME https://libreitalia.org
>  VERSION 5.1.0 (13/10/2020)
> -SET UTF-8
> +SET ISO8859-15
>  TRY
> aioertnsclmdpgubzfvhàq'ACMSkBGPLxEyRTVòIODNwFéùÚìjUZKHWJYQXÃÃ
>  # Nota: non presenti nel dizionario: ÃÃ, vanno aggiunte alla TRY
> rigenerata
>
Interesting.

rene@frodo:~/LibreOffice/git/master/dictionaries/it_IT$ file it_IT.aff
it_IT.aff: UTF-8 Unicode text
rene@frodo:~/LibreOffice/git/master/dictionaries/it_IT$ file it_IT.dic
it_IT.dic: ISO-8859 text
$


Someone wasn't able to decide ;-). I'd argue that UTF-8 is the future
and that would possible mean a simply recode of it_IT.dic to UTF-8
should also fix it?

I probably would prefer that over patching the .aff to use the old
ISO8859-15.

Regards,


Rene



Bug#979439: hunspell-it: Italian spell check cannot recognize simple words

2021-01-13 Thread Antonio
I solved it by changing this line of the file 
"/usr/share/hunspell/it_IT.aff":


--- it_IT.aff.orig.bad  2021-01-13 19:28:38.184457124 +0100
+++ it_IT.aff.new.ok   2021-01-13 19:38:06.141194317 +0100
@@ -36,7 +36,7 @@
 LANG it_IT
 HOME https://libreitalia.org
 VERSION 5.1.0 (13/10/2020)
-SET UTF-8
+SET ISO8859-15
 TRY 
aioertnsclmdpgubzfvhàq'ACMSkBGPLxEyRTVòIODNwFéùÚìjUZKHWJYQXÃÃ
 # Nota: non presenti nel dizionario: ÃÃ, vanno aggiunte alla TRY 
rigenerata




Bug#979439: hunspell-it: Italian spell check cannot recognize simple words

2021-01-13 Thread Rene Engelhard
Hi,

Am 13.01.21 um 10:36 schrieb g.l. gragnani:
> Package: hunspell-it
> Version: 1:7.1.0~rc1-1
> Followup-For: Bug #979439
>
> Dear Maintainer,
>
> I can confirm this bug for both LibreOffice and Thunderbird  (1:78.6.0-1)
> According to the program isutf8 (package moreutils)
> the file /usr/share/hunspell/it_IT.dic IS NOT utf-8 encoded.
> Could it be the problem?

Could theoretically be, yes.

But then I wonder why it's claimed to work with a stock LO, which
doesn't specify any encoding either to that file.


And it_IT.dic is not the only one:


rene@frodo:~/LibreOffice/git/master/dictionaries$ find . -name "*.dic"
-exec file {} \; | grep -v UTF
./en/en_AU.dic: ASCII text
./en/en_US.dic: ASCII text
./en/en_CA.dic: ASCII text
./sw_TZ/sw_TZ.dic: ASCII text
./lt_LT/hyph_lt.dic: ISO-8859 text
./lt_LT/lt.dic: ISO-8859 text
./et_EE/et_EE.dic: ISO-8859 text
./et_EE/hyph_et_EE.dic: ISO-8859 text
./el_GR/hyph_el_GR.dic: ISO-8859 text
./el_GR/el_GR.dic: ISO-8859 text
./sr/hyph_sr.dic: ISO-8859 text
./sr/hyph_sr-Latn.dic: ISO-8859 text
./an_ES/an_ES.dic: data
./nl_NL/hyph_nl_NL.dic: troff or preprocessor input, ISO-8859 text
./lv_LV/hyph_lv_LV.dic: ISO-8859 text
./sk_SK/hyph_sk_SK.dic: ISO-8859 text
./pt_PT/hyph_pt_PT.dic: ISO-8859 text
./pl_PL/pl_PL.dic: ISO-8859 text
./pl_PL/hyph_pl_PL.dic: troff or preprocessor input, ISO-8859 text
./da_DK/hyph_da_DK.dic: ISO-8859 text
./gl/hyph_gl.dic: troff or preprocessor input, ASCII text
./id/hyph_id_ID.dic: ASCII text
./id/id_ID.dic: ASCII text
./de/hyph_de_DE.dic: ISO-8859 text
./de/hyph_de_AT.dic: ISO-8859 text
./de/de_AT_frami.dic: ISO-8859 text
./de/de_CH_frami.dic: ISO-8859 text
./de/hyph_de_CH.dic: ISO-8859 text
./de/de_DE_frami.dic: ISO-8859 text
./no/nb_NO.dic: ISO-8859 text
./no/nn_NO.dic: ISO-8859 text
./no/hyph_nb_NO.dic: troff or preprocessor input, ISO-8859 text
./no/hyph_nn_NO.dic: troff or preprocessor input, ISO-8859 text
./bs_BA/bs_BA.dic: ISO-8859 text
./hr_HR/hyph_hr_HR.dic: ISO-8859 text
./cs_CZ/cs_CZ.dic: ISO-8859 text
./cs_CZ/hyph_cs_CZ.dic: troff or preprocessor input, ISO-8859 text
./zu_ZA/hyph_zu_ZA.dic: ASCII text
./sl_SI/sl_SI.dic: ISO-8859 text
./sl_SI/hyph_sl_SI.dic: ISO-8859 text
./af_ZA/hyph_af_ZA.dic: ISO-8859 text
./it_IT/hyph_it_IT.dic: ASCII text
./it_IT/it_IT.dic: ISO-8859 text
./sv_SE/hyph_sv.dic: ISO-8859 text
$


Regards,


Rene



Bug#979439: hunspell-it: Italian spell check cannot recognize simple words

2021-01-13 Thread g.l. gragnani
Package: hunspell-it
Version: 1:7.1.0~rc1-1
Followup-For: Bug #979439

Dear Maintainer,

I can confirm this bug for both LibreOffice and Thunderbird  (1:78.6.0-1)
According to the program isutf8 (package moreutils)
the file /usr/share/hunspell/it_IT.dic IS NOT utf-8 encoded.
Could it be the problem?
Regards.
G.L. G.

-- System Information:
Debian Release: bullseye/sid
  APT prefers unstable
  APT policy: (500, 'unstable')
Architecture: amd64 (x86_64)

Kernel: Linux 5.10.0-1-amd64 (SMP w/4 CPU threads)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), 
LANGUAGE=en_US:en
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages hunspell-it depends on:
ii  dictionaries-common  1.28.3

hunspell-it recommends no packages.

Versions of packages hunspell-it suggests:
pn  hunspell
ii  libreoffice-writer  1:7.0.4-3

-- no debconf information



Bug#979439: hunspell-it: Italian spell check cannot recognize simple words

2021-01-06 Thread Alessio Paonessa
Package: hunspell-it
Version: 1:7.1.0~rc1-1
Severity: normal
X-Debbugs-Cc: livm...@hotmail.com

Dear Maintainer,

the Italian spell check is broken and cannot recognize common words like "di"
or "è" and other terms with apostrophe or accent.

It is possible to notice the problem simply copying the text of the Universal
Declaration of Human Rights in Italian and trying a spell check on it. It will
highlight in applications like LibreOffice or Gedit some part of the text even
if it is right.

Normally I should expect no mistake in the Declaration.

If I try to use aspell-it in Gedit, the terms are correctly checked. Also the
AppImage version of LibreOffice is not affected. My Debian version is Bullseye
freshly updated.

Thank you in advance,

Best Regards



-- System Information:
Debian Release: bullseye/sid
  APT prefers testing
  APT policy: (500, 'testing'), (1, 'experimental')
Architecture: amd64 (x86_64)

Kernel: Linux 5.9.0-5-amd64 (SMP w/12 CPU threads)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), LANGUAGE not set
Shell: /bin/sh linked to /usr/bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages hunspell-it depends on:
ii  dictionaries-common  1.28.3

hunspell-it recommends no packages.

Versions of packages hunspell-it suggests:
ii  hunspell1.7.0-3
ii  libreoffice-writer  1:7.1.0~rc1-1

-- no debconf information