https://bugs.documentfoundation.org/show_bug.cgi?id=149462

            Bug ID: 149462
           Summary: LibreOffice Still Saving UTF-8 Plain Text Document as
                    ANSI
           Product: LibreOffice
           Version: 7.3.0.3 release
          Hardware: x86-64 (AMD64)
                OS: Windows (All)
            Status: UNCONFIRMED
          Severity: normal
          Priority: medium
         Component: Writer
          Assignee: libreoffice-bugs@lists.freedesktop.org
          Reporter: labfm...@guerrillamail.org

Description:

This is a follow up to the below issue (which was reported fixed in 7.3.0):
https://bugs.documentfoundation.org/show_bug.cgi?id=142956

All plain text files (*.txt) I use are encoded in UTF-8 (without BOM) format –
therefore using the same file across multiple different applications is
ordinarily seamless as utf8NoBOM is pretty standard now.  However, if I then
use LibreOffice to check spelling of a plain text document for example, it
creates an issue because when saving the file (CTRL + S), LibreOffice can
change it to ANSI encoding, instead of keeping it as UTF-8 (without BOM).

If the plain text file only contains ASCII printable characters (character code
32-127), LibreOffice 7.3 correctly saves the file as UTF-8 (without BOM). 
However, if any extended ASCII codes (character code 128-255) are added (I.E. €
… “ ” – —) and the file is saved, it changes the document to ANSI – which I
don't want as every other application is expecting a document encoded in UTF-8
(without BOM) – meaning they now show invalid characters in some applications
and is inconsistent.

I want to always save plain text files in UTF-8 (without BOM).  Saving the file
by going to [File > Save as > Text - Choose Encoding] is slow, impractical and
relies on remembering to do it.  I think LibreOffice needs a setting in [Tools
> Options > Load/Save > General] that allows users to set the default encoding
they want to use when saving plain text files – with the default setting being
UTF-8 (without BOM).


Steps to Reproduce:

Steps to Reproduce:

Working Correctly:

1) Create a new plain text document in a text editor, such as Windows Notepad
or Visual Studio Code.

2) Add the below text.

Spell check:  Lores sump dolor sit meat, emus no duo, obit verger fed an.
Fabulous porticoes core rum pit nu tied, in enc more commode mandamus. Eli tar
principle complemented ea is.

3) Save the file with UTF-8 (without BOM) encoding – which is the default in
Windows Notepad and Visual Studio Code.

4) Open the file in LibreOffice, modify the document (using only ASCII
printable characters (I.E. a-z A-Z 0-9)), then save the file [CTRL + S] or
[File > Save].

5) Open the file in Windows Notepad.  The encoding remains as UTF-8 (without
BOM), which is correct.

6) Open the file in Visual Studio Code.  The text displays correctly.

7) Open the file in Notepad++.  The text displays correctly.


Not Working Correctly:

1) Create a new plain text document in a text editor, such as Windows Notepad
or Visual Studio Code.

2) Add the below text.

Spell check:  Lores sump dolor sit meat, emus no duo, obit verger fed an.
Fabulous porticoes core rum pit nu tied, in enc more commode mandamus. Eli tar
principle complemented ea is.

3) Save the file with UTF-8 (without BOM) encoding, which is the default in
Windows Notepad and Visual Studio Code.

4) Open the file in LibreOffice, modify the document (using extended ASCII
codes (I.E. € … “ ” – —)), then save the file [CTRL + S] or [File > Save].

5) Open the file in Windows Notepad.  The encoding has been changed from UTF-8
(without BOM), to ANSI.

6) Open the file in Visual Studio Code.  The text shows invalid characters if
VSCode is set to use UTF-8 (without BOM), as the file is now ANSI.

7) Open the file in Notepad++.  The encoding has been changed from UTF-8
(without BOM), to ANSI.


Actual Results:
LibreOffice 7.3 changes the encoding from UTF-8 (without BOM), to ANSI when
saving (after adding extended ASCII characters to document).

Expected Results:
LibreOffice should leave the encoding as UTF-8 (without BOM).  Any new
LibreOffice documents should also be saved as UTF-8 (without BOM) when saving
as *.txt.


Reproducible: Always


User Profile Reset: Yes



Additional Info:
ASCII printable characters:
https://www.ascii-code.com/

Video:
See two attached MP4 video demonstrating the issue.

Info:
Additional Info:
Version: 7.3.3.2 (x64) / LibreOffice Community
Build ID: d1d0ea68f081ee2800a922cac8f79445e4603348
CPU threads: 4; OS: Windows 10.0 Build 19044; UI render: Skia/Raster; VCL: win
Locale: en-GB (en_GB); UI: en-GB
Calc: threaded

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to