https://bugs.freedesktop.org/show_bug.cgi?id=82418

          Priority: medium
            Bug ID: 82418
          Assignee: libreoffice-bugs@lists.freedesktop.org
           Summary: FILEOPEN: CSV (Text) Import defaults to UTF-16
                    resulting in garbled text and can freeze LibreOffice
          Severity: normal
    Classification: Unclassified
                OS: Linux (All)
          Reporter: blackcat...@gmail.com
          Hardware: Other
        Whiteboard: BSA PossibleRegression
            Status: UNCONFIRMED
           Version: 4.3.0.4 release
         Component: Spreadsheet
           Product: LibreOffice

Created attachment 104381
  --> https://bugs.freedesktop.org/attachment.cgi?id=104381&action=edit
screenshots, bash scripts to generate CSV samples, gdbtrace.log

Problem description: 

On a fresh installation, CSV/Text data import defaults to UTF-16 (at least in
my case), resulting in garbled text in the data preview. This is a change in
LibreOffice (LO) behaviour from previous releases and can be rather confusing
to less savvy users. Also, if you accidentally OK the dialogue, LO will try to
import the file this way: small files will freeze LO for a considerable amount
of time, and large files may freeze it indefinitely.

Steps to reproduce:
1. Create some test CSV files (see attachment; you can use the gensample1.sh
and gensample2.sh scripts to create 100-row and 10k-row files)
2. Open samplecsv-100.csv (2.4K) with LO
3. If the character set is UTF-16, the fields preview will be garbled (see
attachment)
4. Change character set to UTF-8: the fields preview will be as expected
5. Change back to UTF-16 and press OK: the file will be imported, but it will
take at least a couple of seconds (2.4K file), and the imported data will be
crammed into a single cell as a 'random' string
6. Now try to open the samplecsv-10k.csv (253K) file
7. Check the field preview with UTF-16 and UTF-8
8. Change back to UTF-16 and press OK
9. Have some tea, go out, go on holiday, alternatively pkill -15 soffice.bin ;)
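
For anyone without the attachment at hand, here is a minimal sketch of how
comparable sample files could be generated. It is not the attached gensample
scripts (those are bash and may differ); the function name make_sample_csv and
the three-column layout are made up for illustration, so file sizes will not
match the 2.4K / 253K figures exactly.

```python
import csv
import io


def make_sample_csv(rows: int) -> bytes:
    """Return `rows` lines of plain ASCII CSV, encoded as UTF-8.

    The column layout is hypothetical; it only needs to be ordinary
    comma-separated text without a byte order mark.
    """
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    for i in range(1, rows + 1):
        writer.writerow([i, f"name{i:05d}", f"value{i:05d}"])
    return buf.getvalue().encode("utf-8")


def write_sample(path: str, rows: int) -> None:
    """Write a sample file in binary mode so no newline translation happens."""
    with open(path, "wb") as fh:
        fh.write(make_sample_csv(rows))
```

Calling write_sample("samplecsv-100.csv", 100) and
write_sample("samplecsv-10k.csv", 10000) should give two files suitable for
steps 2 and 6.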

Does it mean that there is no sanity check on what LO is trying to import? Is
LO attempting to import the whole 253K file as a single cell?
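
A rough illustration of why a single cell is plausible, using Python's csv
module as a stand-in (this says nothing about how LO's importer is actually
written). When ASCII bytes are decoded as UTF-16, every pair of bytes becomes
one code unit, so the comma and newline bytes get absorbed into unrelated
characters and no field or row separators survive. The helper name parse_as and
the sample data are assumptions for this sketch.

```python
import csv
import io


def parse_as(data: bytes, encoding: str) -> list[list[str]]:
    """Decode `data` with `encoding` and split it into CSV rows and fields."""
    # errors="replace" keeps an odd trailing byte from raising an exception.
    text = data.decode(encoding, errors="replace")
    return list(csv.reader(io.StringIO(text, newline="")))


sample = b"id,name,value\n1,alpha,10\n2,beta,20\n"

utf8_rows = parse_as(sample, "utf-8")       # three rows, three fields each
utf16_rows = parse_as(sample, "utf-16-le")  # one row holding one garbled field

print(len(utf8_rows), len(utf16_rows), len(utf16_rows[0]))
```

With plain ASCII input there are no zero bytes, so no decoded code unit can
equal a real comma (U+002C) or line feed (U+000A); the whole file ends up as
one long field.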

Expected behaviour:
- Default 'Character set' should be set to UTF-8, maybe 'System'?
- Text import should do some (more) sanity checks before trying to actually
import the data
- There should be a progress bar and there must be a way to abort the import
(if the import freezes, all currently opened LO windows/documents will freeze
too)
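
To make the "sanity check" idea concrete, here is a sketch of the kind of
heuristic I have in mind for picking the default character set from the first
few kilobytes of the file. It is only an illustration under my own assumptions,
not LO code: the function name guess_csv_encoding, the 25% NUL-byte threshold,
and returning None to mean "fall back to the system character set" are all made
up for this example.

```python
import codecs
from typing import Optional


def guess_csv_encoding(head: bytes) -> Optional[str]:
    """Guess an encoding from the first bytes of a file (heuristic sketch).

    Returns a Python codec name, or None when the caller should fall back
    to the system character set.
    """
    # A byte order mark is the strongest hint available.
    if head.startswith(codecs.BOM_UTF8):
        return "utf-8-sig"
    if head.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return "utf-16"

    # UTF-16 text that is mostly Latin has a zero byte in every other
    # position; plain ASCII or UTF-8 text normally has none at all.
    if head and head.count(b"\x00") * 4 >= len(head):
        nul_even = head[0::2].count(0)
        nul_odd = head[1::2].count(0)
        return "utf-16-be" if nul_even > nul_odd else "utf-16-le"

    # final=False tolerates a multi-byte sequence cut off at the end of
    # the sampled chunk, but still rejects invalid byte sequences.
    decoder = codecs.getincrementaldecoder("utf-8")()
    try:
        decoder.decode(head, final=False)
    except UnicodeDecodeError:
        return None
    return "utf-8"
```

With a check along these lines, UTF-16 would only be preselected when the bytes
actually look like UTF-16, and the sample files from the attachment would come
up as UTF-8.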

Problem impairs:
usability / user experience, may freeze LibreOffice


Operating System: Linux (Other)
Version: 4.3.0.4 release
Last worked in: 4.1.3.2 release

-- 
You are receiving this mail because:
You are the assignee for the bug.
_______________________________________________
Libreoffice-bugs mailing list
Libreoffice-bugs@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs
