|
Hello! I am preparing to release V2.0 of my tool "Proofing Tool GUI" on the 1st of July. I have optimised the code a lot and, in Windows, it just takes a few seconds to open the big th_en_US_v2.dat (18 MB). I still have some issues in Linux due to a big pause after opening the thesaurus/dictionaries. I have been able to locate what is causing it, but even that way there is still a pause. For example: the PT-pt thesaurus (12940 words) takes one second to open on my Ubuntu 12 x86 VM but then there is a 4 minutes pause before it can be edited. I tried changing the code and it now only takes 40 seconds on my VM. This is still slow but, on a real Linux system, I believe it will be much faster. Also, the good news is that I believe my tool now works on all Linuxes. My question in this e-mail is because the files need to be in UTF-8 to be edited with Proofing Tool GUI and I was explaining in the user guide how to do it using UniRed Editor [1]. Then, I found out that UniRed messes the number of lines in some files, so I tried to find a solution. A friend of mine who is an expert in coding suggested NotePad++ [2] and I made a few tests with it and it works perfectly, so I need to improve the user guide to explain how to use it. But I have a question: when I convert the files to UTF-8, which option shall I use?: 1) Convert to UTF-8 without BOM 2) Convert to UTF-8 I am not 100% sure which one is the most correct. [1] http://www.esperanto.mv.ru/UniRed/ENG/index.html [2] http://notepad-plus-plus.org Thank you very much! Kind regards from, >Marco A.G.Pinto ----------------------- --
![]() |
- Converting Thesurus (.DAT), Spellers (.DIC + .AFF) to UTF-... Marco A.G.Pinto
- Re: Converting Thesurus (.DAT), Spellers (.DIC + .AFF... Andrea Pescetti

