Re: [LegacyUG] Using special alphabet characters in Legacy

Chris Hill Wed, 27 Nov 2019 03:36:49 -0800

Hi John

I will accept that in my 50 years experience as a developer, IT Managerand consultant I never had the need to use VB6. Therefore I was equallypuzzled by the limitations that the Legacy developers seem to haveapplied, given that by the late 1990s Unicode support was becomingcommon. So I did a web search for 'VB 6 Unicode' and found this websiteat VBForums:


http://www.vbforums.com/showthread.php?365738-Classic-VB-Does-Visual-Basic-6-support-Unicode

Note that the initial entry, in 17 Jun 2008, explicitly refers to theconversion of 16-bit characters (aka Unicode) to ANSI in API interfacesand file saves. It then extends to discuss the development of Unicodebased API extensions and the use of code sets. There are then a numberof later posts regarding methods to implement Unicode in VB6.

If we believe this, and I have no reason not to, it would appear thatthe Legacy developers were limited to the ANSI character set, unlessthey were prepared to develop or acquire Unicode based APIs andinterfaces.

This has been an issue with Legacy for many years and even the Legacysupport staff agree that they are limited to ANSI.


___

Hi Otto

I am running the latest version of Legacy, 9.0.0.332, on a 64-bit Win 10system using the English (United Kingdom) display language - whichversion are you running with.

If I create a new person and enter into his name fields via the Alt+9999sequence in the range Alt+0032 to Alt+0255 then Legacy will accept theninto the field on the display, excluding later checks that Legacy mightwant to apply if I Save the record. However, if I continue withAlt+0256, Alt+0257 .... onwards, then the additional characters are notincluded in the fields, and usually respond with a beep. Hence, thosefields are limited to the ANSI character set.

Equally, if I create a new event and then type into the Notes field,then I can happily ENTER additional characters, thus I can ENTER andhave DISPLAYED the following,

Alt-0250 to Alt-0255 : úûüýþÿ - this is the Unicode set for u withAcute, Circumfles and Diaeresis, y with Acute, lower case Thorn and ywith Diaeresis

Alt-0256 onwards, using Copy from Character map : ĀāĂăĄąĆćĈĉĊċČč - someof these do not work using the Alt-9999 format, these are the upper andlower case pairs of A with Macron, Breve and Ogonek, and C with Acute,Circumflex, Dot Above and Caron.

So, text fields will accept, as input, characters past the ANSI set,while control fields will not, or will convert them to ?. That is goodUNTIL you want to save the data. Do that and then open the event - ALLof the characters after Alt-0255 are missing, so it will ONLY savewithin the ANSI set.

Also, if you refer to the Special Character set, click on the solidsquare at the top of the bar to the left of the Notes field in theEvent, and you will see a list of the characters that Legacy will accept- this should be a match to the Alt-0032 to Alt-0256 set from theWindows Character Map. As far as I can tell it does, but my version ofChar Map misses U+007F through U+009F if set to Unicode, and does showthem if set to Windows:Western with exceptions.

This points us to the question of code sets, which were designed in the1980s to enable different glyphs to be used for characters in theAlt-0128 to Alt-0255 range to be used to cover multiple alphabets usingthe same character code to represent different glyphs. Good, so long asyou only work within a single code set, and difficult if not unless youcan deal with changing the code set on the fly. Of course, Unicode wasthe solution for that, enabling the extension of the character set from8-bits to 16- and 32-bits, so long as the programmer KNEW which versionwas in use - there were multiple versions in the early days.

Now, Otto, I see from your response below that you seem to be based inFinland. If I convert my Character Map to show the Windows:CentralEurope set I can see that the character set for the range from Alt-0128onwards is different, and includes the C with Acute, Cedilla and Caronmarks. Within the Western set only the version with Cedilla is present -C with Acute becomes the Æ glyph ( a combined AE glyph) and the C withCaron becomes the È glyph (E with Grave). In Unicode all of the C withAcute, Circumflex, Dot Above and Caron glyphs are in the range U+0106 toU+010D and outside of the ANSI set within a Western code set.

___

All of this indicates that Legacy was developed in the 1990s, withinAmerica, with the traditional mindset of ANSI and the Western Europecode set, and perhaps supports different code sets if you change thedefault language set, and has never been extended to support Unicode.


Regards

Chris

------ Original Message ------
From: "Otso Havu" <otso.h...@gmail.com>
To: "Legacy User Group" <legacyusergroup@legacyusers.com>
Sent: 27/11/2019 01:00:15
Subject: Re: [LegacyUG] Using special alphabet characters in Legacy

Legacy accepts Unicode from Win CharMap eg. in the Notes field, i just tried

ke 27. marrask. 2019 klo 1.48 John Cardinal (jfcardi...@gmail.com) kirjoitti:

Chris Hill wrote:

> NO, the issue is that native VB6 only uses API interfaces

> that are limited to ANSI.

Chris,

That's not true. I've built a dozen or so commercial programs in VB6 that
read/write/manipulate Unicode data, including reading and writing UTF-8 files,
presenting data entry screens, running reports, etc. A few are still being
used. Perhaps there are old third-party components that are not Unicode-aware,
but VB6 was certainly not limited to ANSI, API or otherwise. VB^ used in MS
Access? I have only done some light scripting there so I don't know what issue
Unicode may have presented there.

The string type in VB6 used "wide" characters (as they were called then), 16-bits each. It's true that one
had to be careful when calling Windows APIs directly and call the correct "W" methods. Calling Windows APIs
directly was probably less than 15% of the code in my applications. To write a UTF-8 file, one had to use the
"binary" version of the Open File statement ("Open FileSpec For Binary Access Write …") and then
convert the chars to UTF-8 before using the PUT call to write the text, but any reasonably-accomplished VB6 programmer
knew how to do that. The details escape me, but using binary I/O may only have been necessary when low-level control
was required, such as reading/writing the Byte-Order-Mark (BOM).

Of course, VB6 was very easy to use and so there were a lot of junior-level
programmers who may have been flummoxed by working with Unicode, but that's not
the fault of VB6.

John

LegacyUserGroup mailing list
LegacyUserGroup@legacyusers.com
To manage your subscription and unsubscribe
http://legacyusers.com/mailman/listinfo/legacyusergroup_legacyusers.com
Archives at:
http://www.mail-archive.com/legacyusergroup@legacyusers.com/




--
Otso Havu +358 50 5534170  +358 50 408 2248
otso.havu..gmail.com
Sulkapolku 6 B 13
FI-00370 Helsinki
Finland
skype:  otso.havu

--

LegacyUserGroup mailing list
LegacyUserGroup@legacyusers.com
To manage your subscription and unsubscribe 
http://legacyusers.com/mailman/listinfo/legacyusergroup_legacyusers.com
Archives at:
http://www.mail-archive.com/legacyusergroup@legacyusers.com/

-- 

LegacyUserGroup mailing list
LegacyUserGroup@legacyusers.com
To manage your subscription and unsubscribe 
http://legacyusers.com/mailman/listinfo/legacyusergroup_legacyusers.com
Archives at:
http://www.mail-archive.com/legacyusergroup@legacyusers.com/

Re: [LegacyUG] Using special alphabet characters in Legacy

Reply via email to