Hi John, John Snow va escriure: > > I am speaking to a client regarding there website being translated in > to a number of languages including Bengali, Urdu and Punjabi which I > am told is not very well supported by Unicode.
This is not true. These languages are supported by Unicode, since the first public version, more than 10 years ago. However, they use so-called complex scripts, which are not easy to "write" with a computer. The fact they are mainly used outside of the major industrial countries did not help on this respect, as you might guess. As a result, it is still quite difficult to have a readable web site written with these languages *and* viewable without impairment. Urdu is written with the so-called Arabic script, but in a way (`style') that is visually quite distinct from the Arabic you might have already seen (Urdu is written in Nastaliq style as opposed to Naskh style usually used for the Arabic language). This requires usually distincts fonts, which are harder to found and much less easily available than Naskh fonts. Of course, your potential clients will probably have the fonts, but the key point here is the potential diversity you may encounter. As I understand things (I did not test all the browsers that Edward indicated), the ultimate versions of the browsers should be able to work, provided the correct fonts are here (for example, Arial Unicode MS from Office XP is not sufficient here: it only has Naskh style). The other solutions outside Unicode (using the so-called "font hacks") are *not* going to give you better results, I would guess. And furthermore the future of such solutions is bad, since everyone is moving toward Unicode. Regarding Punjabi, this is probably the easiest. Depending on the country, Punjabi is written either with the Arabic script (this is called Shahmukhi; can used Naskh style) or in a dedicated script (called Gurmukhi), which is not as easy to write as Latin, but is not overwhelmy complex either. The key point here is this is not a #1 priority given the "market". As a result, I am positive recent version of IE is able to display it (with Arial Unicode MS as font). With other browsers, well, things are progressing, but there is still work to be done. Particularly on "alternative" OS such as Linux and likewise, since the OS is right now without native support for this script. On the other hand, MacOS has Gurmukhi support for many years, so I expect less problems on this platform (but I cannot be positive, my own Mac is far too old). In Gecko this is something which is under development; I did not check ultimately if it works. In Opera, it does work provided the OS supports it (in fact, in Opera in general there is nothing specific about any of these scripts, provided you are using Unicode and not some "font-hack", and provided of course there is OS support). So in general, pronostic about Punjabi is quite good (but keep in mind you may have two versions, one in Shahmukhi for "Pakistan", one in Gurmukhi for "India"). I left Bengali for the end, despite being among the 5 most spoken languages in the world. Bengali is much more complex to write. Font disponibility is scarse (for example, I do not have access to any of release quality). Microsoft is definitively working in this area, so things will get better on the Windows platform within the next years (alternatively, you can read it saying there is still work to be done). On Linux, there is a group of enthousiasts that are working hard to get results; due to the way things are set up, their results are likely to be reusable on MacOS too. Depending on the size of your project, going for "font hack" or other similar solutions such as iPlugin from CDAC might be better alternatives at short term. On the long term, of course, as above, Unicode is the correct solution. Again, please note Unicode does not hold any "culprit": Unicode is a standard to encode texts. As such, it just works for these languages. And it is definitively the standard that all browsers do follow to interpret the "texts" they are sent. However, another thing is making this correctly visible on screen. And this is where you may encounter difficulties (particularly with Bengali). Hope this helps, Antoine