Re: [OT] The Case Against... Unicode?

2016-06-01 Thread Kagamin via Digitalmars-d

On Wednesday, 1 June 2016 at 16:26:36 UTC, deadalnix wrote:
On Wednesday, 1 June 2016 at 16:15:15 UTC, Patrick Schluter 
wrote:
What Joakim does not understand, is that there are huge, huge 
quantities of documents that are multi-lingual.


That should be obvious to anyone living outside the USA.


https://msdn.microsoft.com/th-th inside too :)


Re: [OT] The Case Against... Unicode?

2016-06-01 Thread Nick Sabalausky via Digitalmars-d

On 06/01/2016 12:26 PM, deadalnix wrote:

On Wednesday, 1 June 2016 at 16:15:15 UTC, Patrick Schluter wrote:

What Joakim does not understand, is that there are huge, huge
quantities of documents that are multi-lingual.


That should be obvious to anyone living outside the USA.



Or anyone in the USA who's ever touched a product that includes a manual 
or a safety warning, or gone to high school (a foreign language class is 
pretty much universally mandatory, even in the US).




Re: [OT] The Case Against... Unicode?

2016-06-01 Thread Kagamin via Digitalmars-d

On Wednesday, 1 June 2016 at 15:02:33 UTC, Wyatt wrote:
If you have to deal with delivering the fastest possible i18n 
at GSM data rates, well, that's a tough problem and it sounds 
like you might need to do something pretty special. Turning the 
entire ecosystem into your special case is not the answer.


UTF-8 encoded SMS work fine for me in GSM network, didn't notice 
any problem.


Re: [OT] The Case Against... Unicode?

2016-06-01 Thread deadalnix via Digitalmars-d

On Wednesday, 1 June 2016 at 16:15:15 UTC, Patrick Schluter wrote:
What Joakim does not understand, is that there are huge, huge 
quantities of documents that are multi-lingual.


That should be obvious to anyone living outside the USA.



Re: [OT] The Case Against... Unicode?

2016-06-01 Thread Patrick Schluter via Digitalmars-d

On Wednesday, 1 June 2016 at 15:02:33 UTC, Wyatt wrote:

On Wednesday, 1 June 2016 at 13:57:27 UTC, Joakim wrote:


No, I explicitly said not the web in a subsequent post.  The 
ignorance here of what 2G speeds are like is mind-boggling.


It's not hard.  I think a lot of us remember when a 14.4 modem 
was cutting-edge.  Codepages and incompatible encodings were 
terrible then, too.


Never again.

Well, when you _like_ a ludicrous encoding like UTF-8, not 
sure your opinion matters.


It _is_ kind of ludicrous, isn't it?  But it really is the 
least-bad option for the most text.  Sorry, bub.


No. The common string-handling use case is code that is 
unaware which script (not language, btw) your text is in.


Lol, this may be the dumbest argument put forth yet.


This just makes it feel like you're trolling.  You're not just 
trolling, right?


I don't think anyone here even understands what a good 
encoding is and what it's for, which is why there's no point 
in debating this.


And I don't think you realise how backwards you sound to people 
who had to live through the character encoding hell of the 
past.  This has been an ongoing headache for the better part of 
a century (it still comes up in old files, sites, and systems) 
and you're literally the only person I've ever seen seriously 
suggest we turn back now that the madness has been somewhat 
tamed.


Indeed, Joakim's proposal is so insane it beggars belief (why not 
go back to baudot encoding, it's only 5 bit, hurray, it's so much 
faster when used with flag semaphores).


As a programmer in the European Commission translation unit, 
working on the probably biggest translation memory in the world 
for 14 years, I can attest that Unicode is a blessing. When I 
remember the shit we had in our documents because of the code 
pages before most programs could handle utf-8 or utf-16 (and 
before 2004 we only had 2 alphabets to take care of, Western and 
Greek). What Joakim does not understand, is that there are huge, 
huge quantities of documents that are multi-lingual. Translators 
of course handle nearly exclusively with at least bi-lingual 
documents. Any document encountered by a translator must at least 
be able to present the source and the target language. But even 
outside of that specific population, multilingual documents are 
very, very common.




If you have to deal with delivering the fastest possible i18n 
at GSM data rates, well, that's a tough problem and it sounds 
like you might need to do something pretty special. Turning the 
entire ecosystem into your special case is not the answer.








[OT] The Case Against... Unicode?

2016-06-01 Thread Wyatt via Digitalmars-d

On Wednesday, 1 June 2016 at 13:57:27 UTC, Joakim wrote:


No, I explicitly said not the web in a subsequent post.  The 
ignorance here of what 2G speeds are like is mind-boggling.


It's not hard.  I think a lot of us remember when a 14.4 modem 
was cutting-edge.  Codepages and incompatible encodings were 
terrible then, too.


Never again.

Well, when you _like_ a ludicrous encoding like UTF-8, not sure 
your opinion matters.


It _is_ kind of ludicrous, isn't it?  But it really is the 
least-bad option for the most text.  Sorry, bub.


No. The common string-handling use case is code that is 
unaware which script (not language, btw) your text is in.


Lol, this may be the dumbest argument put forth yet.


This just makes it feel like you're trolling.  You're not just 
trolling, right?


I don't think anyone here even understands what a good encoding 
is and what it's for, which is why there's no point in debating 
this.


And I don't think you realise how backwards you sound to people 
who had to live through the character encoding hell of the past.  
This has been an ongoing headache for the better part of a 
century (it still comes up in old files, sites, and systems) and 
you're literally the only person I've ever seen seriously suggest 
we turn back now that the madness has been somewhat tamed.


If you have to deal with delivering the fastest possible i18n at 
GSM data rates, well, that's a tough problem and it sounds like 
you might need to do something pretty special. Turning the entire 
ecosystem into your special case is not the answer.


-Wyatt