Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-18 Thread Claude Schneegans
>>But when my boss came and asked a non-German name which we need to use as is and could not figure out with same encoding in same page, >>The other headache was converting our old CF5 database to send another company via XML. These all all good reasons to switch for Unicode, and you made

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-18 Thread Sixten Otto
Phillip B. Holmes wrote: >>Care to quote the relevant passage? >Its there.. Read it again. Ever heard of surrogates? I have read it. Several times. I can't find anything in that article to support your assertion that there are parts of Unicode (or any range of 16-bit values) that cannot be enco

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Oğuz Demirkapı
Claude Schneegans wrote: > >>claude's a long standing shoe thrower when it comes to unicode > > On the contrary, I have a full respect fof Unicode, when it is necessary. > But I find simply ridiculous to have all French characters using 2 or 3 > bytes in UTF-8 > when 8859 can do the job with only

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Claude Schneegans
>>claude's a long standing shoe thrower when it comes to unicode On the contrary, I have a full respect fof Unicode, when it is necessary. But I find simply ridiculous to have all French characters using 2 or 3 bytes in UTF-8 when 8859 can do the job with only one byte. I do not argue that a ha

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Claude Schneegans
>>if you're using CFMX. I'M NOT using CFMX. This issue is with CF5. >>CF-5 rendered in UTF-7 by default. >>We deal in 22 languages I deal with only ONE European language, and I only need 8bit extended ASCII: iso-8859-1 I have no problem with iso-8859-1 and there is no reason I need Unicode.

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Paul Hastings
Phillip Holmes wrote: > I've had issue with surrogates / ligatures in Asian languages. for instance? ~| Introducing the Fusion Authority Quarterly Update. 80 pages of hard-hitting, up-to-date ColdFusion information by your peers,

RE: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Phillip Holmes
11 PM To: CF-Talk Subject: Re: Convert UTF-8 to ISO-8859-1 under CF5 Phillip B. Holmes wrote: > 8859-1 is a character set not an encoding (like UTF-8). that's true but since cf's docs often uses charset & encoding as the same thing i guess it's to be expected that cf d

RE: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Phillip B. Holmes
is coming from an nvarchar or ntext field, the unicode is rendered correctly. Please let me know. --phillip -Original Message- From: Claude Schneegans [mailto:[EMAIL PROTECTED] Sent: Friday, November 17, 2006 5:18 PM To: CF-Talk Subject: Re: Convert UTF-8 to ISO-8859-1 under CF5 >

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Paul Hastings
Phillip B. Holmes wrote: > 8859-1 is a character set not an encoding (like UTF-8). that's true but since cf's docs often uses charset & encoding as the same thing i guess it's to be expected that cf developers will follow suite. > For DBCS(double byte character sets), you'll need UTF-16 (double

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Paul Hastings
Oğuz Demirkapı wrote: > But in general I wonder what is your problem? Writing into DB? Showing > an Unicode text on page? claude's a long standing shoe thrower when it comes to unicode ;-) ~| Introducing the Fusion Authority Q

RE: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Phillip B. Holmes
> Unicode isn't actually a "double-byte character set". Of course it isnt. >2. Some characters in DBCS charsets cannot be rendered via UTF-8. period. >Care to quote the relevant passage? Its there.. Read it again. Ever heard of surrogates? You should read this too: http://www.microsoft.co

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Sixten Otto
Phillip B. Holmes wrote: >1. Java being designed 10 years ago has nothing to do with the fact that >Unicode is represented as hex internally. No, it doesn't. But my point was that Unicode isn't actually a "double-byte character set". The number of code points in the latest version of Unicode exc

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Claude Schneegans
>>Yep. So, this is the wrong name for the attribute. There should be an >>encoding attribute and a charset attribute. Who cares? As far as Internet is concerned, there is no difference. The only problem will be when (and if) UTF-8 is used to represent something else that Unicode, which is not c

RE: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Phillip B. Holmes
ginal Message- From: Sixten Otto [mailto:[EMAIL PROTECTED] Sent: Friday, November 17, 2006 3:18 PM To: CF-Talk Subject: Re: Convert UTF-8 to ISO-8859-1 under CF5 Phillip Holmes wrote: >If you're using a character set that uses 7 bits for US ascii and the >8th for special characters, that

RE: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Phillip B. Holmes
> they all call it "charset" Yep. So, this is the wrong name for the attribute. There should be an encoding attribute and a charset attribute. Allaire got this wrong and it should be corrected, IMHO. There is no such thing as converting a character set from 8859-1 to UTF-8. You're not slouthing 8

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Sixten Otto
Phillip Holmes wrote: >If you're using a character set that uses 7 bits for US ascii and the 8th >for special characters, that's fine. But, if you're using a charset that >utilizes the second byte, UTF8 will not be a suffiencent encoding and you'll >have garbling on some characters. With all due r

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Claude Schneegans
>>There is a custom tag that you can check out. >>CF_CharsetConvert With just some little tune up, this tag has solved my problem. Thanks for the hint. -- ___ REUSE CODE! Use custom tags; See http://www.contentbox.com/claude/customtags/tagstore.cfm (Please

RE: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Phillip B. Holmes
P.s. CFMX renders UTF-8 by default. I think that the CF page load can be switched by using the cfcontent tag (in CF 5). --phil -Original Message- From: Claude Schneegans [mailto:[EMAIL PROTECTED] Sent: Friday, November 17, 2006 10:06 AM To: CF-Talk Subject: Convert UTF-8 to ISO-8859-1

RE: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Phillip B. Holmes
ect: Re: Convert UTF-8 to ISO-8859-1 under CF5 Phillip Holmes wrote: >Think of UTF-8,16, etc as the telephone line and the charset as the >conversation over it. >For DBCS(double byte character sets), you'll need UTF-16 (double byte >telephone line) to properly handle the chars

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Sixten Otto
Phillip Holmes wrote: >Think of UTF-8,16, etc as the telephone line and the charset as the >conversation over it. >For DBCS(double byte character sets), you'll need UTF-16 (double byte >telephone line) to properly handle the charset. Huh? If that were true, how would you ever use UTF-8 to encode U

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Claude Schneegans
>>CF_CharsetConvert (Converts iso-8859-1 (latin-1) characters to utf-8 and back.) Thanks, I will give it a try. >>But in general I wonder what is your problem? The problem is that the whole site is under CF5 and using iso-88569-1 The text I want to import will be embedded inside a page in iso

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Claude Schneegans
>>8859-1 is a character set not an encoding (like UTF-8). I know pretty well about UTF-8, iso-8859 and the difference between a charset and an encoding. But neither HTTP headers, neither META tags, neither MIME make the distinction: they all call it "charset". Anyway, what I need is to convert

RE: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Phillip B. Holmes
thing through cfhttp (which is kind of a hack). Warmest Regards, Phillip B. Holmes http://www.phillipholmes.com -Original Message- From: Pete Freitag [mailto:[EMAIL PROTECTED] Sent: Friday, November 17, 2006 11:12 AM To: CF-Talk Subject: Re: Convert UTF-8 to ISO-8859-1 under CF5 Hi C

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Pete Freitag
Hi Claude, You might be able to use java to do this... http://java.sun.com/j2se/1.5.0/docs/api/java/lang/String.html#getBytes(java.lang.String) I haven't tested this but something like this may work: -- Pete Freitag http://foundeo.com/ - ColdFusion Consulting & Products http://petefreit

Re: Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Oğuz Demirkapı
There is a custom tag that you can check out. CF_CharsetConvert (Converts iso-8859-1 (latin-1) characters to utf-8 and back.) http://www.adobe.com/cfusion/exchange/index.cfm?view=sn131&extID=1003076 But in general I wonder what is your problem? Writing into DB? Showing an Unicode text on page?

Convert UTF-8 to ISO-8859-1 under CF5

2006-11-17 Thread Claude Schneegans
Hi, I have a site on a CF5 server that imports some page by CFHTTP from another site under CFMX (not under my control). Unfortunately, this site sends its content in UTF-8 format. Is there any tool somewhere, usable under CF5, to convert UTF-8 text to ISO-8859-1? -- ___