>>But when my boss came and asked for a non-German name, which we needed to use
as is, we could not figure out how to do it with the same encoding on the same page,
>>The other headache was converting our old CF5 database to send to another
company via XML.
These are all good reasons to switch to Unicode, and you made
Phillip B. Holmes wrote:
>>Care to quote the relevant passage?
>It's there. Read it again. Ever heard of surrogates?
I have read it. Several times. I can't find anything in that article to support
your assertion that there are parts of Unicode (or any range of 16-bit values)
that cannot be enco
Claude Schneegans wrote:
> >>claude's a long standing shoe thrower when it comes to unicode
>
> On the contrary, I have full respect for Unicode, when it is necessary.
> But I find it simply ridiculous to have all French characters using 2 or 3
> bytes in UTF-8
> when 8859 can do the job with only
>>claude's a long standing shoe thrower when it comes to unicode
On the contrary, I have full respect for Unicode, when it is necessary.
But I find it simply ridiculous to have all French characters using 2 or 3
bytes in UTF-8
when 8859 can do the job with only one byte.
I do not argue that a ha
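As a rough illustration of the byte counts under discussion, here is a minimal Java sketch (the class and method names are made up for this example). It suggests that accented French letters such as é take 2 bytes in UTF-8 against 1 byte in ISO-8859-1, while 3-byte sequences only show up for characters like the euro sign, which ISO-8859-1 does not contain at all.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, for illustration only.
class ByteCounts {
    // Number of bytes needed to store s in UTF-8.
    static int utf8Length(String s) {
        return s.getBytes(StandardCharsets.UTF_8).length;
    }

    // Number of bytes needed to store s in ISO-8859-1.
    // Characters outside ISO-8859-1 are replaced by '?' (one byte each).
    static int latin1Length(String s) {
        return s.getBytes(StandardCharsets.ISO_8859_1).length;
    }

    public static void main(String[] args) {
        String eAcute = "\u00e9"; // é, present in ISO-8859-1
        String euro = "\u20ac";   // euro sign, not present in ISO-8859-1

        System.out.println("e-acute: UTF-8=" + utf8Length(eAcute)
                + " ISO-8859-1=" + latin1Length(eAcute)); // UTF-8=2 ISO-8859-1=1
        System.out.println("euro:    UTF-8=" + utf8Length(euro)); // UTF-8=3
    }
}
```

For text that is mostly unaccented letters, the overhead is limited to the accented characters, since plain ASCII stays at one byte in both encodings.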
>>if you're using CFMX.
I'M NOT using CFMX. This issue is with CF5.
>>CF-5 rendered in UTF-7 by default.
>>We deal in 22 languages
I deal with only ONE European language, and I only need 8bit extended
ASCII: iso-8859-1
I have no problem with iso-8859-1 and there is no reason I need Unicode.
Phillip Holmes wrote:
> I've had issues with surrogates / ligatures in Asian languages.
for instance?
~|
Introducing the Fusion Authority Quarterly Update. 80 pages of hard-hitting,
up-to-date ColdFusion information by your peers,
11 PM
To: CF-Talk
Subject: Re: Convert UTF-8 to ISO-8859-1 under CF5
Phillip B. Holmes wrote:
> 8859-1 is a character set not an encoding (like UTF-8).
that's true but since cf's docs often use charset & encoding as the same
thing i guess it's to be expected that cf d
is coming from an nvarchar or ntext
field, the unicode is rendered correctly.
Please let me know.
--phillip
-Original Message-
From: Claude Schneegans [mailto:[EMAIL PROTECTED]
Sent: Friday, November 17, 2006 5:18 PM
To: CF-Talk
Subject: Re: Convert UTF-8 to ISO-8859-1 under CF5
>
Phillip B. Holmes wrote:
> 8859-1 is a character set not an encoding (like UTF-8).
that's true but since cf's docs often use charset & encoding as the same thing
i guess it's to be expected that cf developers will follow suit.
> For DBCS(double byte character sets), you'll need UTF-16 (double
Oğuz Demirkapı wrote:
> But in general I wonder what is your problem? Writing into DB? Showing
> a Unicode text on page?
claude's a long standing shoe thrower when it comes to unicode ;-)
> Unicode isn't actually a "double-byte character set".
Of course it isn't.
>2. Some characters in DBCS charsets cannot be rendered via UTF-8. period.
>Care to quote the relevant passage?
It's there. Read it again. Ever heard of surrogates?
You should read this too:
http://www.microsoft.co
Phillip B. Holmes wrote:
>1. Java being designed 10 years ago has nothing to do with the fact that
>Unicode is represented as hex internally.
No, it doesn't. But my point was that Unicode isn't actually a "double-byte
character set". The number of code points in the latest version of Unicode
exc
>>Yep. So, this is the wrong name for the attribute. There should be an
>>encoding attribute and a charset attribute.
Who cares? As far as the Internet is concerned, there is no difference.
The only problem will be when (and if) UTF-8 is used to represent
something other than Unicode,
which is not c
-Original Message-
From: Sixten Otto [mailto:[EMAIL PROTECTED]
Sent: Friday, November 17, 2006 3:18 PM
To: CF-Talk
Subject: Re: Convert UTF-8 to ISO-8859-1 under CF5
Phillip Holmes wrote:
>If you're using a character set that uses 7 bits for US ascii and the
>8th for special characters, that
> they all call it "charset"
Yep. So, this is the wrong name for the attribute. There should be an
encoding attribute and a charset attribute.
Allaire got this wrong and it should be corrected, IMHO.
There is no such thing as converting a character set from 8859-1 to UTF-8.
You're not slouthing 8
Phillip Holmes wrote:
>If you're using a character set that uses 7 bits for US ascii and the 8th
>for special characters, that's fine. But, if you're using a charset that
>utilizes the second byte, UTF-8 will not be a sufficient encoding and you'll
>have garbling on some characters.
With all due r
>>There is a custom tag that you can check out.
>>CF_CharsetConvert
With just a little tune-up, this tag has solved my problem. Thanks
for the hint.
--
___
REUSE CODE! Use custom tags;
See http://www.contentbox.com/claude/customtags/tagstore.cfm
(Please
P.s.
CFMX renders UTF-8 by default. I think that the CF page load can be switched
by using the cfcontent tag (in CF 5).
--phil
-Original Message-
From: Claude Schneegans [mailto:[EMAIL PROTECTED]
Sent: Friday, November 17, 2006 10:06 AM
To: CF-Talk
Subject: Convert UTF-8 to ISO-8859-1
Subject: Re: Convert UTF-8 to ISO-8859-1 under CF5
Phillip Holmes wrote:
>Think of UTF-8,16, etc as the telephone line and the charset as the
>conversation over it.
>For DBCS(double byte character sets), you'll need UTF-16 (double byte
>telephone line) to properly handle the chars
Phillip Holmes wrote:
>Think of UTF-8,16, etc as the telephone line and the charset as the
>conversation over it.
>For DBCS(double byte character sets), you'll need UTF-16 (double byte
>telephone line) to properly handle the charset.
Huh? If that were true, how would you ever use UTF-8 to encode U
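One quick way to test the claim is to round-trip text through UTF-8. The Java sketch below (class and method names are hypothetical) encodes a BMP string and a supplementary character to UTF-8 and decodes them again. Surrogate pairs are a detail of UTF-16; UTF-8 encodes the same code point directly as a 4-byte sequence.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, for illustration only.
class Utf8RoundTrip {
    // True if encoding s to UTF-8 and decoding it again yields the same string.
    static boolean roundTrips(String s) {
        byte[] bytes = s.getBytes(StandardCharsets.UTF_8);
        return new String(bytes, StandardCharsets.UTF_8).equals(s);
    }

    public static void main(String[] args) {
        // Three CJK characters from the Basic Multilingual Plane.
        String cjk = "\u65e5\u672c\u8a9e";
        // U+20000, a supplementary CJK ideograph stored as a surrogate pair in UTF-16.
        String supplementary = new String(Character.toChars(0x20000));

        System.out.println(roundTrips(cjk));           // true
        System.out.println(roundTrips(supplementary)); // true
        // Two UTF-16 code units (the surrogate pair) ...
        System.out.println(supplementary.length());    // 2
        // ... but a single 4-byte sequence in UTF-8.
        System.out.println(supplementary.getBytes(StandardCharsets.UTF_8).length); // 4
    }
}
```

Garbling of the kind described usually points to bytes being decoded with the wrong charset somewhere along the way, rather than to a character UTF-8 cannot represent.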
>>CF_CharsetConvert (Converts iso-8859-1 (latin-1) characters to utf-8
and back.)
Thanks, I will give it a try.
>>But in general I wonder what is your problem?
The problem is that the whole site is under CF5 and using iso-8859-1.
The text I want to import will be embedded inside a page in iso
>>8859-1 is a character set not an encoding (like UTF-8).
I know pretty well about UTF-8, iso-8859 and the difference between a
charset and an encoding.
But neither HTTP headers, nor META tags, nor MIME make the
distinction:
they all call it "charset".
Anyway, what I need is to convert
thing through cfhttp (which is kind of a hack).
Warmest Regards,
Phillip B. Holmes
http://www.phillipholmes.com
-Original Message-
From: Pete Freitag [mailto:[EMAIL PROTECTED]
Sent: Friday, November 17, 2006 11:12 AM
To: CF-Talk
Subject: Re: Convert UTF-8 to ISO-8859-1 under CF5
Hi C
Hi Claude,
You might be able to use java to do this...
http://java.sun.com/j2se/1.5.0/docs/api/java/lang/String.html#getBytes(java.lang.String)
I haven't tested this but something like this may work:
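This is not the snippet from the original message, only a minimal sketch of the idea behind the linked String API: decode the incoming bytes as UTF-8, then re-encode the resulting string as ISO-8859-1. The class and method names are hypothetical, and it assumes a JVM is available (Java 7 or later for StandardCharsets; the linked 1.5 documentation takes the charset name as a string instead).

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, for illustration only.
class Utf8ToLatin1 {
    // Decode the bytes as UTF-8, then re-encode the text as ISO-8859-1.
    // Characters with no ISO-8859-1 equivalent come out as '?'.
    static byte[] convert(byte[] utf8Bytes) {
        String text = new String(utf8Bytes, StandardCharsets.UTF_8);
        return text.getBytes(StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        byte[] utf8 = "caf\u00e9".getBytes(StandardCharsets.UTF_8); // 5 bytes
        byte[] latin1 = convert(utf8);                              // 4 bytes
        System.out.println(utf8.length + " -> " + latin1.length);   // 5 -> 4
    }
}
```

The conversion is lossy by design: anything outside ISO-8859-1 (the euro sign, typographic quotes, most non-Western text) is replaced, so it is worth checking the imported pages for such characters first.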
--
Pete Freitag
http://foundeo.com/ - ColdFusion Consulting & Products
http://petefreit
There is a custom tag that you can check out.
CF_CharsetConvert (Converts iso-8859-1 (latin-1) characters to utf-8 and
back.)
http://www.adobe.com/cfusion/exchange/index.cfm?view=sn131&extID=1003076
But in general I wonder what is your problem? Writing into DB? Showing
a Unicode text on page?
Hi,
I have a site on a CF5 server that imports some page by CFHTTP from
another site
under CFMX (not under my control).
Unfortunately, this site sends its content in UTF-8 format.
Is there any tool somewhere, usable under CF5, to convert UTF-8 text to
ISO-8859-1?
--
___