Hi Bill,
Ok, so maybe I can blame JRoller ;-)
I'd still like to find a way to enter Unicode characters into the
title and body of an entry directly, though. From looking at the
source of other Roller blog pages and RSS feeds I can see that
they're getting UTF-8 encoding glyphs in there somehow... I'd really
rather not have to type HTML entity escapes by hand!
Unfortunately I'm a total newcomer to this, so there may be an
obvious way.
On a related note: Let's say I wanted to embed some code examples in
a blog entry or title, for example:
Enum <E extends Enum<E>>
The left & right angle brackets are HTML chars, so I'd have to escape
them. Doing it manually is a pain. If I want to put them in a title
then I get complaints from RSS validators... Is there any way to
embed HTML characters in a blog title that won't generate validation
warnings? I can live with JRoller not displaying things correctly ;-)
Thanks again,
Sean
On Apr 21, 2006, at 12:15 AM, Bill Tribley wrote:
Hi Sean,
It sounds like the jroller blog viewer is not properly set up to
handle true utf-8 encoding. When you put the character into HTML
then you are causing it to pass through jroller, your browser is
picking up on the utf-8, recognizing Chinese and displaying it, as
long as all the characters make it.
Bill
On Thu, 20 Apr 2006 23:11:26 -0400, Sean Montgomery wrote:
Greetings,
I use Safari under OS X 10.4.6 to access my Roller blog at
www.jroller.com
using, well, whatever version of Roller they're using today ;-)
I'd like to be able to use Unicode characters in the titles of my
blog
entries. If I use OS X's built in Chinese input method editor to
enter a
Chinese character in an entry title via the Edit Entry screen I'll
see the
correct character show up in the entry listing (to the right of
the Edit
Entry screen under Recent Entries) but all I get is a '?' when I
view the
blog.
If I try entering an HTML entity like "胖" into the title
then I see
those seven characters under Recent Entries, but I do see the
correct single
(Chinese) character when I view the blog. The correct character
also shows
up in the RSS feed when viewed via Safari. The trouble comes when
I try to
view the new entry via the front page of the JRoller website - it
displays
"胖" explicitly.
Sure, I could just blame JRoller ;-) Instead I pointed
feedvalidator.org at
my RSS feed and validated it - they flagged the "&#x80d6;" and
gave a
warning that the title should not contain HTML and that I
shouldn't be
surprised if some viewers strip the characters or leave them there
- like I
saw at JRoller.
I've seen Roller blogs that contain entries with titles containing
explicit
Unicode characters - I've check out their RSS source (using Safari's
View:View Source command). Their RSS feed source, like mine,
contains
charset="UTF-8", so that makes sense.
So what am I doing wrong? It looks like there's no way for me to
input
Unicode via the Chinese input method using the existing web- based
Roller
interface that JRoller has configured. Is it a configuration
issue? Or do I
need to use an alternative method of posting entries that uses the
Blogger or
MetaWeblog APIs?
I didn't find anything useful on the Roller user guides and wiki
about
this... Any suggestions on where to turn?
??!