Hi John, *,

On Tue, Feb 05, 2008 at 07:34:50AM -0000, John McCreesh wrote:
> On the Marketing Project planet, http://marketing.openoffice.org/planet
> there are posts appearing with "non-displayable" characters. There's a
> good example in Ben Horsts's post of Feb 4th.which has lots of these
> "problem" characters.
> 
> Any suggestions please as to how I might correct them?

Define the charset in the generater to be UTF-8/set it to don't touch
and ask your peers to provide the feed in utf-8 only.

As there are various charsets, there is only hope when the charset is
specified somewhere, just guessing can break seriously. You cannot take
the page as it is and pass it through a converter. You would have to
pass the individual feeds through an appropriate filter/recoder.

As Ben's own site (and feed) is already UTF-8 and displays properly,
there is a misconfiguration in the aggregator "planetplanet" used (or a
bug), or the page is corrupted somewhere else in between.

Sum-up:
Hard/troublesome to clean up afterwards (need to identify the broken
parts manually and determine encoding manually), generating a clean page
in the first place should be easier/less work.

ciao
Christian
-- 
NP: Agressor - Someone To Eat

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to