Hi John, *, On Tue, Feb 05, 2008 at 07:34:50AM -0000, John McCreesh wrote: > On the Marketing Project planet, http://marketing.openoffice.org/planet > there are posts appearing with "non-displayable" characters. There's a > good example in Ben Horsts's post of Feb 4th.which has lots of these > "problem" characters. > > Any suggestions please as to how I might correct them?
Define the charset in the generater to be UTF-8/set it to don't touch and ask your peers to provide the feed in utf-8 only. As there are various charsets, there is only hope when the charset is specified somewhere, just guessing can break seriously. You cannot take the page as it is and pass it through a converter. You would have to pass the individual feeds through an appropriate filter/recoder. As Ben's own site (and feed) is already UTF-8 and displays properly, there is a misconfiguration in the aggregator "planetplanet" used (or a bug), or the page is corrupted somewhere else in between. Sum-up: Hard/troublesome to clean up afterwards (need to identify the broken parts manually and determine encoding manually), generating a clean page in the first place should be easier/less work. ciao Christian -- NP: Agressor - Someone To Eat --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
