TT and UTF8?

2010-01-29 Thread Dave Hodgkinson
Anyone had issues with TT and UTF8? sheriff and theorbtwo have got me a long way down the line but... I have a string which is_utf8() and contains weird characters. I restart apache and Mötorhead displays fine. Next time through it's Mot�rhead. Printing unpack(H*) shows the right octets in the

Re: TT and UTF8?

2010-01-29 Thread Ash Berlin
On 29 Jan 2010, at 14:30, Dave Hodgkinson wrote: Anyone had issues with TT and UTF8? sheriff and theorbtwo have got me a long way down the line but... I have a string which is_utf8() and contains weird characters. I restart apache and Mötorhead displays fine. Next time through it's

Re: TT and UTF8?

2010-01-29 Thread Dave Hodgkinson
On 29 Jan 2010, at 14:48, Ash Berlin wrote: 2) stick a BOM in the .tt file BOM? -- Dave HodgkinsonMSN: daveh...@hotmail.com Site: http://www.davehodgkinson.com UK: +44 7768 490620 Blog: http://www.davehodgkinson.com/blog Photos:

Re: TT and UTF8?

2010-01-29 Thread David Dorward
On 29 Jan 2010, at 15:25, Dave Hodgkinson wrote: On 29 Jan 2010, at 14:48, Ash Berlin wrote: 2) stick a BOM in the .tt file BOM? Byte Order Mark. It signals the signal the endianness of the data. -- David Dorward http://dorward.me.uk

Re: TT and UTF8?

2010-01-29 Thread David Cantrell
On Fri, Jan 29, 2010 at 03:25:14PM +, Dave Hodgkinson wrote: On 29 Jan 2010, at 14:48, Ash Berlin wrote: 2) stick a BOM in the .tt file BOM? Byte Order Mark http://en.wikipedia.org/wiki/Byte_order_mark -- David Cantrell | Cake Smuggler Extraordinaire Immigration: making Britain

Re: TT and UTF8?

2010-01-29 Thread Joel Bernstein
On 29 January 2010 15:25, Dave Hodgkinson daveh...@gmail.com wrote: On 29 Jan 2010, at 14:48, Ash Berlin wrote: 2) stick a BOM in the .tt file BOM? U+FEFF - unicode codepoint used to indicate endianness in encodings where word length is not single octet multiples.

Re: TT and UTF8?

2010-01-29 Thread David Precious
Dave Hodgkinson wrote: On 29 Jan 2010, at 14:48, Ash Berlin wrote: 2) stick a BOM in the .tt file BOM? Byte-Order Mark - http://en.wikipedia.org/wiki/Byte_order_mark

Re: TT and UTF8?

2010-01-29 Thread Matt Lawrence
Joel Bernstein wrote: On 29 January 2010 15:25, Dave Hodgkinson daveh...@gmail.com wrote: On 29 Jan 2010, at 14:48, Ash Berlin wrote: 2) stick a BOM in the .tt file BOM? U+FEFF - unicode codepoint used to indicate endianness in encodings where word length is not single

Re: TT and UTF8?

2010-01-29 Thread Joel Bernstein
On 29 January 2010 16:59, Matt Lawrence matt.lawre...@virgin.net wrote: Joel Bernstein wrote: On 29 January 2010 15:25, Dave Hodgkinson daveh...@gmail.com wrote: On 29 Jan 2010, at 14:48, Ash Berlin wrote: 2) stick a BOM in the .tt file BOM? U+FEFF - unicode codepoint used to

Re: TT and UTF8?

2010-01-29 Thread Matt Lawrence
Joel Bernstein wrote: On 29 January 2010 16:59, Matt Lawrence matt.lawre...@virgin.net wrote: Joel Bernstein wrote: On 29 January 2010 15:25, Dave Hodgkinson daveh...@gmail.com wrote: IIRC, you can say :set bomb in vim to do this. Someone set up us the ^^!^ytNO CARRIER

Re: TT and UTF8?

2010-01-29 Thread Dave Cross
On 01/29/2010 02:30 PM, Dave Hodgkinson wrote: Anyone had issues with TT and UTF8? sheriff and theorbtwo have got me a long way down the line but... I have a string which is_utf8() and contains weird characters. I restart apache and Mötorhead displays fine. Next time through it's Mot�rhead.

Re: TT and UTF8?

2010-01-29 Thread Dave Cross
On 01/29/2010 07:07 PM, Dave Cross wrote: Templates that contain UTF-9 These Template are one more encoded!

Re: TT and UTF8?

2010-01-29 Thread Peter Edwards
On 29 January 2010 19:20, Dave Cross d...@dave.org.uk wrote: On 01/29/2010 07:07 PM, Dave Cross wrote: Templates that contain UTF-9 These Template are one more encoded! /me waits for inevitable UTF-11 gag. Oh! Too late.

Re: TT and UTF8?

2010-01-29 Thread Nicholas Clark
On Fri, Jan 29, 2010 at 08:12:15PM +, Peter Edwards wrote: On 29 January 2010 19:20, Dave Cross d...@dave.org.uk wrote: On 01/29/2010 07:07 PM, Dave Cross wrote: Templates that contain UTF-9 These Template are one more encoded! /me waits for inevitable UTF-11 gag. Oh! Too

Re: TT and UTF8?

2010-01-29 Thread Dave Hodgkinson
On 29 Jan 2010, at 19:07, Dave Cross wrote: There's a Perlanet fork that has a hack for dealing correctly with Templates that contain UTF-9 whether or not they contain a BOM. http://github.com/kappa/perlanet/blob/master/lib/Perlanet.pm It does it by overriding the