TT and UTF8?

2010-01-29 Thread Dave Hodgkinson
Anyone had issues with TT and UTF8? sheriff and theorbtwo have got me a long way down the line but... I have a string which is_utf8() and contains weird characters. I restart apache and Mötorhead displays fine. Next time through it's Mot�rhead. Printing unpack(H*) shows the right octets in the

Re: TT and UTF8?

2010-01-29 Thread Ash Berlin
On 29 Jan 2010, at 14:30, Dave Hodgkinson wrote: > > Anyone had issues with TT and UTF8? > > sheriff and theorbtwo have got me a long way down the line but... > > I have a string which is_utf8() and contains weird characters. I > restart apache and Mötorhead displays fine. Next time through i

Re: TT and UTF8?

2010-01-29 Thread Dave Hodgkinson
On 29 Jan 2010, at 14:48, Ash Berlin wrote: > 2) stick a BOM in the .tt file > BOM? -- Dave HodgkinsonMSN: daveh...@hotmail.com Site: http://www.davehodgkinson.com UK: +44 7768 490620 Blog: http://www.davehodgkinson.com/blog Photos: http://www.

Re: TT and UTF8?

2010-01-29 Thread David Dorward
On 29 Jan 2010, at 15:25, Dave Hodgkinson wrote: > > On 29 Jan 2010, at 14:48, Ash Berlin wrote: > >> 2) stick a BOM in the .tt file >> > > BOM? Byte Order Mark. It signals the signal the endianness of the data. -- David Dorward http://dorward.me.uk

Re: TT and UTF8?

2010-01-29 Thread David Cantrell
On Fri, Jan 29, 2010 at 03:25:14PM +, Dave Hodgkinson wrote: > On 29 Jan 2010, at 14:48, Ash Berlin wrote: > > 2) stick a BOM in the .tt file > BOM? Byte Order Mark http://en.wikipedia.org/wiki/Byte_order_mark -- David Cantrell | Cake Smuggler Extraordinaire Immigration: making Britain

Re: TT and UTF8?

2010-01-29 Thread Joel Bernstein
On 29 January 2010 15:25, Dave Hodgkinson wrote: > > On 29 Jan 2010, at 14:48, Ash Berlin wrote: > >> 2) stick a BOM in the .tt file >> > > BOM? U+FEFF - unicode codepoint used to indicate endianness in encodings where word length is not single octet multiples. http://en.wikipedia.org/wiki/Byte_

Re: TT and UTF8?

2010-01-29 Thread David Precious
Dave Hodgkinson wrote: On 29 Jan 2010, at 14:48, Ash Berlin wrote: 2) stick a BOM in the .tt file BOM? Byte-Order Mark - http://en.wikipedia.org/wiki/Byte_order_mark

Re: TT and UTF8?

2010-01-29 Thread Matt Lawrence
Joel Bernstein wrote: On 29 January 2010 15:25, Dave Hodgkinson wrote: On 29 Jan 2010, at 14:48, Ash Berlin wrote: 2) stick a BOM in the .tt file BOM? U+FEFF - unicode codepoint used to indicate endianness in encodings where word length is not single octet multiples.

Re: TT and UTF8?

2010-01-29 Thread Joel Bernstein
On 29 January 2010 16:59, Matt Lawrence wrote: > Joel Bernstein wrote: >> >> On 29 January 2010 15:25, Dave Hodgkinson wrote: >> >>> >>> On 29 Jan 2010, at 14:48, Ash Berlin wrote: >>> >>> 2) stick a BOM in the .tt file >>> >>> BOM? >>> >> >> U+FEFF - unicode codepoint used to

Re: TT and UTF8?

2010-01-29 Thread Matt Lawrence
Joel Bernstein wrote: On 29 January 2010 16:59, Matt Lawrence wrote: Joel Bernstein wrote: On 29 January 2010 15:25, Dave Hodgkinson wrote: IIRC, you can say ":set bomb" in vim to do this. Someone set up us the &^&^!^ytNO CARRIER I first encountered BOMs when dealin

Re: TT and UTF8?

2010-01-29 Thread Dave Cross
On 01/29/2010 02:30 PM, Dave Hodgkinson wrote: Anyone had issues with TT and UTF8? sheriff and theorbtwo have got me a long way down the line but... I have a string which is_utf8() and contains weird characters. I restart apache and Mötorhead displays fine. Next time through it's Mot�rhead. Pr

Re: TT and UTF8?

2010-01-29 Thread Dave Cross
On 01/29/2010 07:07 PM, Dave Cross wrote: Templates that contain UTF-9 These Template are one more encoded!

Re: TT and UTF8?

2010-01-29 Thread Peter Edwards
On 29 January 2010 19:20, Dave Cross wrote: > On 01/29/2010 07:07 PM, Dave Cross wrote: > >> Templates that contain UTF-9 >> > > These Template are one more encoded! > > /me waits for inevitable UTF-11 gag. Oh! Too late.

Re: TT and UTF8?

2010-01-29 Thread Nicholas Clark
On Fri, Jan 29, 2010 at 08:12:15PM +, Peter Edwards wrote: > On 29 January 2010 19:20, Dave Cross wrote: > > > On 01/29/2010 07:07 PM, Dave Cross wrote: > > > >> Templates that contain UTF-9 > >> > > > > These Template are one more encoded! > > > > > /me waits for inevitable UTF-11 gag. Oh! T

Re: TT and UTF8?

2010-01-29 Thread Dave Hodgkinson
On 29 Jan 2010, at 19:07, Dave Cross wrote: > > There's a Perlanet fork that has a hack for dealing correctly with Templates > that contain UTF-9 whether or not they contain a BOM. > > http://github.com/kappa/perlanet/blob/master/lib/Perlanet.pm > > It does it by overriding the Template::Provi