Non-ASCII data in POD

2012-04-25 Thread Grant McLean
Hi POD people There's been a discussion on #metacpan about non-ASCII characters in POD being rendered incorrectly on the metacpan.org web site. The short story is that some people use utf8 characters without including: =encoding utf8. Apparently the metacpan tool chain assumes latin1 encoding, b

Re: Non-ASCII data in POD

2012-04-25 Thread Russ Allbery
Grant McLean writes: > My thoughts on the second issue are that we could modify Pod::Simple to > 'whine' if it sees non-ASCII bytes but no =encoding. This in turn would > cause Test::Pod to pick up the error and help people fix it. I would be in favor of that. -- Russ Allbery (r...@stanford.e

Re: Non-ASCII data in POD

2012-04-26 Thread Karl Williamson
On 04/25/2012 09:25 PM, Russ Allbery wrote: Grant McLean writes: My thoughts on the second issue are that we could modify Pod::Simple to 'whine' if it sees non-ASCII bytes but no =encoding. This in turn would cause Test::Pod to pick up the error and help people fix it. I would be in favor o

Re: Non-ASCII data in POD

2012-04-27 Thread Grant McLean
On Thu, 2012-04-26 at 15:23 +1200, Grant McLean wrote: Hi POD people > > There's been a discussion on #metacpan about non-ASCII characters in POD > being rendered incorrectly on the metacpan.org web site. > > The short story is that some people use utf8 characters without > including: =encoding u

Re: Non-ASCII data in POD

2012-04-27 Thread David E. Wheeler
On Apr 27, 2012, at 12:10 AM, Grant McLean wrote: > OK, so I went ahead and implemented both the warning and the heuristic > to guess Latin-1 vs UTF-8 (only when no encoding was specified). The > resulting patch is here: > > https://github.com/theory/pod-simple/pull/26 I like this, but wonder

Re: Non-ASCII data in POD

2012-04-27 Thread Grant McLean
On Fri, 2012-04-27 at 09:17 -0700, David E. Wheeler wrote: > On Apr 27, 2012, at 12:10 AM, Grant McLean wrote: > > > OK, so I went ahead and implemented both the warning and the heuristic > > to guess Latin-1 vs UTF-8 (only when no encoding was specified). The > > resulting patch is here: > > >

Re: Non-ASCII data in POD

2012-04-27 Thread David E. Wheeler
On Apr 27, 2012, at 12:54 PM, Grant McLean wrote: > I'm not sure I quite understand what you're saying. The first time a > non-ASCII byte is encountered, the code will 'fire' and apply the > heuristic to set an encoding. Once the encoding is set, the code won't > be called again. Oh, perfect. I

Re: Non-ASCII data in POD

2012-05-03 Thread Johan Vromans
Grant McLean writes: > OK, so I went ahead and implemented both the warning and the heuristic > to guess Latin-1 vs UTF-8 (only when no encoding was specified). The > resulting patch is here: > > https://github.com/theory/pod-simple/pull/26 This patch enforces authors to add an "=encoding UTF

Re: Non-ASCII data in POD

2012-05-03 Thread Grant McLean
On Mon, 2012-04-30 at 14:24 +0200, Johan Vromans wrote: > Grant McLean writes: > > > OK, so I went ahead and implemented both the warning and the heuristic > > to guess Latin-1 vs UTF-8 (only when no encoding was specified). The > > resulting patch is here: > > > > https://github.com/theory/po