[PHP] strip tags but preserve title attributes

2009-12-14 Thread Ashley Sheridan
I'm looking for a way to strip HTML tags out of some text content (sourced from a web page) to leave just the text which I'll be running some basic analysis on. The thing is, I want to preserve text that is in alt and title attributes. I can't use any DOM functions, as I can't guarantee that the co

Re: [PHP] strip tags but preserve title attributes

2009-12-15 Thread Andrew Ballard
On Mon, Dec 14, 2009 at 6:43 PM, Ashley Sheridan wrote: > I'm looking for a way to strip HTML tags out of some text content > (sourced from a web page) to leave just the text which I'll be running > some basic analysis on. The thing is, I want to preserve text that is in > alt and title attributes

Re: [PHP] strip tags but preserve title attributes

2009-12-15 Thread Wouter van Vliet / Interpotential
I've had quite some luck using the html2text class by Jon Abernathy http://www.chuggnutt.com/html2text.php It's targetted to php 4, and rather old code - but it does the job for me. Where the 'job for me' is converting html to text for when I'm sending out emails in HTML format and want to off

Re: [PHP] strip tags but preserve title attributes

2009-12-15 Thread Brady Mitchell
On Tue, Dec 15, 2009 at 6:44 AM, Wouter van Vliet / Interpotential wrote: > And if that doesn't suit your needs - you might want to take a look at this: > >    http://sourceforge.net/projects/simplehtmldom/ +1 I've never used the html2text library, but simplehtmldom is very easy to use and has wo