On 28/01/14 19:00, Chris Bannister wrote: > On Tue, Jan 28, 2014 at 12:18:56PM +1100, Scott Ferguson wrote: >> >> and if it exists, scrape the page for the license e.g.:- >> $ curl -s http://en.wikipedia.org/wiki/Viber|html2text|grep License|head -n1 > > That will pick up bad grammar, maybe better is: > curl -s http://en.wikipedia.org/wiki/Viber|html2text|grepLicen\[sc\]e|head -n1 > > Licence - is the noun > License - is the verb
Are you serious? Grammar (butter for those without bread).... how old are you? :) > > Just wondering why without sending it through html2text it returns: > <th scope="row" style="text-align:left;white-space: nowrap;;"><a > href="/wiki/Software_license" title="Software license">License</a></th> > > That's why I convert the page to text, so it works on all wikipedia software pages *as* they are, not re-imagined to suit pendantic semantics. ;) Kind regards. -- To UNSUBSCRIBE, email to debian-user-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org Archive: http://lists.debian.org/52e76699.8080...@gmail.com