On 12 September 2011 10:17, James Courtier-Dutton <james.dut...@gmail.com> wrote: > I want extract the "some url" bits. I.e. Remove the href. > You can probably do this quite easily in perl. > Are there any nice short programs to do this? > Is it easier to do in some other language? >
lynx -dump --hiddenlinks=ignore foo.html Will dump it to stdout in plain text form with URLs removed. Al. -- Please post to: Hampshire@mailman.lug.org.uk Web Interface: https://mailman.lug.org.uk/mailman/listinfo/hampshire LUG URL: http://www.hantslug.org.uk --------------------------------------------------------------