On 12 September 2011 10:54, James Courtier-Dutton
<james.dut...@gmail.com> wrote:
>> lynx -dump --hiddenlinks=ignore foo.html
>>
>> Will dump it to stdout in plain text form with URLs removed.
>>
>
> Sorry, I was not very clear.
> I wish to keep the "some url" bits, and get rid of all the "some junk" bits.
> I.e. I wish to keep the contents of the href only, and drop everything
> else, e.g. the href text itself.
> I wish to end up with a file listing all the urls.
>

Omit the '--hiddenlinks=ignore' option then. Lynx will list all the URLs as
numbered references at the end of the dump.
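
If you want only the href values and nothing else, a rough sketch without
lynx might look like the following. The function name extract_hrefs is made
up for illustration, and it assumes the hrefs are double-quoted and that your
grep supports -o (GNU and BSD grep do); it is not a real HTML parser.

```shell
#!/bin/sh
# Hypothetical helper (not part of lynx): print the value of every
# double-quoted href attribute in the HTML file named by $1, one per line.
# Assumption: attributes are written as href="..." with double quotes.
extract_hrefs() {
    # grep -o prints each match on its own line; -i also matches HREF="...".
    # The sed expressions strip the leading href=" and the trailing quote.
    grep -o -i 'href="[^"]*"' "$1" \
        | sed -e 's/^[Hh][Rr][Ee][Ff]="//' -e 's/"$//'
}

# Example use (foo.html as in the thread, urls.txt is an arbitrary name):
#   extract_hrefs foo.html > urls.txt
```

As far as I know, 'lynx -dump -listonly foo.html' also prints just the list
of links, though the entries are numbered, so you would still need to strip
the numbers.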

Al.

--
Please post to: Hampshire@mailman.lug.org.uk
Web Interface: https://mailman.lug.org.uk/mailman/listinfo/hampshire
LUG URL: http://www.hantslug.org.uk
--------------------------------------------------------------
