is it possible to use a program to get all the text only of
the html? as if I open the html with a browser, then click
ctrl+a and then copy paste all the selected text
I need to do this in batch
can lynx do it?
Sounds like you're reaching for the "-dump" parameter that Lynx
supports, as described in the man-page:
lynx -dump http://www.example.com
This can then be automated via a script, or you may be able to
use the '-crawl' parameter in conjunction with -dump to walk a
site. I didn't see anything in my man-page to limit link
recursion-depth as wget offers.
If you don't want the link-lists, you can use the -nolist
parameter as well.
-tim
_______________________________________________
Lynx-dev mailing list
[email protected]
http://lists.nongnu.org/mailman/listinfo/lynx-dev