On Tue, January 14, 2014 1:31 pm, JiÅÃ Baum wrote: > Ah, skip the lynx step. Just work with the html directly. > All the tools (sed, awk, grep) can work directly with html. > To some extent it depends on how variable the original page is.
> Once you skip the lynx step, you might even find that it's trivial. For > instance, the values you want might all be in a tag with a single, fixed > "class" or "id"; you can just grep for that, then strip off the HTML > with sed. > > This is especially likely if they're all from the same website, > generated from a single template. Jiri, thanks the page is reasonably constant (until any fixes?) all the values I want are in a single 6000 char line, how do I break the 6000 char line into individual vaules, 'grep any_value file' gives me the whole 6000 chars ? V -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html