On Tue, January 14, 2014 1:31 pm, Jiří Baum wrote:

> Ah, skip the lynx step. Just work with the html directly.
> All the tools (sed, awk, grep) can work directly with html.
> To some extent it depends on how variable the original page is.

> Once you skip the lynx step, you might even find that it's trivial. For
> instance, the values you want might all be in a tag with a single, fixed
> "class" or "id"; you can just grep for that, then strip off the HTML
> with sed.
>
> This is especially likely if they're all from the same website,
> generated from a single template.

Jiří, thanks.

The page is reasonably constant (at least until they change something).

All the values I want are on a single 6000-character line. How do I break
that line into individual values? 'grep any_value file' gives me the
whole 6000 characters.
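
One possible approach, sketched under assumptions: grep matches whole
lines by default, which is why the entire 6000 characters come back. With
the -o option (available in GNU and BSD grep, though not required by
POSIX) it prints only the matched part, one match per output line, and sed
can then strip the tags as suggested above. The tag name, the class name
"price" and the sample line below are made up for illustration; the real
page will need its own pattern.

```shell
#!/bin/sh
# Sketch only: assumes each wanted value sits in <td class="price">...</td>.
# The tag and class name are hypothetical; adjust the pattern to the real page.
extract_values() {
    # -o prints each match on its own line instead of the whole input line
    grep -o '<td class="price">[^<]*</td>' |
    # remove anything that looks like a tag, leaving only the text
    sed -e 's/<[^>]*>//g'
}

# Hypothetical one-line input standing in for the 6000-character line
printf '%s\n' '<tr><td class="price">1.23</td><td class="name">foo</td><td class="price">4.56</td></tr>' |
    extract_values
```

With the sample line this prints 1.23 and 4.56 on separate lines. On a
saved copy of the real page it would be run as 'extract_values < file'.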

V


-- 
SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/
Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html
