On 14/8/04 1:17 pm, "Robin Taylor" <[EMAIL PROTECTED]> wrote:
> Hi. I just can't get this right. I tried JPlucker. I kind of need to know how
> many levels to go ... 0 gives me just the front page, 2 gives me hundreds. Is
> there any way to guess the right number of levels?
>
> If that program is not allowed to be discussed here, I understand. If that is
> the case, can someone please give me a lead-in to the parser program so that I
> can grab a couple of websites?

Not really; trial and error is usually the way to go. The settings to try are:

- Max. Depth (usually 2, and very rarely more);
- Restrict to (usually 'host', but may be the less restrictive 'domain').

To restrict the pages further, you have to start adding patterns to the URL
Inclusion and URL Exclusion boxes. I have never had to do this, but it may be
necessary in some cases where there is a big jump (level = 1 plucks a few
pages, level = 2 plucks hundreds).

Alastair

_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list
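
To illustrate why one extra level can turn a few pages into hundreds, and how the host restriction and an exclusion pattern rein that in, here is a minimal Python sketch. It is not JPlucker's or the Plucker parser's actual code: `pages_gathered`, `same_scope`, and `SITE` are hypothetical names, the example.com link graph is made up, and the 'domain' check is a crude assumption (it just compares the last two labels of the host name).

```python
import re
from collections import deque
from urllib.parse import urlparse


def same_scope(url, start_url, restrict_to):
    """Return True if url falls within the 'host' or 'domain' of start_url."""
    host = urlparse(url).hostname or ""
    start_host = urlparse(start_url).hostname or ""
    if restrict_to == "host":
        return host == start_host
    # 'domain': assumed approximation, compare the last two host labels
    return host.split(".")[-2:] == start_host.split(".")[-2:]


def pages_gathered(links, start_url, max_depth, restrict_to="host", exclude=()):
    """Breadth-first walk over a link graph; returns the set of pages kept."""
    patterns = [re.compile(p) for p in exclude]
    seen = {start_url}
    queue = deque([(start_url, 0)])
    while queue:
        url, depth = queue.popleft()
        if depth == max_depth:
            continue  # pages at the depth limit are kept but not followed
        for target in links.get(url, []):
            if target in seen:
                continue
            if not same_scope(target, start_url, restrict_to):
                continue  # outside the 'Restrict to' setting
            if any(p.search(target) for p in patterns):
                continue  # matches an exclusion pattern
            seen.add(target)
            queue.append((target, depth + 1))
    return seen


# Made-up site: the front page links to a few pages, one of which is an
# archive index linking to 200 more.
SITE = {
    "http://www.example.com/": [
        "http://www.example.com/news",
        "http://www.example.com/archive",
        "http://forum.example.com/",
        "http://other.example.org/",
    ],
    "http://www.example.com/archive": [
        "http://www.example.com/archive/%d" % n for n in range(200)
    ],
}

for depth in (0, 1, 2):
    kept = pages_gathered(SITE, "http://www.example.com/", depth)
    print(depth, len(kept))
```

In this made-up site, depth 1 keeps only a handful of pages while depth 2 pulls in the whole archive. Passing something like `exclude=[r"/archive/\d+"]` brings depth 2 back down to the depth 1 set, which is the kind of pattern the exclusion box is for.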

