On 14/8/04 1:17 pm, "Robin Taylor" <[EMAIL PROTECTED]> wrote:

> Hi. I just can't get this right. I tried JPlucker. I kind of need to know how
> many levels to go ... 0 gives me just the front page, 2 gives me hundreds. Is
> there any level way to guess the right amount of levels?
>  
> If that program is not allowed to be discussed here, I understand. If that is
> the case, can someone please give me a lead-in the parser program so that I
> can grab a couple of websites?

There is not really; trial and error is usually the way to go. The
combinations to try are:

Max. Depth (very rarely more than 2, and usually 2);

Restrict to (usually 'host', but may be the less restrictive 'domain');

To further restrict the pages you have to start adding patterns to the URL
Inclusion and URL Exclusion boxes; I have never had to do this, but it may
be necessary in some cases where there is a big jump (level = 1 plucks a few
pages, level = 2 hundreds).

Alastair


_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list

Reply via email to