> Suppose you have a book with a ToC, and each of the N chapters is a
> separate page which links to the next/prev chapter.

        So you have:

        [TOC.html]
        |       |
        |       |
        [ch1]   [ch2]
        /\      /\
        p n     p n

        Now where is the chapter itself? Where is the content?

> Instead of starting at the ToC, you want to start at Chapter 1.
> Link-depth = 1, spider-from = ToC.html, start-page = chapter_1.html would
> accomplish this.  If the chapters don't link to the ToC, you'd need to
> have link-depth = n-1....

        This still doesn't make sense to me. How can you start at Chapter 1
(assuming you want to read the content, not the empty page with two links on
it; previous/next), if you start spidering from the TOC, 3 levels above it?

        In a real world example, let's say I want to spider news.bbc.co.uk's
news articles. How would I point the 'spider-from' value to www.cnn.com,
which at 2-levels deep, points to news.bbc.co.uk, but exists "above" your
initial point of spidering penetration?


                [cnn.com]               # "TOC" in your example
                   |
                [links.html]            # previous/next page in your example
                   |
start ->        [news.bbc.co.uk]        # Chapter data
                   |
                [news_articles]         # Sub-chapter data

        Can you give me an example of how this could be useful, because I'm
afraid I'm missing the entire concept. You can't spider something that isn't
linked from anywhere in your structure, and somehow have it included in your
final document.

> Personally, I don't need a feature like this for my documents, but I'm
> just imagining a situation where it might be wanted.

        Can you give me a doctored up .pdb that exhibits this?


d.

_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list

Reply via email to