Thanks for the clarification -- man-page versus authorative info-manual.
If you allow me to give you some feedback:
I honestly overread the information in the manpage, as I thought the
bottom paragraphs are only some examples and special cases.
I think the fact, that --page-requisites internall
not al, but in this particular case I pretty sure they have
On Thu, Jun 19, 2008 at 10:42 AM, Tony Lewis <[EMAIL PROTECTED]> wrote:
> mm w wrote:
>
>> a simple url-rewriting conf should fix the problem, wihout touch the file
>> system
>> everything can be done server side
>
> Why do you assume th
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Stefan Nowak wrote:
> Does --page-requisites load content from other hosts as well, or must I
> explicitly issue a --span-hosts with it?
>
> The manpage unambiguously says about --span-hosts "Enable spanning
> across hosts when doing recursive retriev
mm w wrote:
> a simple url-rewriting conf should fix the problem, wihout touch the file
> system
> everything can be done server side
Why do you assume the user of wget has any control over the server from which
content is being downloaded?
without touching the file system
On Thu, Jun 19, 2008 at 9:23 AM, mm w <[EMAIL PROTECTED]> wrote:
> a simple url-rewriting conf should fix the problem, wihout touch the file
> system
> everything can be done server side
>
> Best Regards
>
> On Thu, Jun 19, 2008 at 6:29 AM, Coombe, Allan David (DP
a simple url-rewriting conf should fix the problem, wihout touch the file system
everything can be done server side
Best Regards
On Thu, Jun 19, 2008 at 6:29 AM, Coombe, Allan David (DPS)
<[EMAIL PROTECTED]> wrote:
> Thanks averyone for the contributions.
>
> Ultimately, our purpose is to process
Does --page-requisites load content from other hosts as well, or must
I explicitly issue a --span-hosts with it?
The manpage unambiguously says about --span-hosts "Enable spanning
across hosts when doing recursive retrieving", but at the --span-
hosts section it does not mention whether wget
Thanks averyone for the contributions.
Ultimately, our purpose is to process documents from the site into our
search database, so probably the most important thing is to limit the
number of files being processed. The case of the URLs in the html
probably wouldn't cause us much concern, but I cou