According to Phil Glatz:
> I'm using last Sunday's 3.2b4 build.  I can't get start_url to work on more 
> than one URL.
> 
> i.e., if I set
> start_url:   http://www.foo.com/page1.html  \
>               http://www.foo.com/page2.html
> 
> and turn debugging to -vv, I see only page1.html getting indexed
> 
> 
> According to the help page, "This is the list of URLs that will be used to 
> start a dig when there was no existing database. Note that multiple URLs 
> can be given here."
> 
> This implies what I'm doing *should* work.

Yes, as long as you don't have a space after the backslash, as Geoff
suggested.

> What does the part about "no existing database" mean?  Does this mean 
> multiple URLs might work with no database, but wouldn't on a subsequent dig 
> pass?
> 
> I tried removing the contents of the db directory, still didn't work.

No, it means that if you have existing databases, the start_url value
isn't actually needed anymore, because htdig will prime the queue with
all the URLs in the database, so it will revisit every document it's already
indexed.  In this case, however, you can still use start_url to add more
URLs to the existing database.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to