According to Phil Glatz: > I'm using last Sunday's 3.2b4 build. I can't get start_url to work on more > than one URL. > > i.e., if I set > start_url: http://www.foo.com/page1.html \ > http://www.foo.com/page2.html > > and turn debugging to -vv, I see only page1.html getting indexed > > > According to the help page, "This is the list of URLs that will be used to > start a dig when there was no existing database. Note that multiple URLs > can be given here." > > This implies what I'm doing *should* work.
Yes, as long as you don't have a space after the backslash, as Geoff suggested. > What does the part about "no existing database" mean? Does this mean > multiple URLs might work with no database, but wouldn't on a subsequent dig > pass? > > I tried removing the contents of the db directory, still didn't work. No, it means that if you have existing databases, the start_url value isn't actually needed anymore, because htdig will prime the queue with all the URLs in the database, so it will revisit every document it's already indexed. In this case, however, you can still use start_url to add more URLs to the existing database. -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

