According to Alex Rousskov:
>       I need to index a site that has a few .htaccess protected
> directories. I do not want protected directories to be indexed. I want
> to index that site via local file system only. I do not want to have a
> static list of directories that htdig must avoid. Instead, I simply
> want htdig to exclude any directory that has .htaccess file in it.
> 
>       I tried to find a solution in the archives, but it seems like
> my problem is unique. Would I have to code the solution?

Yes, this does seem to be a rather unique problem, as far as I recall,
but not that different from other problems.  I'd recommend using a
find command, similar to the one in http://www.htdig.org/FAQ.html#q5.25,
to find all directories that have a .htaccess file and make URLs out of
the directory names.  Put these in a file, and use that file for your
exclude_urls attribute (rather than start_url as in the example).  The
whole process can be automated with a script which runs this before
running htdig and htmerge.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to