According to Alex Rousskov:
> On Thu, 14 Jun 2001, Gilles Detillieux wrote:
> > Yes, this does seem to be a rather unique problem, as far as I
> > recall, but not that different from other problems. I'd recommend
> > using a find command, similar to the one in
> > http://www.htdig.org/FAQ.html#q5.25, to find all directories that
> > have a .htaccess file and make URLs out of the directory names.
> > Put these in a file, and use that file for your exclude_urls
> > attribute (rather than start_url as in the example). The whole
> > process can be automated with a script which runs this before
> > running htdig and htmerge.
>
> Yes, that would work OK, and I have considered it as a possible
> workaround.
>
> Ideally, since this is a "security" issue, I would prefer not to rely
> on an external preparation step: I do not want to end up with
> protected content displayed in search summaries when I (or somebody
> else!) forget to run the script first. Renaming the script into htdig
> would help, but this change might get lost when new htdig versions are
> installed. Etc, etc.
>
> I middle ground (and very general) solution would be to teach htdig to
> import the result of a program execution into the configuration file
> run-time. Something along these lines:
>
> exclude_urls: \
> [/mail-archive/] \
> `/usr/local/bin/myscript.sh` \
> ...
>
> ... where the command in the back-quoted string is executed by htdig
> and then substituted with the output of the executed command. Htdig
> should exit if the command fails (which is difficult to check for
> reliably, unfortunately). Htdig should at leat warn if the command
> produces no output.
Instead, why not just make a point of never running htdig directly?
Just set up a script that does the list generation, then runs htdig,
and finally htmerge or htpurge. Then, run this script whenever you need
to reindex. It's usually a good idea to run htdig from a script anyway,
so that TMPDIR is set appropriately, and so that htmerge or htpurge is
run afterward. If you set up your own script, you don't need to worry
about it being lost when installing a new version of htdig.
In ht://Dig, the left quote syntax is already used for loading attribute
values from a file, so changing it to run a command would cause a lot
of confusion.
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html