I have a site with a large number of static pages I'd like to search, but
these static pages are passed through a template system before they're
served.  Therefore, the URLs take the form of:

http://www.site.com/template.htm?file=foo.htm&action=this&session=xxx...x

The file and action are unique for each of the static pages, but the
session id varies drastically.  htdig has no problem indexing the
site, but it does end up with multiple listings for a single page as the
session variable changes.

Is there any way to get htdig to ignore/discard the part of the url
beginning at the "&session="?  Note that I don't want to exclude urls
with a "&session", just drop that portion of the url.  Is this possible?
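
To make the transformation I'm after concrete, here is a rough sketch in
Python.  It is not htdig configuration, only an illustration of the
rewrite I'd like applied to each url before it is stored; the function
name strip_session and the sample session value are made up for the
example, and it assumes the parameter is always literally named "session".

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode


def strip_session(url, param="session"):
    """Return url with the given query parameter removed (hypothetical helper)."""
    parts = urlsplit(url)
    # Keep every query pair except the session id, preserving order.
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != param
    ]
    return urlunsplit(parts._replace(query=urlencode(kept)))


if __name__ == "__main__":
    # "abc123" is a made-up session id used only for this example.
    print(strip_session(
        "http://www.site.com/template.htm?file=foo.htm&action=this&session=abc123"
    ))
```

This removes only the session pair rather than truncating everything after
it, so two urls that differ only in session id collapse to the same url.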

Thanks,
mth.



