From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of greenough, dave
Sent: Wednesday, November 01, 2006 5:12 PM
To: '[email protected]'
Subject: [htdig] Problem indexing shtml files
I am trying to setup htdig to search our site, and have run into a problem. Most pages on our site are .shtml files as they use server side includes to include common graphics, menu structure, sidebars on each page. I would like to only index by keywords and descriptions and would like to replace the normal excerpt with the meta description from the file.
Here is where the problem occurs, if I set all of the index factors to 0 except title, keywords, and description nothing gets indexed. If I set the text_factor to something above 0 then all of the files get indexed but the use_meta_description does not work.
If I rename the files to .html files everything indexes fine and the use_meta_description works like a charm. Ofcourse by doing this none of my pages would display properly.
Is there a way around this outside of renaming the files to .html and turning on the xbithack?
I am using version 3.1.6 of htdig (provided by an isp).
I have read some information in the mailing list archive etc but I could not find anything specific to this issue. To make sure pages would get indexed I created an html file with links to most of the pages on the site (as a lot of the page links are in the includes and have some _javascript_ with them). The start url's in the config file are all http:/ addresses as there was something mentioning that file based searches had issues with .shtml files without changing some of the htdig code.
Thank you,
Dave.
[EMAIL PROTECTED]
************************
This email and any attachments may contain confidential and privileged information. If you are not the intended recipient, please notify the sender immediately by return e-mail, delete this e-mail and destroy any copies. Any dissemination or use of this information by a person other than the intended recipient is unauthorized and may be illegal. Unless otherwise stated, opinions expressed in this e-mail are those of the author and are not endorsed by the author's employer.
------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________ ht://Dig general mailing list: <[email protected]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

